Qwen3Guard: Real-time Safety for Your Token Stream
Alibaba released Qwen3Guard, the first safety guardrail model in the Qwen family, fine-tuned from Qwen3 for prompt and response safety classification.
Excerpt
Tech Report GitHub Hugging Face ModelScope DISCORD
Introduction We are excited to introduce Qwen3Guard, the first safety guardrail model in the Qwen family. Built upon the powerful Qwen3 foundation models and fine-tuned specifically for safety classificatoin, Qwen3Guard ensures responsible AI interactions by delivering precise safety detection for both prompts and responses, complete with risk levels and categorized classifications for accurate moderation.
Qwen3Guard achieves state-of-the-art performance on major safety benchmarks, demonstrating strong capabilities in both prompt and response classification tasks across English, Chinese, and multilingual environments.
Read at source: https://qwenlm.github.io/blog/qwen3guard/