Qwen3Guard: Real-time Safety for Your Token Stream

Qwen Blog · Sep 22, 2025

Alibaba released Qwen3Guard, the first safety guardrail model in the Qwen family, fine-tuned from Qwen3 for prompt and response safety classification.

Categories: Model Releases, OSS & Tools

Excerpt

Tech Report GitHub Hugging Face ModelScope DISCORD Introduction We are excited to introduce Qwen3Guard, the first safety guardrail model in the Qwen family. Built upon the powerful Qwen3 foundation models and fine-tuned specifically for safety classificatoin, Qwen3Guard ensures responsible AI interactions by delivering precise safety detection for both prompts and responses, complete with risk levels and categorized classifications for accurate moderation. Qwen3Guard achieves state-of-the-art performance on major safety benchmarks, demonstrating strong capabilities in both prompt and response classification tasks across English, Chinese, and multilingual environments.

Read at source: https://qwenlm.github.io/blog/qwen3guard/