Virtue Guard
Advanced AI guardrail models providing real-time multimodal content moderation for enhanced AI safety and security
Complete Guardrail Suite
Text Guard - Lite
Fast and efficient text content moderation
Text Guard - Pro
Advanced text moderation with clear explanations
Image Guard
Visual content moderation and explanation
Video Guard
Comprehensive video content protection with explanations
Audio Guard
Real-time audio content safety moderation
Comprehensive Risk Categories
| Category | Description |
|---|---|
| S1 (Violent Crimes) | Content related to violent criminal activities |
| S2 (Non-Violent Crimes) | Content related to non-violent criminal activities |
| S3 (Sex-Related Crimes) | Content involving sexual crimes or exploitation |
| S4 (Child Sexual Exploitation) | Content related to exploitation of minors |
| S5 (Specialized Advice) | Potentially harmful specialized guidance or instructions |
| S6 (Privacy) | Content that may compromise personal privacy |
| S7 (Intellectual Property) | Content that violates intellectual property rights |
| S8 (Indiscriminate Weapons) | Content related to weapons of mass destruction |
| S9 (Hate) | Hate speech, discrimination, or extremist content |
| S10 (Suicide & Self-Harm) | Content promoting self-injury or suicide |
| S11 (Sexual Content) | Inappropriate sexual content or explicit material |
| S12 (Jailbreak / Prompt Injections) | Attempts to bypass AI safety measures |
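For downstream code, the category table above can be mirrored as a small lookup. The sketch below is illustrative only: the `RiskCategory` enum and the `describe_codes` helper are hypothetical names, and the comma-separated code string it parses (for example `"S1, S9"`) is an assumed format, not a documented Virtue Guard response.

```python
from enum import Enum
from typing import List


class RiskCategory(Enum):
    """Category codes and names copied from the table above."""
    S1 = "Violent Crimes"
    S2 = "Non-Violent Crimes"
    S3 = "Sex-Related Crimes"
    S4 = "Child Sexual Exploitation"
    S5 = "Specialized Advice"
    S6 = "Privacy"
    S7 = "Intellectual Property"
    S8 = "Indiscriminate Weapons"
    S9 = "Hate"
    S10 = "Suicide & Self-Harm"
    S11 = "Sexual Content"
    S12 = "Jailbreak / Prompt Injections"


def describe_codes(codes: str) -> List[str]:
    """Expand a comma-separated code string into readable labels.

    The input format is an assumption made for this example.
    """
    labels = []
    for raw in codes.split(","):
        code = raw.strip().upper()
        if not code:
            continue  # skip empty fragments such as a trailing comma
        try:
            labels.append(f"{code} ({RiskCategory[code].value})")
        except KeyError:
            # Codes outside S1..S12 are reported rather than dropped.
            labels.append(f"{code} (unknown category)")
    return labels


if __name__ == "__main__":
    for label in describe_codes("S1, S9"):
        print(label)
```

Keeping unknown codes visible, instead of silently discarding them, makes it easier to notice if the taxonomy is extended later.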
Performance Comparison
| Model | OpenAI Mod AUPRC (↑) | ToxicChat AUPRC (↑) | Overkill AUPRC (↑) | TwinSafety AUPRC (↑) | Aegis AUPRC (↑) |
|---|---|---|---|---|---|
| VirtueGuard Text-Lite | 0.948 | 0.912 | 0.918 | 0.796 | 0.907 |
| Llama Guard 1 7B | 0.796 | 0.651 | 0.862 | 0.715 | 0.897 |
| Llama Guard 2 8B | 0.803 | 0.525 | 0.893 | 0.764 | 0.883 |
| Llama Guard 3 8B | 0.820 | 0.570 | 0.886 | 0.796 | 0.903 |
| ShieldGemma 2B | 0.630 | 0.597 | 0.894 | 0.701 | 0.889 |
| ShieldGemma 9B | 0.894 | 0.767 | 0.914 | 0.735 | 0.904 |
| ShieldGemma 27B | 0.689 | 0.653 | 0.884 | 0.715 | 0.876 |
| OpenAI Moderation API | 0.870 | 0.562 | 0.804 | 0.607 | 0.845 |
| Perspective API | 0.787 | 0.499 | 0.567 | 0.583 | 0.825 |
| ToxicChat T5 | 0.742 | 0.563 | 0.796 | 0.607 | 0.755 |
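AUPRC is the area under the precision-recall curve, where higher is better. The page does not say which estimator produced the numbers above, so the following is only a minimal sketch of one common choice, the step-wise "average precision" estimate. The function name and the toy labels and scores are made up for illustration.

```python
from typing import Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Step-wise estimate of the area under the precision-recall curve.

    labels: 1 for unsafe (positive) examples, 0 for safe ones.
    scores: model confidence that each example is unsafe.
    Assumes scores are not tied; ties are ranked in input order.
    """
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    total_positives = sum(labels)
    if total_positives == 0:
        raise ValueError("need at least one positive label")

    # Rank examples from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    true_positives = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            true_positives += 1
            # Precision at the rank where recall increases.
            precision_sum += true_positives / rank
    return precision_sum / total_positives


if __name__ == "__main__":
    toy_labels = [1, 0, 1, 1, 0]
    toy_scores = [0.9, 0.8, 0.7, 0.4, 0.2]
    print(round(average_precision(toy_labels, toy_scores), 3))
```

Unlike accuracy, this metric does not depend on a single decision threshold, which is why it is often reported for moderation benchmarks where unsafe examples are rare.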
Processing Speed
Lightning-Fast Processing
On-premises latency: 1 ms
API latency: 20 ms
Supported Languages
Supporting 90+ Languages
Regions covered: European, Asian, Middle Eastern, African, South Asian, and other regions
Full language support:
Afrikaans, Amharic, Arabic, Asturian, Azerbaijani, Bashkir, Belarusian, Bulgarian, Bengali, Brazilian Portuguese, Breton, Bosnian, Catalan/Valencian, Cebuano, Chinese, Czech, Welsh, Danish, German, Greek, English, Spanish, Estonian, Persian, Fulah, Finnish, French, French Canadian, Western Frisian, Irish, Scottish Gaelic, Galician, Gujarati, Hausa, Hebrew, Hindi, Croatian, Haitian Creole, Hungarian, Armenian, Indonesian, Igbo, Iloko, Icelandic, Italian, Japanese, Javanese, Georgian, Kazakh, Central Khmer, Kannada, Korean, Luxembourgish, Ganda, Lingala, Lao, Lithuanian, Latvian, Malagasy, Macedonian, Malayalam, Mongolian, Marathi, Malay, Burmese, Nepali, Dutch/Flemish, Norwegian, Northern Sotho, Occitan, Oriya, Punjabi, Polish, Pashto, Portuguese, Romanian/Moldavian/Moldovan, Russian, Sindhi, Sinhala, Slovak, Slovenian, Somali, Albanian, Serbian, Swati, Sundanese, Swedish, Swahili, Tamil, Thai, Tagalog, Tswana, Turkish, Ukrainian, Urdu, Uzbek, Vietnamese, Wolof, Xhosa, Yiddish, Yoruba, Zulu