prem-research/MiniGuard-v0.1

ojus1 committed 15 days ago

Commit 44f5bdf · verified · 1 Parent(s): a2cc2d5

Update README.md

Files changed (1)

README.md +2 -2

@@ -15,7 +15,7 @@ tags:
   <img src="assets/MiniGuard-hero.png" alt="MiniGuard-v0.1 Hero" width="25%">
 </p>
-MiniGuard-v0.1 is a compact content safety classifier fine-tuned from [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B). It classifies content in both LLM inputs (prompt classification) and LLM responses (response classification), outputting whether a given prompt or response is safe or unsafe, along with the violated safety categories.
+MiniGuard-v0.1 is a compact content safety classifier fine-tuned from [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B). It classifies content in both, User inputs (prompt classification) and LLM responses (response classification), outputting whether a given prompt or response is safe or unsafe, along with the violated safety categories.
 MiniGuard-v0.1 achieves **~99% of Nemotron-Guard-8B's benchmark accuracy** with **13x fewer parameters**.
@@ -23,7 +23,7 @@ MiniGuard-v0.1 achieves **~99% of Nemotron-Guard-8B's benchmark accuracy** with
 ## Compatibility
 MiniGuard-v0.1 uses the **same prompt template** as [nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3). It is a drop-in replacement for this model.
-- A few sections of this model card are copied directly from `[nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3)`.
+- A few sections of this model card are copied directly from [nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3).
 > **Note:** The chat template has been modified to disable Qwen3's thinking mode. MiniGuard-v0.1 generates safety labels directly without chain-of-thought reasoning at inference time.