MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier
Blog post from HuggingFace
MiniGuard-v0.1, a new safety classifier developed by Prem Research, offers significant advancements in efficiency and cost-effectiveness compared to larger models like NVIDIA's Nemotron-Guard-8B. With only 0.6 billion parameters, MiniGuard achieves 99.5% of the benchmark accuracy of Nemotron, despite being 13 times smaller, 2.5 times faster, and 67% cheaper to operate on modern GPUs. This is achieved through targeted synthetic data, step-by-step distillation, model soup, and FP8 quantization, collectively addressing the limitations of smaller models in handling context-dependent safety decisions. These techniques not only compress the knowledge of larger models but also enhance MiniGuard's performance on out-of-distribution production data, retaining 91.1% of Nemotron's performance at a fraction of the cost. The model is available under an MIT license and serves as a drop-in replacement for existing safety classifiers, offering a promising solution for applications where cost and latency are critical considerations.