Home / Companies / HuggingFace / Blog / Post Details
Content Deep Dive

Anonymizer SLM series: Privacy-first PII replacement models (0.6B/1.7B/4B)

Blog post from HuggingFace

Post Details
Company
Date Published
Author
Pratyush Ranjan Tiwari and Eternis Team
Word Count
1,962
Language
-
Hacker News Points
-
Summary

The discussed article explores privacy challenges and solutions associated with large language models (LLMs) that handle sensitive user data, emphasizing the need for privacy-preserving models. It critiques the limitations of current methods like Trusted Execution Environments (TEEs) and introduces a novel approach focusing on on-device anonymization that detects and surgically replaces personally identifiable information (PII) with semantically equivalent placeholders. This process ensures that queries sent to external models do not expose private data, while maintaining contextual integrity. The approach leverages lightweight models for precise PII replacement, combined with network-level protections such as TEE proxies and traffic obfuscation to safeguard against potential data breaches. The article highlights the performance and training of these models, noting improvements through techniques like Group Relative Policy Optimization (GRPO), which align performance with state-of-the-art models while being optimized for consumer hardware deployment. It discusses the development and deployment of Silo, an app that offers these privacy guarantees, allowing users to utilize LLMs for sensitive tasks without compromising personal data.