Home / Companies / HuggingFace / Blog / Post Details
Content Deep Dive

Introducing the Ettin Reranker Family

Blog post from HuggingFace

Post Details
Company
Date Published
Author
Tom Aarsen
Word Count
5,698
Company Posts That Month
55
Language
-
Hacker News Points
-
Summary

Tom Aarsen announced the release of the Ettin Reranker Family, a set of six new state-of-the-art Sentence Transformers CrossEncoder rerankers, each built on Ettin ModernBERT encoders. These models, ranging from 17 million to 1 billion parameters, are designed to enhance the accuracy of document retrieval systems by reordering results based on relevance scores. They employ a pointwise mean squared error (MSE) distillation from a strong teacher model, using a broad dataset of approximately 143 million query-document pairs. The rerankers are particularly efficient due to their architecture, which supports modern attention mechanisms like Flash Attention 2, offering significant speed improvements over previous models. The Ettin rerankers outperform existing models such as the MiniLM series on both MTEB and NanoBEIR benchmarks while maintaining high throughput. The release includes training recipes and data, making it accessible for further development and optimization by the community.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
Vector Search 19 2,268 422 128 +30%
AI Model Fine-tuning 5 615 196 69 +46%
AI Coding Assistant 2 1,798 527 167 +21%
LLM 2 9,074 1,640 224 +53%