Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Post Details

Company

Hugging Face

Date Published

May 14, 2026

Author

Radu Florian, Parul Awasthy, Aashka Trivedi, and Madison Lee

Word Count

3,411

Company Posts That Month

55

Language

-

Hacker News Points

-

Post removed?

No

Source URL

huggingface.co/blog/ibm-granite/granite-embedding-multilingual-r2

Summary

The Granite Embedding Multilingual R2 release introduces two new Apache 2.0 licensed multilingual embedding models, offering significant advancements in retrieval quality across 200+ languages and code. The compact 97M-parameter model, built on ModernBERT architecture, achieves outstanding retrieval scores, outperforming other sub-100M parameter models on the MTEB Multilingual Retrieval benchmark, while the full-size 311M-parameter model ranks second among open models under 500M parameters. These models, which feature 32K-token context handling and cross-lingual code retrieval, are designed to be enterprise-ready, having been trained with IBM-curated datasets and stringent governance processes to ensure commercial use safety. They are compatible with sentence-transformers and other frameworks like LangChain, LlamaIndex, Haystack, and Milvus, requiring no task-specific instructions for integration. The models also support Matryoshka Representation Learning, allowing flexible embedding dimensions to optimize storage and computation costs. These innovations represent a leap forward in multilingual model efficiency, offering robust performance in cross-lingual, code, and long-document retrieval tasks.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Vector Search	82	2,268	422	128	+30%
RAG	2	2,105	333	83	+124%
AI Model Fine-tuning	1	615	196	69	+46%
LLM	1	9,074	1,640	224	+53%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.