voyage-3-large: the new state-of-the-art general-purpose embedding model

Post Details

Company

Voyage AI

Date Published

Jan. 7, 2025

Author

Voyage AI

Word Count

838

Company Posts That Month

1

Language

English

Hacker News Points

-

Post removed?

No

Source URL

blog.voyageai.com/2025/01/07/voyage-3-large

Summary

Voyage-3-large is a cutting-edge, general-purpose, multilingual embedding model that leads in performance across eight domains and 100 datasets, including law, finance, and code. It surpasses OpenAI-v3-large and Cohere-v3-English by 9.74% and 20.71% on average, respectively, and is enabled by Matryoshka learning and quantization-aware training, which support smaller dimensions and quantization options to significantly reduce vectorDB costs while maintaining retrieval quality. The model offers various embedding precisions, including 32-bit, int8, and binary, with a 32K-token context length, enabling it to balance retrieval quality with storage efficiency. This model establishes a new accuracy-cost frontier, outperforming previous Voyage models and OpenAI-v3-large with reduced storage needs, and offers improvements in retrieval quality when used with binary rescoring. Voyage-3-large is now available, with the first 200 million tokens offered for free, and further information can be accessed via their documentation or through their social media and contact platforms.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Vector Search	11	2,433	274	99	-40%
RAG	1	1,794	220	80	+16%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.