Home / Companies / Voyage AI / Blog / Post Details
Content Deep Dive

voyage-multilingual-2: Multilingual Embedding Model

Blog post from Voyage AI

Post Details
Company
Date Published
Author
Voyage AI
Word Count
414
Language
English
Hacker News Points
-
Summary

Voyage-multilingual-2 is a newly released model optimized for multilingual retrieval and retrieval-augmented generation (RAG), outperforming alternatives such as OpenAI v3 large and Cohere multilingual v3 across major languages like French, German, Japanese, Spanish, and Korean, while maintaining strong performance in English. It excels with an average of 5.6% improvement over the second-best performing model and supports a large 32K context length, making it suitable for expertise-intensive domains including code, law, and finance. Evaluated using over 85 datasets covering 27 languages, voyage-multilingual-2 achieves superior retrieval accuracy, particularly in multilingual contexts, and is assessed using the normalized discounted cumulative gain (NDCG@10) metric. This model is designed to enhance Gen AI applications for global users and multilingual developers, providing a promising tool for those in need of advanced multilingual support.