Company
Date Published
Author
Multiple Authors
Word count
1427
Language
English
Hacker News points
None

Summary

Cohere has unveiled Rerank 3 Nimble, the latest addition to its Rerank model series, designed to enhance enterprise search and Retrieval-Augmented Generation (RAG) systems with a significant increase in speed and efficiency—approximately three times faster than its predecessor, Rerank 3, without compromising accuracy. This model is available in both English and a multilingual version supporting over 100 languages and can process very long documents, making it versatile for a range of data types. Rerank 3 Nimble is engineered to improve search relevancy by reordering documents based on their relevance to a query, and is particularly beneficial for high-volume workloads in industries such as retail, where reduced latency can lead to higher conversion rates. By integrating with Cohere's Command R generative model series, it enhances the efficiency of RAG applications by minimizing the documents passed to language models for grounded generation. Available on Amazon SageMaker and for on-premise deployments, Rerank 3 Nimble maintains competitive pricing with its predecessor and is set to launch on Amazon Jumpstart in July 2024.