Scaling Semantic Search at Vectara: Corpora
Blog post from Vectara
Vectara Semantic Search is a fully managed platform that simplifies the deployment of neural information retrieval systems. It integrates the essential subsystems, including ML inference, vector indexes, and monitoring tools, into a single offering accessible via REST and gRPC APIs. Such systems are often complex and demanding to operate, yet Vectara stays scalable and reliable: it reports over 99.9% uptime and handles high query loads through replication across multiple availability zones. The platform is tailored for SaaS applications, where a single customer account may manage thousands of corpora. Recent improvements addressed bottlenecks in data retrieval and transmission, so the platform can now support even larger scales, such as a million corpora in a single account. Vectara's approach to semantic search supports cross-language hybrid search and aims to deliver precise, context-aware responses in natural language. Its extensive corpus management capabilities distinguish it from other solutions such as AWS Kendra.
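To put the 99.9% uptime figure in concrete terms, the short sketch below converts an availability percentage into a maximum downtime budget. The function name `max_downtime_hours` and the 30-day month are illustrative assumptions, not part of any Vectara API or SLA definition.

```python
# Convert an availability percentage into a maximum downtime budget.
# Illustrative arithmetic only; not taken from Vectara's SLA terms.
HOURS_PER_YEAR = 365 * 24  # 8760 hours, ignoring leap years


def max_downtime_hours(uptime_percent: float,
                       period_hours: float = HOURS_PER_YEAR) -> float:
    """Return the hours of downtime allowed at the given uptime level."""
    return period_hours * (1 - uptime_percent / 100)


if __name__ == "__main__":
    yearly = max_downtime_hours(99.9)
    monthly = max_downtime_hours(99.9, period_hours=30 * 24)
    print(f"{yearly:.2f} hours per year")
    print(f"{monthly * 60:.1f} minutes per 30-day month")
```

At exactly 99.9%, the budget works out to about 8.76 hours per year, or roughly 43 minutes per 30-day month; "over 99.9%" means actual downtime stays below that.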