ScyllaDB Vector Search: 1B Vectors with 2ms P99s and 250K QPS Throughput

Post Details

Company

ScyllaDB

Date Published

Dec. 1, 2025

Author

Szymon Wasik

Word Count

1,462

Company Posts That Month

5

Language

English

Hacker News Points

-

Source URL

www.scylladb.com/2025/12/01/scylladb-vector-search-1b-benchmark

Summary

ScyllaDB Vector Search is a high-performance solution designed to handle billion-scale datasets with ultra-low latency and high throughput, as validated by a benchmark using the yandex-deep_1b dataset containing 1 billion vectors of 96 dimensions. The system achieves this through an architecture that separates storage and indexing duties while maintaining a unified user perspective, with nodes storing structured data and vector embeddings in a distributed table. A dedicated Vector Store service, implemented in Rust and powered by the USearch engine, builds approximate-nearest-neighbour indexes in memory to ensure predictable single-digit millisecond latencies. Two usage scenarios were tested: one prioritized ultra-low latency with moderate recall, achieving 252,000 queries per second, while the other focused on high recall with slightly higher latency, maintaining 6,500 queries per second. ScyllaDB integrates structured and unstructured data retrieval, simplifying operational complexity by eliminating the need for separate systems and reducing network costs. With planned enhancements, including scalar quantization and sharding, ScyllaDB aims to further boost performance for real-time AI applications, offering a scalable and reliable solution for latency-critical tasks such as fraud detection and recommendation systems.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Vector Search	25	1,445	313	116	+11%
Real-time	5	7,285	1,202	224	+60%
LLM	2	3,775	638	202	-32%
Data Pipeline	1	896	273	69	+167%
Developer Experience	1	454	241	96	-6%
RAG	1	909	198	86	-19%