Home / Companies / Weaviate / Blog / Post Details
Content Deep Dive

BlockMax WAND: How Weaviate Achieved 10x Faster Keyword Search

Blog post from Weaviate

Post Details
Company
Date Published
Author
AndrĂ© MourĂ£o, Joon-Pil (JP) Hwang
Word Count
2,960
Language
English
Hacker News Points
-
Summary

Weaviate's hybrid search utilizes keyword and vector search to enhance data retrieval, with the BlockMax WAND algorithm offering significant improvements in document scoring speed. By optimizing the inverted index and employing advanced compression methods like varenc and delta encoding, BlockMax WAND reduces the number of documents inspected during keyword searches, leading to faster query times and a decrease in disk space usage by 50-90%. The process involves dividing posting lists into blocks with local max impact, which allows for more efficient data skipping and loading, while the compression techniques applied to term frequencies and doc IDs further enhance storage efficiency. As a result, p50 query times are reduced to 10-20% of the original, and Queries Per Second (QPS) are significantly increased, marking a crucial step towards making Weaviate's hybrid search scalable to billions of documents. However, BlockMax WAND is currently available as a technical preview in version 1.29, with potential changes expected in future updates, and it is not yet recommended for production environments.