Qdrant 1.18 - TurboQuant
Blog post from Qdrant
Qdrant 1.18 introduces several enhancements. The headline feature is TurboQuant, a new quantization method developed with Google Research that doubles the compression ratio of scalar quantization while maintaining similar recall and speed. Memory monitoring is improved through a Web UI and an API endpoint, which helps with capacity planning. Named vectors can now be added to or removed from a collection's schema without recreating the collection, which simplifies migrating between embedding models. Audit logging gains a new API endpoint for querying logs and support for request tracing IDs, which helps with security reviews and compliance audits. Per-collection API metrics are now available, giving collection-specific insight into response times and error rates. New strict mode guardrails help prevent system overload by rejecting operations that would consume excessive memory and by capping batch search requests.
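
To put the compression claim in capacity-planning terms, here is a rough back-of-the-envelope sketch. It assumes that scalar quantization stores one byte per dimension (int8, i.e. 4x smaller than float32) and reads "doubles the compression ratio" as half the scalar-quantized size. The collection size and dimension are hypothetical, and the calculation ignores index structures, metadata, and any per-vector overhead that TurboQuant may add, so treat the numbers as an estimate rather than a measurement.

```python
def vector_bytes(num_vectors: int, dim: int, bits_per_dim: int) -> int:
    """Raw vector storage only: no index, payload, or per-vector overhead."""
    return num_vectors * dim * bits_per_dim // 8


# Hypothetical collection, chosen only for illustration.
NUM_VECTORS = 1_000_000
DIM = 1024

float32_bytes = vector_bytes(NUM_VECTORS, DIM, 32)  # uncompressed float32
scalar_bytes = vector_bytes(NUM_VECTORS, DIM, 8)    # assumed int8 scalar quantization (4x)
turbo_bytes = scalar_bytes // 2                     # assumed: twice the scalar compression ratio

for name, size in [
    ("float32", float32_bytes),
    ("scalar", scalar_bytes),
    ("turbo", turbo_bytes),
]:
    ratio = float32_bytes / size
    print(f"{name:>8}: {size / 1e9:.2f} GB  ({ratio:.0f}x compression)")
```

Under these assumptions the quantized vectors take roughly an eighth of the original float32 footprint, which is the figure to compare against the memory monitoring output when sizing a node.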