Company
Date Published
Author
Adrien Grand
Word count
723
Language
English
Hacker News points
None

Summary

The text discusses the benefits and implementation of "doc values" in Elasticsearch, a feature introduced in the 1.0 release, which improves memory management by storing field data on disk rather than in-memory. This change addresses issues related to large memory usage and garbage collection delays when handling analytics through facets. Doc values, a Lucene 4.x feature, optimize storage by using a column stride format, which is effective for sorting and faceting, providing faster refresh times and efficient compression compared to traditional field data methods. While they can increase index size, doc values are advantageous in constrained hardware environments and for memory management, as they allow the Elasticsearch process to run with significantly less RAM. To enable doc values, they must be set in the mappings before indexing documents, and they are currently supported on non-analyzed string fields and numeric fields. The text encourages users to experiment with the latest beta release and provide feedback on their experience.