The Great Query Refactoring: Thou shalt only parse once

Post Details

Company

Elastic

Date Published

Sept. 14, 2016

Author

Christoph Büscher • Luca Cavanna • Isabel Drost-Fromm

Word Count

1,874

Language

-

Hacker News Points

-

Source URL

www.elastic.co/blog/the-great-query-refactoring-thou-shalt-only-parse-once

Summary

In this blog post, Christoph Büscher, Luca Cavanna, and Isabel Drost-Fromm describe a major refactoring in Elasticsearch 5.0 that changed how search requests are handled internally. Previously, search requests were parsed at the shard level, so the same request was parsed multiple times, which was inefficient and made error handling and testing difficult. The refactoring introduced an intermediate representation of queries: a request is now parsed once on the coordinating node, and the resulting representation is serialized and sent to the shards for execution. This improves efficiency and enables optimizations such as "Instant Aggregations." The change required extensive testing and incremental development to keep the code stable and maintainable, and it ultimately led to a cleaner code architecture and paved the way for new features. The authors reflect on the challenges and benefits of this architectural shift and emphasize the importance of frequent testing and small, manageable changes throughout the process.