Company
Date Published
Author
Russ Cam
Word count
2958
Language
English
Hacker News points
None

Summary

The article by Russ Cam discusses the evolution and future of handling attachments in Elasticsearch, specifically for .NET developers using the NEST high-level client. Historically, the mapper-attachments plugin allowed for the indexing and searching of text-based document attachments, such as Word documents and PDFs, by extracting metadata and content. However, with the release of Elasticsearch 5.0, this plugin became deprecated in favor of the ingest-attachment processor plugin, which offers a more efficient and streamlined approach to processing attachments by performing extraction before indexing. The article provides a detailed walkthrough of transitioning from the mapper-attachments plugin to using the ingest-attachment processor, including installation, configuration, and mapping steps, as well as demonstrating how to index and search attachments within Elasticsearch. This new method, which is part of the ingest node suite, improves the predictability and performance of attachment handling, ensuring that extracted metadata is readily available in the search results and stored in the source field.