Introducing the enrich processor for Elasticsearch ingest nodes

Post Details

Company

Elastic

Date Published

Dec. 10, 2019

Author

Martijn van Groningen

Word Count

1,403

Company Posts That Month

36

Post removed?

No

Source URL

www.elastic.co/blog/introducing-the-enrich-processor-for-elasticsearch-ingest-nodes

Summary

Elasticsearch 7.5.0 introduced the enrich processor for ingest nodes, which adds data from a reference dataset to documents as they are ingested. Ingest nodes pre-process documents before indexing by running them through pipelines. Each pipeline is an ordered list of processors, and each processor performs one operation, such as splitting or removing a field. The enrich processor looks up matching data in another index and adds it to the incoming document, for example adding a user's full name based on a matching email address.

The post walks through enriching flight data with airport data. US airport codes and their details are indexed as reference data, and an enrich policy describes how to match incoming documents against them. Executing the policy creates a system index optimized for fast lookups during enrichment. A pipeline is then defined that enriches each flight document and cleans it up by removing redundant fields and renaming others, so that the original fields are replaced with JSON objects carrying the additional airport information.

The feature is available by upgrading to Elasticsearch 7.5 or by using Elasticsearch Service.
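The flow summarized above can be sketched roughly as follows. This is a minimal illustration, not the configuration from the original post: the index name `airports`, the policy name `airport-lookup`, the pipeline name `flights-enrich`, and every field name are assumptions made for the example. The two dictionaries show the general shape of the request bodies for a `match` enrich policy and an ingest pipeline; the `simulate` helper is a hypothetical local stand-in that mimics the effect of the pipeline in plain Python so the result can be inspected without a cluster.

```python
import json

# All names below (index, policy, pipeline, fields) are hypothetical.

# Body for: PUT /_enrich/policy/airport-lookup
# After creating it, the policy is executed with:
#   POST /_enrich/policy/airport-lookup/_execute
ENRICH_POLICY = {
    "match": {
        "indices": "airports",          # reference index (assumed name)
        "match_field": "code",          # field compared with the incoming value
        "enrich_fields": ["name", "city", "state"],
    }
}

# Body for: PUT /_ingest/pipeline/flights-enrich
PIPELINE = {
    "description": "Enrich flights with airport details, then tidy up",
    "processors": [
        # Look up the airport code and store the match in a temporary field.
        {"enrich": {
            "policy_name": "airport-lookup",
            "field": "departure_airport",
            "target_field": "departure_airport_info",
        }},
        # Drop the bare code, then move the enriched object into its place.
        {"remove": {"field": "departure_airport"}},
        {"rename": {
            "field": "departure_airport_info",
            "target_field": "departure_airport",
        }},
    ],
}


def simulate(doc, reference_docs):
    """Approximate the pipeline's effect locally (not how Elasticsearch runs it).

    Assumes the incoming code has a match in the reference data; without one,
    the rename step raises KeyError.
    """
    policy = ENRICH_POLICY["match"]
    kept = [policy["match_field"], *policy["enrich_fields"]]
    lookup = {
        ref[policy["match_field"]]: {k: ref[k] for k in kept if k in ref}
        for ref in reference_docs
    }
    out = dict(doc)
    for processor in PIPELINE["processors"]:
        (kind, cfg), = processor.items()
        if kind == "enrich":
            match = lookup.get(out.get(cfg["field"]))
            if match is not None:
                out[cfg["target_field"]] = match
        elif kind == "remove":
            out.pop(cfg["field"])
        elif kind == "rename":
            out[cfg["target_field"]] = out.pop(cfg["field"])
    return out


# Made-up sample data for illustration only.
airports = [{"code": "SFO", "name": "San Francisco International",
             "city": "San Francisco", "state": "CA"}]
flight = {"flight": "XY123", "departure_airport": "SFO"}

print(json.dumps(simulate(flight, airports), indent=2))
```

The remove step comes before the rename because the rename processor does not overwrite an existing target field. Writing the enrich result to a temporary field first keeps the lookup value available until the match has been stored.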
