Company
Date Published
Author
Dominic Page
Word count
832
Language
English
Hacker News points
None

Summary

At Elastic{ON} 2018 in San Francisco, the Elastic Training team held a drawing for an Online Annual Training Subscription, open to attendees who had their badges scanned at the event. The team used Elasticsearch to process the entries, aiming to keep the selection fair and the data private throughout. They first removed duplicate entries and filtered out Elastic employees, then redacted personally identifiable information before indexing attendee emails. Using Elasticsearch's _bulk API and a dedicated ingest pipeline, they assigned document IDs, extracted email domains, and routed entries by eligibility, redacting any potentially personal data along the way. The eligible entries were then indexed into a single-shard index to prevent inconsistencies, and a random_score query with a fixed seed value kept the selection fair. Stephen Steck was announced as the winner of the training subscription. The entire process, including technical details and sample data, was shared in a GitHub Gist for public exploration.
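The steps summarized above can be sketched roughly as follows. This is a minimal illustration and not the team's actual code: the sample addresses, the index name `drawing-entries`, the field names `email_domain` and `eligible`, the employee-domain list, and the choice of `_seq_no` as the `random_score` field are all assumptions made for the example. The sketch performs the deduplication and eligibility tagging in plain Python (the original used an ingest pipeline for the domain extraction and routing), then builds a `_bulk` request body and a seeded `function_score` query without contacting a cluster.

```python
import json

# Hypothetical sample data; the real entries are not reproduced here.
RAW_ENTRIES = [
    "ana@example.com",
    "Ana@Example.com",   # duplicate once case is normalized
    "bob@elastic.co",    # employee domain, assumed ineligible
    "chen@example.org",
]

# Assumption for illustration: employees are identified by email domain.
EMPLOYEE_DOMAINS = {"elastic.co"}


def prepare_entries(emails):
    """Deduplicate case-insensitively and keep only domain plus eligibility."""
    seen = set()
    docs = []
    for email in emails:
        normalized = email.strip().lower()
        if normalized in seen:
            continue
        seen.add(normalized)
        domain = normalized.rsplit("@", 1)[-1]
        # The full address is deliberately not stored in the document.
        docs.append({
            "email_domain": domain,
            "eligible": domain not in EMPLOYEE_DOMAINS,
        })
    return docs


def bulk_body(docs, index_name="drawing-entries"):
    """Build a newline-delimited _bulk body: one action line, one source line."""
    lines = []
    for doc_id, doc in enumerate(docs, start=1):
        lines.append(json.dumps({"index": {"_index": index_name, "_id": str(doc_id)}}))
        lines.append(json.dumps(doc))
    # The _bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


def winner_query(seed):
    """Search body that scores eligible documents randomly but repeatably."""
    return {
        "size": 1,
        "query": {
            "function_score": {
                "query": {"term": {"eligible": True}},
                "random_score": {"seed": seed, "field": "_seq_no"},
                "boost_mode": "replace",
            }
        },
    }


docs = prepare_entries(RAW_ENTRIES)
payload = bulk_body(docs)
query = winner_query(seed=2018)
```

Sending `payload` to the `_bulk` endpoint and `query` to the `_search` endpoint of that index would return the top-scoring eligible document. With the same seed and unchanged index contents, repeating the search should return the same document, which is what makes a seeded draw auditable.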