Company
Date Published
Author
MongoDB
Word count
415
Language
English
Hacker News points
None

Summary

The Large Hadron Collider (LHC) generates massive amounts of data that is distributed across a worldwide grid, making it challenging to query across the whole grid until now. The CMS project developed a Data Aggregation System using MongoDB as a cache to allow for querying across all machines. The system uses a simple, SQL-like language to transform queries into MongoDB queries and iterates over locations to gather results, merging them and inserting the aggregated information into the cache. The goal is to field test it, horizontally scale the system with sharding, and apply it to problems outside of the LHC and CERN.