How MemGQL Lets You Query Distributed Data Without ETL
Blog post from Memgraph
Memgraph Zero is a new product line that addresses the challenges of querying distributed data without the need for ETL by enabling the data to remain in its original locations while still being accessible as a graph. The first component, MemGQL, was introduced by Memgraph CTO Marko Budiselic, offering a federated GQL layer that allows for querying across different data sources such as Postgres, ClickHouse, and MySQL, through a graph model. MemGQL is designed to support a "multi-player" mode for AI agents, allowing them to share and reuse data findings rather than duplicating efforts. Implemented in Rust and utilizing the Bolt protocol, MemGQL facilitates the translation of queries into backend-native languages like Cypher and SQL, leveraging existing storage and execution capabilities of target systems. The architecture centers on a virtual graph, avoiding unnecessary data copying and enabling the execution of queries directly on source systems. While still developing features such as authentication and advanced pushdown, MemGQL aims to integrate with various systems, providing a cohesive interface for both developers and AI agents to work with distributed and heterogeneous data sources.