Home / Companies / ClickHouse / Blog / Post Details
Content Deep Dive

If it’s in your catalog, you can query it: The DataLakeCatalog engine in ClickHouse Cloud

Blog post from ClickHouse

Post Details
Company
Date Published
Author
Tom Schreiber
Word Count
3,018
Company Posts That Month
21
Language
English
Hacker News Points
-
Summary

ClickHouse has expanded its capabilities to query Iceberg and Delta Lake tables directly through the DataLakeCatalog engine, enabling seamless integration with catalogs such as AWS Glue Catalog and Databricks Unity Catalog. This advancement allows users to connect ClickHouse to these catalogs, automatically detecting table formats and querying them instantly, enhancing ClickHouse's role as a high-performance lakehouse query engine. Through the ClickHouse Cloud platform, features like a highly parallel native Parquet reader, distributed cache layer, and stateless compute nodes provide fast, scalable querying across both native and external data sources. The DataLakeCatalog engine facilitates querying lakehouse data as native tables, supporting full Iceberg and Delta Lake compatibility, including schema evolution and catalog integration. Demonstrations with AWS Glue and Unity Catalogs highlight ClickHouse's ability to perform federated queries across different data sources, showcasing its potential for integrating and analyzing diverse datasets in a unified analytics environment. Future developments include support for Iceberg V3, write operations, and optimized file handling, positioning ClickHouse Cloud as a comprehensive solution for lakehouse data management and analysis.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
Data Pipeline 1 529 243 71 +9%