Announcing chDB 4: write Pandas, run ClickHouse, now on Hex
Blog post from ClickHouse
chDB 4, launched in collaboration with Hex, integrates a Pandas-like DataStore API into the Hex platform, allowing users to write familiar Pandas code that runs on the efficient ClickHouse engine without needing local installations or additional infrastructure. This innovation bridges the gap between SQL and function chain approaches, offering a seamless experience for data practitioners by enabling Pythonic function chaining with the power of ClickHouse's optimized SQL execution. The new DataStore component employs a four-layer architecture that transforms Pandas-style code into ClickHouse execution, utilizing lazy execution to optimize performance and minimize memory overhead. By maintaining Pandas compatibility while leveraging ClickHouse's computational efficiency, chDB 4 addresses common challenges in data science workflows, such as scale limitations and intermediate data handling, and offers smart caching, segment execution, and unified data source support to enhance productivity. This integration is particularly beneficial for users of Hex notebooks, facilitating faster and more efficient data analysis without altering existing workflows, and is complemented by a strong focus on AI-assisted development and Pandas compatibility to ensure seamless integration with modern LLMs.