Home / Companies / ClickHouse / Blog / Post Details
Content Deep Dive

Scaling our Observability platform beyond 100 Petabytes by embracing wide events and replacing OTel

Blog post from ClickHouse

Post Details
Company
Date Published
Author
Rory Crispin, Dale McDiarmid
Word Count
5,944
Company Posts That Month
24
Language
English
Hacker News Points
-
Summary

Over the past year, LogHouse, an internal logging platform initially designed to monitor ClickHouse Cloud, has undergone substantial growth and transformation, handling over 100 petabytes of data across nearly 500 trillion rows. This expansion necessitated significant architectural changes and the development of new tools, such as the System Tables Exporter (SysEx), to address the inefficiencies of OpenTelemetry (OTel) in managing high-throughput, high-fidelity system logs. SysEx has enabled a dramatic increase in event processing efficiency, achieving a 20-fold surge in data handling with only a tenth of the previous CPU resource usage. The shift from a one-size-fits-all approach to specialized tooling, combined with the integration of HyperDX, a ClickHouse-native UI, has not only improved data management but also fostered a cultural shift towards high-cardinality, wide-event-based observability. HyperDX facilitates seamless log exploration and correlation, supporting both standardized OTel formats and specialized data from SysEx, thus providing a unified user experience. As LogHouse continues to evolve, future enhancements, including zero-impact scraping and potential migration to JSON, are anticipated to further refine the platform's capabilities and efficiency.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
OpenTelemetry 39 336 51 32 -13%
Observability 33 1,870 422 128 +10%
Kubernetes 10 1,613 282 85 +4%
Real-time 4 4,075 1,042 211 +22%