Home / Companies / Tinybird / Blog / Post Details
Content Deep Dive

Blog

Blog post from Tinybird

Post Details
Company
Date Published
Author
Daniel Pozo
Word Count
1,084
Language
English
Hacker News Points
-
Summary

Tinybird utilizes a modified version of ClickHouse to optimize data processing by employing a compute-storage separation model, where data is stored in AWS S3 or Google Cloud Storage and cached on local SSDs for enhanced speed and efficiency. The architecture supports zero-copy replication, allowing multiple replicas to reference a single data copy, thus reducing storage costs and improving replication speed. Data ingestion into the ClickHouse cluster is achieved through either a streaming process, managed by the Gatherer to batch events for efficient processing, or a direct batch process, with all writes directed to object storage. The system also employs a packed part format to minimize S3 write operations, significantly cutting infrastructure costs for clients with high data ingestion rates. Tinybird manages all underlying infrastructure elements, including the local cache and replication processes, allowing users to focus on data management without handling the complexities of the system architecture.