/plushcap/analysis/clickhouse/s3-gcs-clickpipes-beta

ClickPipes for Batch Data Loading: Introducing S3 and GCS Support

What's this blog post about?

ClickHouse, a popular open-source columnar database management system, has expanded its connectivity platform with new beta connectors for Amazon S3 and Google Cloud Storage (GCS). These connectors aim to improve the data loading process by ensuring resiliency against interruptions and offering continuous loading capabilities. The key behind this resilience lies in smart use of ClickHouse's destination service ingest capabilities, orchestration with temporary staging tables, a custom KeeperMap state for tracking progress, and the robust underlying infrastructure of ClickPipes. Currently, the beta connectors support JSON, CSV, TSV, and Parquet formats, as well as public and private buckets with various authentication methods. The platform is expected to evolve further with more updates and enhancements in the future.

Company
ClickHouse

Date published
April 18, 2024

Author(s)
Ryadh Dahimene

Word count
533

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.