Company
Rollbar
Date Published
Author
-
Word count
1844
Language
English
Hacker News points
None

Summary

Rollbar's blog series describes how they used Kafka to modernize their data ingestion architecture by replacing legacy components. The first post explains how the existing system processed event payloads through an API that wrote them to disk for batch insertion into databases, an approach that limited scalability and future growth. Rollbar evaluated Kafka alongside Cloud Pub/Sub as a way to streamline this pipeline and integrate with Kubernetes, choosing Kafka for its mature ecosystem and its fit with their processing model. Kafka offers flexible data retention, parallel processing across partitions, and straightforward scaling, and the post details how Rollbar sized the cluster, chose partition counts, and configured brokers to meet their ingestion throughput requirements. For their Node.js API, Rollbar chose the node-rdkafka library for efficient message publication, stressing that broker and producer settings must be tuned to meet performance and reliability goals. The post concludes that Kafka is manageable provided configurations are carefully tailored to expected traffic and system throughput, and previews future posts on consumer logic and system deployment.
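To make the producer-tuning and partition-sizing points concrete, here is a minimal sketch in Node.js. The topic name, broker addresses, and all config values are illustrative assumptions, not Rollbar's actual settings; the property names are standard librdkafka configuration keys that node-rdkafka accepts.

```javascript
// Hedged sketch of a high-throughput producer configuration for an
// ingestion API. Values are illustrative, not Rollbar's real settings.
const producerConfig = {
  'metadata.broker.list': 'broker1:9092,broker2:9092,broker3:9092',
  // Require acknowledgement from all in-sync replicas for durability.
  'request.required.acks': -1,
  // Batch messages briefly to trade a little latency for throughput.
  'queue.buffering.max.ms': 50,
  'queue.buffering.max.messages': 100000,
  // Compress batches to reduce broker and network load.
  'compression.codec': 'lz4',
  // Retry transient broker errors instead of dropping payloads.
  'message.send.max.retries': 5,
  'retry.backoff.ms': 200,
};

// With node-rdkafka, this config would be used roughly as follows
// (left as comments so the sketch has no native-module dependency):
//   const Kafka = require('node-rdkafka');
//   const producer = new Kafka.Producer(producerConfig);
//   producer.connect();
//   producer.on('ready', () => {
//     producer.produce('event-payloads', null,
//       Buffer.from(JSON.stringify(event)));
//   });

// A rough rule of thumb for partition counts (an assumption, not
// Rollbar's formula): provision at least target throughput divided by
// the measured per-partition throughput.
function partitionCount(targetMBps, perPartitionMBps) {
  return Math.ceil(targetMBps / perPartitionMBps);
}
```

For example, a 50 MB/s ingestion target with roughly 10 MB/s sustained per partition suggests at least `partitionCount(50, 10)` = 5 partitions, before adding headroom for growth.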