Company:
Date Published:
Author: Lucia Cerchie, Robin Moffatt, Josep Prat
Word count: 5191
Language: English
Hacker News points: None

Summary

Integrating Apache Kafka with relational databases is popular because databases are a rich source of events: both existing data and subsequent changes can be streamed into Kafka topics. From there, the data can drive applications, be streamed to other stores, and be retained for analytics. The JDBC connector for Kafka Connect, included with Confluent Platform and also available separately from Confluent Hub, pulls data from databases such as Oracle, SQL Server, MySQL, and Postgres into Kafka, and can push data the other way as well. To ingest data from a database, users make the JDBC driver available to Kafka Connect and supply a configuration that specifies, among other things, the JDBC connection URL and the tables to pull. Data can be imported in bulk or streamed incrementally based on an incrementing or timestamp column, and Single Message Transforms (SMTs) allow further customization of the resulting Kafka messages. Concurrency is managed by adjusting the number of connector tasks, which determines how many database connections are used. The article provides detailed instructions and troubleshooting tips for setting up and managing these configurations, emphasizing correct driver installation and error handling.
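
To make the configuration options mentioned above concrete, the sketch below shows a hypothetical JDBC source connector definition in Kafka Connect's .properties format. The connector class and property names are those of the Confluent JDBC connector; the connection URL, credentials, table names, column names, and topic prefix are illustrative placeholders rather than values taken from the article.

# Sketch of a JDBC source connector configuration (standalone .properties format).
# Connection details, table names, and column names below are assumed placeholders.
name=jdbc-source-example
connector.class=io.confluent.connect.jdbc.JdbcSourceConnector

# JDBC connection details -- adjust for your database and credentials
connection.url=jdbc:mysql://mysql-host:3306/demo
connection.user=connect_user
connection.password=connect_password

# Pull only these tables (comma-separated)
table.whitelist=orders,customers

# Stream rows incrementally based on a timestamp plus an incrementing id column,
# rather than re-importing everything in bulk on each poll
mode=timestamp+incrementing
timestamp.column.name=updated_at
incrementing.column.name=id

# Target topics are named <topic.prefix><table name>, e.g. mysql-orders
topic.prefix=mysql-
poll.interval.ms=10000

# Concurrency: up to this many tasks (and therefore database connections) across tables
tasks.max=4

# Single Message Transforms: copy the id column from the value into the message key
transforms=copyIdToKey,extractKey
transforms.copyIdToKey.type=org.apache.kafka.connect.transforms.ValueToKey
transforms.copyIdToKey.fields=id
transforms.extractKey.type=org.apache.kafka.connect.transforms.ExtractField$Key
transforms.extractKey.field=id

In distributed mode the same key/value pairs would be sent as JSON to the Kafka Connect REST API instead of being loaded from a properties file; either way, the JDBC driver JAR for the target database must be on the connector's plugin path before the connector is created.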