Apache Kafka JDBC Source Connector: What could go wrong?

Francesco Tisiot - Developer Advocate @ftisiot - @aiven_io JDBC Source
Connector What Could Go Wrong?

@ftisiot | @aiven_io

@ftisiot | @aiven_io Kafka Connect

@ftisiot | @aiven_io JDBC Source Connector

@ftisiot | @aiven_io List of Tables Polling Interval Query Mode

@ftisiot | @aiven_io Bulk Mode

@ftisiot | @aiven_io Incremental Mode WHERE ID > 4 WHERE
ID > 6

@ftisiot | @aiven_io WHERE TS > 10.03 WHERE TS >
10.05 Timestamp Mode

@ftisiot | @aiven_io Query Mode WHERE COL = △

@ftisiot | @aiven_io Problems

@ftisiot | @aiven_io Which JDBC Connector ? https:/ /github.com/aiven/jdbc-connector-for-apache-kafka

@ftisiot | @aiven_io Common Challenges Data Types Out of Memory
Errors Number Mapping numeric.mapping defaultRowFetchSize

@ftisiot | @aiven_io ERROR java.lang.IllegalArgumentException: Number of groups must be
positive table.types

@ftisiot | @aiven_io Everything is Fine Not

@ftisiot | @aiven_io Fast Events

@ftisiot | @aiven_io Polling Interval State Event

@ftisiot | @aiven_io Ghost Events

@ftisiot | @aiven_io 1 2 3 Id Name 1 2
3 Incremental = No Updates!

@ftisiot | @aiven_io Name Change Timestamp 10:00 10:01 10:02 10:03
10:00 10:01 10:02 10:03 No Hard Deletes!

@ftisiot | @aiven_io Out of Order Events

@ftisiot | @aiven_io Name Change Timestamp 10:00 10:01 10:03 10:03
10:00 10:01 10:03 10:02

@ftisiot | @aiven_io Why? Device Clock Network Lag Transaction Duration
Batching

@ftisiot | @aiven_io Polling Interval 10:01 10:02 10:02 Polling Interval
10:02 > 10:01

@ftisiot | @aiven_io Polling Interval timestamp.delay.interval.ms Delay

@ftisiot | @aiven_io JDBC Limits Polling Time Out of Order
Events Load on the DB Updates/Deletions Require Extra Fields

@ftisiot | @aiven_io Log Based Approach

@ftisiot | @aiven_io Write Ahead Log - PostgreSQL binlog -
MySQL oplog - MongoDB

@ftisiot | @aiven_io Debezium Connector

@ftisiot | @aiven_io Video

@ftisiot | @aiven_io JDBC Limits Polling Time Out of Order
Events Load on the DB Updates/Deletions Require Extra Fields All Events Near Real Time Tracked as per Log Minimal Load No Extra Fields

@ftisiot | @aiven_io Additional Benefit - Enhanced Metadata!

@ftisiot | @aiven_io Timestamps Pre-Post status Operation Type Sequence Number

@ftisiot | @aiven_io

@ftisiot | @aiven_io JDBC Debezium

@ftisiot | @aiven_io https:/ /aiven.io Debezium Connector JDBC Source Connector
in Action Debezium Connector in Action JDBC Connector https:/ /ftisiot.net/talks/kafka-jdbc-what-can-go-wrong/ kafka-summit-2022 500$

Apache Kafka JDBC Source Connector: What could ...

Apache Kafka JDBC Source Connector: What could go wrong?

More Decks by FTisiot

Other Decks in Technology

Featured

Transcript