From Postgres to OpenSearch in No Time

Image © massmatt https://flic.kr/p/25eF9D3 (CC BY 2.0) From Postgres To
OpenSearch In No Time Gunnar Morling Software Engineer, Decodable @gunnarmorling

From Postgres to OpenSearch | @gunnarmorling Today’s Mission Learn About…

From Postgres to OpenSearch | @gunnarmorling • Software engineer at
Decodable • Former project lead of Debezium • kcctl 🧸, JfrUnit, ModiTect, MapStruct • Spec Lead for Bean Validation 2.0 • Java Champion Gunnar Morling

From Postgres to OpenSearch | @gunnarmorling Updating the Search Index
One Idea? 🤔

From Postgres to OpenSearch | @gunnarmorling Debezium Log-Based Change Data
Capture

From Postgres to OpenSearch | @gunnarmorling Debezium in a Nutshell
Open-Source Change Data Capture • A CDC Platform ◦ Based on transaction logs ◦ Snapshotting, filtering, etc. ◦ Outbox support ◦ Web-based UI • Fully open-source, very active community • Large production deployments

From Postgres to OpenSearch | @gunnarmorling Change Data Capture Liberation
for Your Data

From Postgres to OpenSearch | @gunnarmorling • Core ◦ MySQL,
MariaDB ◦ Postgres ◦ SQL Server ◦ MongoDB ◦ Db2, Informix ◦ Oracle • Community-led: ◦ Vitess, Cassandra, Spanner • External: ScyllaDB, Yugabyte Debezium Supported Databases

From Postgres to OpenSearch | @gunnarmorling Debezium: Data Change Events
• Old and new row state • Metadata on table, TX id, etc. • Operation type, timestamp

From Postgres to OpenSearch | @gunnarmorling Becoming the De-Facto CDC
Standard https://debezium.io/blog/2021/09/22/deep-dive-into-a-debezium-community-connector-scylla-cdc-source-connector/ Debezium

Apache Flink Colin Howley https://flic.kr/p/698F5j (CC BY-ND 2.0)

From Postgres to OpenSearch | @gunnarmorling Apache Flink Stateful Computations
over Data Streams https://flink.apache.org/

From Postgres to OpenSearch | @gunnarmorling • Real-time reporting/dashboards •
Low-latency alerting, notifications • Materialized view maintenance, caches • Real-time cross-database sync, lookup joins, windowed joins, aggregations • Machine learning: model serving, feature engineering • Change data capture, data integration Apache Flink Common Use Cases https://flink.apache.org/poweredby.html

From Postgres to OpenSearch | @gunnarmorling Apache Flink APIs for
Application Development Image source: “Change Data Capture with Flink SQL and Debezium” by Marta Paes at DataEngBytes (https://noti.st/morsapaes/liQzgs/change-data-capture-with-flink-sql-and-debezium)

From Postgres to OpenSearch | @gunnarmorling Apache Flink Stream Processing
of Change Data Events

From Postgres to OpenSearch | @gunnarmorling Debezium and Apache Flink
Integration Options

Demo © Luke Jones https://flic.kr/p/sEq4MA (CC BY-SA 2.0)

From Postgres to OpenSearch | @gunnarmorling Driving Full-Text Search Propagating
Joined Data to OpenSearch

From Postgres to OpenSearch | @gunnarmorling Nested Data Structures UDFs
to the Rescue

to the Rescue

to the Rescue https://www.youtube.com/@decodable

From Postgres to OpenSearch | @gunnarmorling Ingest Once... ...Process Multiple
Times

From Postgres to OpenSearch | @gunnarmorling Transactional Aggregation Correlating Events
From Same Transaction

From Postgres to OpenSearch | @gunnarmorling Wrap-Up

From Postgres to OpenSearch | @gunnarmorling • Debezium: Real-time change
event streams for your data • Debezium and Apache Flink: Power house of change stream processing ◦ Data Integration ◦ Data Cleansing ◦ Denormalization ◦ Aggregations ◦ Pattern Matching Take Aways 🤩

From Postgres to OpenSearch | @gunnarmorling • Provisioning and updating
infrastructure • Deployment and (auto-)scaling • Observability • State management • Schema management and inference • Developer experience • CI/CD • Security and access control Towards Production What To Consider

From Postgres to OpenSearch | @gunnarmorling • Debezium: @debezium |
https://debezium.io/ • Apache Flink: @ApacheFlink | https://flink.apache.org/ • Getting started with Flink: github.com/decodableco/examples → flink-learn Learn More

From Postgres to OpenSearch | @gunnarmorling gunnar@decodable.co @gunnarmorling 📧 Thank
You! Q & A

From Postgres to OpenSearch in No Time

From Postgres to OpenSearch in No Time

More Decks by Gunnar Morling

Other Decks in Programming

Featured

Transcript