Slide 1

Slide 1 text

Trends in Event Streaming Ben Stopford Office of the CTO, Confluent

Slide 2

Slide 2 text

Event Storage Kafka stores petabytes of data Stream Processing Real-time processing over streams and tables Scalability Clusters of hundreds of machines. Global. + + + An Event Streaming Platform is a Data Platform for Data in Motion

Slide 3

Slide 3 text

Why should I care?

Slide 4

Slide 4 text

Logos

Slide 5

Slide 5 text

1. Batch to Real-time

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

2. Contextual Event Driven Applications

Slide 9

Slide 9 text

Geospatial Matching Route Re- planning Business Events Business Events Customer Driver Example: Taxi

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

Event Streaming Platform

Slide 12

Slide 12 text

3. Businesses processes becoming increasingly automated

Slide 13

Slide 13 text

Loan Application Human centric: 1-2 Weeks Software centric: Seconds

Slide 14

Slide 14 text

Databases are designed to aid human- computer interaction Saving inputs, returning inputs to the screen, etc.

Slide 15

Slide 15 text

Fundamental Assumption: Data is Passive

Slide 16

Slide 16 text

Event Streaming Database

Slide 17

Slide 17 text

As business processes become increasingly automated they need a “database” designed for machineàmachine interaction

Slide 18

Slide 18 text

4. More copies of data, less deviation

Slide 19

Slide 19 text

Data in one place vs data in many places Apps Search NoSQL Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M Apps Search NoSQL Monitoring Security Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M Apps h L Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M App App Apps Search Monitoring Apps Apps App Apps Search Monitoring Apps Apps Apps h Monitoring Apps Apps App App Data is accurate. Very tightly coupled. Easy for different apps to evolve independently How do I join? Data gets out of sync

Slide 20

Slide 20 text

Three options • Single database: all data in one place • Doesn’t scale in people terms • Microservices: Data spread in many “golden sources” • Opperationanal issues ensue • Data get’s copied everywhere as teams attempt to hit deadlines • Event Streaming • Single source of truth • Applications take a copy, that can be re-sourced when necessary

Slide 21

Slide 21 text

Apps Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M Apps Monitoring Security Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M Apps Search NoSQL Monitoring Security Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M App Apps Search NoSQL Monitoring Security Apps Apps S T R E A M I N G P L AT F O R M App Apps Search NoSQL Apps Apps S T R E A M I N G P L AT F O R M Apps Search NoSQL Monitoring Security Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL Mon Sec Apps Apps S T R E A M I N G P L AT F O R M Apps Search NoSQL Monitoring Security Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL Apps App S T R E A M I N G P L AT F O R M Apps Search NoSQL Monitoring Security Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL S T R E A M I N Apps Search NoSQL Apps DWH S T R E A M I N G P L AT App Apps Search NoSQL Apps S T R E A M I N G P L AT Apps Search NoSQL Apps Apps DWH Hadoop S T R E A M I N G P L AT F O R M App Apps Search NoSQL S T Apps Search NoSQL DWH S T R E App Apps Apps Search Apps App Apps Apps Apps Search Apps Apps App Apps Apps Search App Kafka: Source of truth Evolution of software systems Monolith Distributed Monolith Microservices Event-Driven Microservices

Slide 22

Slide 22 text

Event Storage Kafka stores petabytes of data Stream Processing Real-time processing over streams and tables Scalability Clusters of hundreds of machines. Global. + + + Event Streaming

Slide 23

Slide 23 text

5. Self-Service Data

Slide 24

Slide 24 text

Trade Surveillance Project • 9 months sourcing 16 data sets • Different formats (including for historical extracts) • Batch based approach

Slide 25

Slide 25 text

Event Streams Orders Payments Customers Distinct Visits Destination Spark Postgres Lambda Other Kafka Select Organizational Events Stream Processing SELECT * FROM ORDERS O, CUSTOMERS C WHERE O.REGION = ‘EU’ AND C.TYPE = ‘Platinum’ Msgs/Day Customers Stream Processing DB App Orders History 1w All Making Data Self Service

Slide 26

Slide 26 text

Self-Service • Netflix • Monsanto • Nordea • …

Slide 27

Slide 27 text

Event Streaming Database is the abstraction for this

Slide 28

Slide 28 text

6. Migration to cloud

Slide 29

Slide 29 text

Confluent Cloud • Dramatically cheaper than self managed for most use cases • No operational burden • No need to size clusters or pre- purchase hardware • Scale from zero (i.e. free) to whatever you need.

Slide 30

Slide 30 text

Trends • Batch to real-time • Contextual event driven applications • Increasingly automated business processes • More copies, less deviation • Self-service data • Migration to cloud

Slide 31

Slide 31 text

Evolution

Slide 32

Slide 32 text

Thank you