[DevNexus-2018] Apache Kafka A Streaming Data Platform
Viktor Gamov
February 22, 2018
Transcript
Apache Kafka: A Streaming Data Platform
@gamussa @confluentinc
Who am I? Solutions Architect, Developer Advocate, @gamussa in internetz
Hey you, yes, you, go follow me on Twitter
A company is built on DATA FLOWS, but all we have is DATA STORES
Streaming Platform: 1. Pub/Sub 2. Store 3. Process
Core abstraction: DB - table, Hadoop - file, Messaging - ?
LOGS
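The log abstraction named on this slide can be illustrated with a minimal in-memory sketch. The `Log` class and its method names are hypothetical and are not Kafka's API; the sketch only shows the two properties that matter here: records are appended at the end, and each record is addressed by its offset.

```python
class Log:
    """A minimal in-memory append-only log (illustrative only, not Kafka's API)."""

    def __init__(self):
        self._records = []

    def append(self, record):
        """Append a record at the end and return its offset."""
        self._records.append(record)
        return len(self._records) - 1

    def read(self, offset, max_records=10):
        """Return up to max_records records starting at offset, in append order."""
        return self._records[offset:offset + max_records]


log = Log()
first = log.append("signup:alice")   # offset 0
second = log.append("login:alice")   # offset 1
third = log.append("signup:bob")     # offset 2

# A reader chooses where to start; reading never removes records.
from_second = log.read(second)
```

Because reads do not consume records, any number of readers can replay the same log from different offsets.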
Producing to Kafka [diagram: messages written over time]
Producing to Kafka - With Key [diagram: keys A, B, C, D over time]: hash(key) % numPartitions = N
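The formula on this slide can be sketched as follows. This is an illustration under assumptions: `partition_for` is a hypothetical helper, and `zlib.crc32` is a stand-in hash picked only because it is deterministic across runs. Kafka's Java client uses a murmur2 hash of the serialized key, so real partition numbers will differ.

```python
import zlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a key to a partition index: hash(key) % num_partitions."""
    # Assumption: crc32 stands in for the client's real hash function.
    return zlib.crc32(key.encode("utf-8")) % num_partitions


NUM_PARTITIONS = 4

# The same key always maps to the same partition, so all records
# for one key land in one partition, in the order they were sent.
assignments = {key: partition_for(key, NUM_PARTITIONS) for key in ["A", "B", "C", "D"]}
repeat = partition_for("A", NUM_PARTITIONS)
```

Note that changing `num_partitions` changes the mapping, which is why adding partitions to a topic can move a key to a different partition.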
Producing to Kafka - No Key: messages will be produced in a round-robin fashion
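The round-robin behavior described on this slide can be sketched as below. `round_robin_partitioner` is a hypothetical helper, not a Kafka class, and newer client versions may distribute keyless records differently than this 2018 slide describes.

```python
from itertools import cycle


def round_robin_partitioner(num_partitions):
    """Return a function that yields the next partition index on each call."""
    partitions = cycle(range(num_partitions))
    return lambda: next(partitions)


next_partition = round_robin_partitioner(4)

# Six keyless messages are spread evenly, wrapping around after partition 3.
chosen = [next_partition() for _ in range(6)]
# chosen == [0, 1, 2, 3, 0, 1]
```

The trade-off versus keyed production is that load is balanced evenly, but related records are no longer guaranteed to share a partition.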
Consuming From Kafka - Single Consumer [diagram: one consumer C]
Consuming From Kafka - Grouped Consumers [diagram: consumers C in groups C1 and C2]
Consuming From Kafka - Grouped Consumers [diagram: partitions 0, 1, 2, 3 assigned to the consumers of a group; in the final build one consumer is labeled "0, 3"]
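The grouped-consumer diagrams can be read as a partition assignment that is recomputed when group membership changes. The sketch below is a simplification under assumptions: `assign` and the consumer names `c0` to `c3` are hypothetical, and the assignment strategies in a real client are more involved.

```python
def assign(partitions, consumers):
    """Spread partitions over consumers round-robin; each partition gets one owner."""
    if not consumers:
        raise ValueError("a group needs at least one consumer")
    assignment = {consumer: [] for consumer in consumers}
    for i, partition in enumerate(sorted(partitions)):
        owner = consumers[i % len(consumers)]
        assignment[owner].append(partition)
    return assignment


partitions = [0, 1, 2, 3]

# Four consumers, four partitions: one partition each.
before = assign(partitions, ["c0", "c1", "c2", "c3"])
# before == {'c0': [0], 'c1': [1], 'c2': [2], 'c3': [3]}

# Consumer c3 leaves the group: its partition is handed to a survivor.
after = assign(partitions, ["c0", "c1", "c2"])
# after == {'c0': [0, 3], 'c1': [1], 'c2': [2]}
```

Within a group every partition has exactly one owner, so adding consumers beyond the partition count leaves the extra consumers idle.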
Producers, Consumers
Kafka Connect does hard work so you don’t: 1. Scale out
Streaming Platform: 1. Pub/Sub 2. Store 3. Process
Why Store?
Scalability of a filesystem: throughput of 100s of MB/s, TBs per server, commodity hardware, O(1) writes
Guarantees of a database: persistence, strict ordering
Distributed by Design: replication, fault tolerance, partitioning, scale
Partition Leadership and Replication [diagram: Topic1 partitions 1-4 spread across Brokers 1-4, each partition shown on three brokers, with a Leader/Follower legend]
Partition Leadership and Replication - node failure [same diagram after a broker fails]
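The node-failure slide can be sketched as a leader re-election over each partition's replica list. Everything here is an assumption for illustration: the replica placement is invented (the diagram's exact layout is not recoverable from the transcript), `elect_leaders` is a hypothetical helper, and a real cluster coordinates this centrally and only promotes replicas that are up to date.

```python
def elect_leaders(replicas, live_brokers):
    """For each partition, pick the first replica hosted on a live broker."""
    leaders = {}
    for partition, brokers in replicas.items():
        live = [b for b in brokers if b in live_brokers]
        leaders[partition] = live[0] if live else None  # None: partition offline
    return leaders


# Hypothetical placement: 4 partitions, 3 replicas each, across brokers 1-4.
replicas = {
    "Topic1-partition1": [1, 2, 3],
    "Topic1-partition2": [2, 3, 4],
    "Topic1-partition3": [3, 4, 1],
    "Topic1-partition4": [4, 1, 2],
}

healthy = elect_leaders(replicas, {1, 2, 3, 4})

# Broker 4 fails: only the partition it was leading needs a new leader.
degraded = elect_leaders(replicas, {1, 2, 3})
```

In this placement, `Topic1-partition4` moves from broker 4 to broker 1 and the other three leaders stay where they were, which is the point of keeping followers on other brokers.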
Streaming Platform: 1. Pub/Sub 2. Store 3. Process
What is Stream Processing? A machine for combining streams of events
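One simple reading of "combining streams of events" is merging several time-ordered streams into one. The sketch below assumes events are `(timestamp, type, user)` tuples and that each input is already sorted by timestamp; the event data and the `combine` helper are hypothetical, and real stream processors also offer joins, windows, and aggregations that this does not cover.

```python
import heapq

# Two hypothetical event streams, each already ordered by timestamp.
clicks = [(1, "click", "alice"), (4, "click", "bob"), (6, "click", "alice")]
purchases = [(2, "purchase", "alice"), (5, "purchase", "bob")]


def combine(*streams):
    """Lazily merge time-ordered streams into one time-ordered stream."""
    return heapq.merge(*streams, key=lambda event: event[0])


combined = list(combine(clicks, purchases))
timestamps = [event[0] for event in combined]
# timestamps == [1, 2, 4, 5, 6]
```

`heapq.merge` is lazy, so the same function works on generators that produce events indefinitely rather than on finite lists.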
https://www.confluent.io/download/
We are hiring! https://www.confluent.io/careers/
One more thing…
A Major New Paradigm
Thanks! Questions? @gamussa
[email protected]