Streaming Ingestion & Processing at Flipkart
Siddhartha Reddy
May 15, 2015
Technology
Presented at the Bangalore Hadoop Meetup held on 15th May 2015.
Transcript
Streaming Ingestion & Processing at Flipkart Siddhartha Reddy @sids
Flipkart Data Platform (an oversimplified view)
Streaming Ingestion
Choices • push, not pull • schemas & validations
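As an illustration of the "schemas & validations" choice, here is a minimal sketch of a source pushing events that are checked against a declared schema before they are handed on. The schema, field names, and the `validate`/`push` helpers are hypothetical and are not taken from the talk; the deck does not say which schema format or validation library was used.

```python
from typing import Dict, List

# Hypothetical schema: field name -> required Python type.
ORDER_SCHEMA: Dict[str, type] = {
    "order_id": str,
    "amount": float,
    "created_at": int,
}


def validate(event: dict, schema: Dict[str, type]) -> List[str]:
    """Return a list of violations; an empty list means the event conforms."""
    errors = []
    for field, expected in schema.items():
        if field not in event:
            errors.append(f"missing field: {field}")
        elif not isinstance(event[field], expected):
            errors.append(f"bad type for {field}: expected {expected.__name__}")
    return errors


def push(event: dict, schema: Dict[str, type], sink: list) -> bool:
    """Source-side push: only events that conform to the schema reach the sink."""
    if validate(event, schema):
        return False
    sink.append(event)
    return True
```

Because every event in the sink has passed the same check, a consumer can rely on the declared fields being present with the declared types.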
Streaming Ingestion v1.0
• Push ⇒ accountability (with source teams)
  • good call!
• Schemas ⇒ contracts for consumers
  • can make assumptions that are assured to be true
• Insufficient tooling ⇒ too many “ingestion frameworks”
  • adopt some frameworks & offer as tools!
• Synchronous error handling ⇒ complexity
  • accept all data
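One way to read the "accept all data" lesson is that ingestion never rejects an event at push time, and validation happens later, out of band. The sketch below illustrates that reading only; the `Ingestor` class, the required-field contract, and the quarantine list are assumptions made for the example, not details from the talk.

```python
from collections import deque


class Ingestor:
    """Accepts every event up front and validates it afterwards."""

    def __init__(self, required_fields):
        self.required_fields = tuple(required_fields)  # hypothetical contract
        self.accepted = deque()   # everything the sources push lands here
        self.valid = []           # events that satisfy the contract
        self.quarantine = []      # events set aside for later inspection

    def ingest(self, event):
        # No synchronous error path: the source never sees a rejection.
        self.accepted.append(event)

    def process_backlog(self):
        # Validation runs after ingestion has already succeeded.
        while self.accepted:
            event = self.accepted.popleft()
            missing = [f for f in self.required_fields if f not in event]
            if missing:
                self.quarantine.append({"event": event, "missing": missing})
            else:
                self.valid.append(event)
```

The source side stays simple because it has no error response to handle, while invalid events are still kept and can be reported back to the owning team.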
Streaming Ingestion v2.0
Stream Processing
An Example
Streaming Joins: Example
It works! But… how do we deal with lookup failures?
Streaming Joins: Handling Failures
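The slide images are not part of this transcript, but the summary names a lookup table, a retry queue, and a batch re-driver for streaming joins. The sketch below shows one plausible way those pieces could fit together. The `StreamJoiner` class, the `product_id` join key, and the retry limit are all hypothetical, and the real design may differ.

```python
from collections import deque


class StreamJoiner:
    """Joins each event against a lookup table, retrying lookups that miss."""

    def __init__(self, lookup, max_attempts=3):
        self.lookup = lookup               # dict: join key -> dimension record
        self.max_attempts = max_attempts   # hypothetical retry limit
        self.retry_queue = deque()         # (event, attempts made so far)
        self.joined = []                   # successfully enriched events
        self.for_batch_redrive = []        # events handed off to a batch job

    def _try_join(self, event, attempts):
        row = self.lookup.get(event["product_id"])
        if row is not None:
            self.joined.append({**event, **row})
        elif attempts + 1 < self.max_attempts:
            # Lookup failed: the matching record may simply not have arrived yet.
            self.retry_queue.append((event, attempts + 1))
        else:
            # Retries exhausted: leave the event for a later batch re-drive.
            self.for_batch_redrive.append(event)

    def on_event(self, event):
        self._try_join(event, 0)

    def drain_retries(self):
        # One pass over the events currently waiting; misses are re-queued.
        for _ in range(len(self.retry_queue)):
            event, attempts = self.retry_queue.popleft()
            self._try_join(event, attempts)
```

A short usage example: an event whose key is missing is retried once the lookup table catches up, and an event that never finds a match ends up in the re-drive list.

```python
lookup = {"p1": {"title": "Phone"}}
joiner = StreamJoiner(lookup, max_attempts=3)
joiner.on_event({"order_id": "o1", "product_id": "p1"})  # joins immediately
joiner.on_event({"order_id": "o2", "product_id": "p2"})  # queued for retry
lookup["p2"] = {"title": "Book"}                          # late-arriving record
joiner.drain_retries()                                    # o2 now joins
```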
Streaming Joins: Bootstrapping
With a little help from MR (MapReduce) friends
Streaming Joins: But… The example that doesn’t really work correctly
Streaming Joins
In summary
• Streaming Ingestion: push, schemas & validation, HTTP service, local daemon, change data capture
• Streaming Joins: indexing, lookup tables, map-joins, retry queue, batch re-driver
sid@flipkart.com