Building a streaming database service
Siddharth Kothari
May 21, 2015
Streaming Queries with ElasticSearch
Transcript
Streaming @siddharthlatest
The title could also be:
* Streaming Queries with ElasticSearch
* Streaming DBs will take over the world
* What the heck is a streaming database?
Siddharth, sith in making
Co-founder and CEO, appbase.io
Give me a shout at @siddharthlatest
GET /programming/stacktrace
["Games", "C", "Python", "Java (sigh)", "AI", "JS", "Databases"]
Streaming, like videos?! You see …
Not Video Streaming (via Ben Rubin, CEO @ Meerkat)
• live
Not just synchronous and 1:Many, but also: CRUD with structured data
Topics:
1. What is a streaming database?
2. The use-cases
3. ElasticSearch as the query layer
4. Streaming Topology
5. How does it scale?
6. The future
Results as Streams
Use-cases
1. Streams and Firehoses from #IoT
2. Monitoring Systems
3. Analytics
4. E-commerce: Search, Price Monitoring
Elasticsearch
• Distributed full-text search based on Lucene
• Can scale to many nodes and is highly available
• Analytics, document-oriented, open source
ES: Percolation, aka Search in Reverse
1. Indexing a Query
2. Matches when new documents are added
3. Distributed design since v1.0.0
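To make "search in reverse" concrete, here is a minimal in-memory sketch in Python. It is not how Elasticsearch implements percolation. It only illustrates the idea from the slide: queries are stored first, and each incoming document is checked against them. The `Percolator` class and the `term` helper are hypothetical names invented for this illustration.

```python
from typing import Callable, Dict, List

Document = Dict[str, object]
Query = Callable[[Document], bool]


class Percolator:
    """Toy 'search in reverse': store queries, then match documents against them."""

    def __init__(self) -> None:
        self._queries: Dict[str, Query] = {}

    def register(self, query_id: str, query: Query) -> None:
        # Step 1: index the query itself under an id.
        self._queries[query_id] = query

    def percolate(self, doc: Document) -> List[str]:
        # Step 2: when a new document arrives, return the ids of matching queries.
        return sorted(qid for qid, q in self._queries.items() if q(doc))


def term(field: str, word: str) -> Query:
    """Hypothetical helper: match when `word` is one of the words in a text field."""
    def matches(doc: Document) -> bool:
        return word.lower() in str(doc.get(field, "")).lower().split()
    return matches


percolator = Percolator()
percolator.register("q-python", term("title", "python"))
percolator.register("q-java", term("title", "java"))

matched = percolator.percolate({"title": "Streaming queries in Python"})
print(matched)
```

A real percolator evaluates full query DSL against documents and distributes the registered queries across shards. The sketch keeps everything in one dictionary so that the order of operations is easy to see.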
An Example Snippet
• Registering a Query
• Match a Document
• Response
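The snippets shown on these slides did not survive in the transcript. As a stand-in, the sketch below builds the request shapes used by the percolator API in the Elasticsearch 1.x line, which was current when the talk was given. Later major versions replaced this API with a percolator field type and a percolate query. The index name `tweets`, the type `tweet`, the query, and the sample response are all assumptions made up for illustration and are not the author's example. Nothing is sent over the network.

```python
import json
from typing import Any, Dict, List, Tuple

Request = Tuple[str, str, str]  # (HTTP method, path, JSON body)


def register_query(index: str, query_id: str, query: Dict[str, Any]) -> Request:
    # ES 1.x stores percolator queries as documents under the reserved `.percolator` type.
    return ("PUT", f"/{index}/.percolator/{query_id}", json.dumps({"query": query}))


def percolate_doc(index: str, doc_type: str, doc: Dict[str, Any]) -> Request:
    # The candidate document travels in the request body under the "doc" key.
    return ("GET", f"/{index}/{doc_type}/_percolate", json.dumps({"doc": doc}))


def matched_ids(response: Dict[str, Any]) -> List[str]:
    # A percolate response lists the matching registered queries under "matches".
    return [m["_id"] for m in response.get("matches", [])]


reg = register_query("tweets", "1", {"match": {"text": "elasticsearch"}})
perc = percolate_doc("tweets", "tweet", {"text": "streaming with elasticsearch"})

# Hand-written illustration of the response shape, not captured server output.
sample_response = {"total": 1, "matches": [{"_index": "tweets", "_id": "1"}]}

print(reg[0], reg[1])
print(perc[0], perc[1])
print(matched_ids(sample_response))
```

Building the requests as plain tuples keeps the example independent of any client library. The same method, path, and body could be passed to whichever HTTP client is at hand.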
Streaming Topology
• Queries are subscriptions (HTTP Streaming / Websockets).
• Publish matches to subscribers.
• Works as is with the ES API.
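The "queries are subscriptions" idea can be sketched as a small publish/subscribe loop. This is an illustration under stated assumptions and not appbase.io's implementation. Subscribers are plain Python callbacks standing in for HTTP streaming or WebSocket connections, queries are plain predicates standing in for ES queries, and `StreamingQueries` is a hypothetical name.

```python
from collections import defaultdict
from typing import Any, Callable, DefaultDict, Dict, List, Tuple

Document = Dict[str, Any]
Predicate = Callable[[Document], bool]
Subscriber = Callable[[str, Document], None]


class StreamingQueries:
    """Each registered query doubles as a subscription that receives future matches."""

    def __init__(self) -> None:
        self._queries: Dict[str, Predicate] = {}
        self._subscribers: DefaultDict[str, List[Subscriber]] = defaultdict(list)

    def subscribe(self, query_id: str, predicate: Predicate, subscriber: Subscriber) -> None:
        # Registering the query and opening the stream are the same step.
        self._queries[query_id] = predicate
        self._subscribers[query_id].append(subscriber)

    def index(self, doc: Document) -> int:
        # On every write, publish the document to subscribers of each matching query.
        delivered = 0
        for query_id, predicate in self._queries.items():
            if predicate(doc):
                for subscriber in self._subscribers[query_id]:
                    subscriber(query_id, doc)
                    delivered += 1
        return delivered


received: List[Tuple[str, Document]] = []
streams = StreamingQueries()
streams.subscribe("cheap", lambda d: d.get("price", 0) < 100,
                  lambda qid, d: received.append((qid, d)))

first = streams.index({"name": "lamp", "price": 40})   # matches "cheap"
second = streams.index({"name": "sofa", "price": 400})  # matches nothing
print(first, second, received)
```

In a networked version the callback would write to an open connection, and a disconnect would remove the subscription.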
Streaming Topology
• Beyond Percolation, keep the document store model of ES.
• Every document is a topic, which can have references.
• When a doc is created, updated, or deleted, notify all the docs that refer to it.
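The reference-notification rule above can be sketched with a reverse index from each document to the documents that refer to it. This is a toy model with assumed names (`ReferenceStore`, `add_reference`, the `user/...` and `room/...` ids). The slides do not say how references are actually stored.

```python
from collections import defaultdict
from typing import Any, DefaultDict, Dict, List, Set, Tuple

Event = Tuple[str, str, str]  # (notified doc id, operation, changed doc id)


class ReferenceStore:
    """Documents act as topics; a change notifies every document that refers to it."""

    def __init__(self) -> None:
        self._docs: Dict[str, Dict[str, Any]] = {}
        self._referrers: DefaultDict[str, Set[str]] = defaultdict(set)
        self.events: List[Event] = []

    def add_reference(self, source_id: str, target_id: str) -> None:
        # Record that `source_id` refers to `target_id`.
        self._referrers[target_id].add(source_id)

    def _notify(self, operation: str, doc_id: str) -> None:
        # Sorted only to make the event order deterministic in this sketch.
        for referrer in sorted(self._referrers[doc_id]):
            self.events.append((referrer, operation, doc_id))

    def put(self, doc_id: str, body: Dict[str, Any]) -> None:
        operation = "updated" if doc_id in self._docs else "created"
        self._docs[doc_id] = body
        self._notify(operation, doc_id)

    def delete(self, doc_id: str) -> None:
        if self._docs.pop(doc_id, None) is not None:
            self._notify("deleted", doc_id)


store = ReferenceStore()
store.put("user/alice", {"name": "Alice"})
store.put("room/lobby", {"topic": "general"})
store.add_reference("room/lobby", "user/alice")  # the room refers to the user

store.put("user/alice", {"name": "Alice B."})
store.delete("user/alice")
print(store.events)
```

The first two writes produce no events because nothing refers to those documents yet. Only the update and delete after `add_reference` reach `room/lobby`.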
Streaming Workflows
• Every document has an ES Path exposed by REST.
• Endpoint → Worker → Push back to stream
• Topology like Apache Storm, but you can notify the workers using the entire ElasticSearch API.
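One possible reading of the endpoint, worker, push-back flow is sketched below. It assumes that a worker is a function attached to a document path, that it runs whenever that path is written, and that whatever it returns is written back and therefore appears on the stream too. The `Workflow` class and the example paths are hypothetical and are not taken from the talk.

```python
from collections import defaultdict
from typing import Any, Callable, DefaultDict, Dict, List, Optional, Tuple

Doc = Dict[str, Any]
# A worker receives the changed path and document and may return a (path, document) to write back.
Worker = Callable[[str, Doc], Optional[Tuple[str, Doc]]]


class Workflow:
    def __init__(self) -> None:
        self.store: Dict[str, Doc] = {}
        self._workers: DefaultDict[str, List[Worker]] = defaultdict(list)
        self.stream: List[Tuple[str, Doc]] = []  # everything pushed to subscribers, in order

    def attach(self, path: str, worker: Worker) -> None:
        self._workers[path].append(worker)

    def write(self, path: str, doc: Doc) -> None:
        # Endpoint: store the document and push the change to the stream.
        self.store[path] = doc
        self.stream.append((path, doc))
        # Worker: run attached workers and push their results back as new writes.
        # A worker that writes to its own path would recurse forever in this sketch.
        for worker in list(self._workers[path]):
            result = worker(path, doc)
            if result is not None:
                self.write(*result)


def total_worker(path: str, doc: Doc) -> Optional[Tuple[str, Doc]]:
    # Derive an order total and write it to a separate, made-up path.
    return ("/totals/total/1", {"total": sum(doc["items"])})


flow = Workflow()
flow.attach("/orders/order/1", total_worker)
flow.write("/orders/order/1", {"items": [10, 20]})
print(flow.stream)
```

Chaining works the same way: attaching another worker to `/totals/total/1` would extend the pipeline by one stage, which is the Storm-like aspect the slide alludes to.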
How does it distribute, scale?
• Underlying Store: ES is highly available, can scale to many nodes.
• We were able to ingest 100,000 documents per second on 20 c4.2xlarge nodes (AWS), roughly 5,000 documents per second per node.
• Eventually consistent, with a very small delay t.
• Distributed Streaming Topology is a work in progress.
Looking forward
“The web has moved to #realtime, why shouldn’t the Backend Infrastructure too?”
“DBs are moving to having RESTful APIs, percolators, streaming interfaces are the next steps.”
Bold claim
“In three years, every modern DB will have a streaming interface.”
@siddharthlatest