Building a streaming database service
Siddharth Kothari
May 21, 2015
Streaming Queries with ElasticSearch
Transcript
Streaming @siddharthlatest
The title could also be:
* Streaming Queries with ElasticSearch
* Streaming DBs will take over the world
* What the heck is a streaming database?
Siddharth, sith in making
Co-founder and CEO, appbase.io
Give me a shout at @siddharthlatest
GET /programming/stacktrace
["Games", "C", "Python", "Java (sigh)", "AI", "JS", "Databases"]
Streaming, like videos?! You see …
Not Video Streaming via Ben Rubin, CEO @ Meerkat
• live
Not just synchronous and 1:Many, but also: CRUD with structured data
Topics:
1. What is a streaming database?
2. The use-cases
3. ElasticSearch as the query layer
4. Streaming Topology
5. How does it scale?
6. The future
Results as Streams
Use-cases
1. Streams and Firehoses from #IoT
2. Monitoring Systems
3. Analytics
4. E-commerce: Search, Price Monitoring
Elasticsearch
• Distributed full-text search based on Lucene
• Scales to many nodes and is highly available
• Analytics, document-oriented, open source
ES: Percolation, aka Search in Reverse
1. Indexing a Query
2. Matches when new documents are added
3. Distributed design since v1.0.0
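Percolation inverts the usual search flow: queries are stored first, and each incoming document is checked against them. The sketch below is a toy in-memory model of that idea, not Elasticsearch's implementation. The `ToyPercolator` class and its matching rule (a query matches when all of its terms appear in the named field) are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ToyPercolator:
    """Toy reverse search: store queries, then match documents against them."""

    # query_id -> (field name, set of required lowercase terms)
    queries: dict = field(default_factory=dict)

    def register(self, query_id, field_name, text):
        """Step 1: index a query instead of a document."""
        self.queries[query_id] = (field_name, set(text.lower().split()))

    def percolate(self, doc):
        """Step 2: return the ids of stored queries that match a new document."""
        matches = []
        for query_id, (field_name, terms) in self.queries.items():
            doc_terms = set(str(doc.get(field_name, "")).lower().split())
            if terms <= doc_terms:  # every query term is present in the document
                matches.append(query_id)
        return sorted(matches)


percolator = ToyPercolator()
percolator.register("q1", "message", "bonsai tree")
percolator.register("q2", "message", "streaming database")

print(percolator.percolate({"message": "A new bonsai tree in the office"}))
```

A real percolator evaluates the full query DSL rather than plain term sets, but the control flow (register once, match on every new document) is the same.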
An Example Snippet
• Registering a Query
• Match a Document
• Response
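The snippets shown on these slides are not part of the transcript. As a stand-in, the sketch below builds the two requests of the Elasticsearch 1.x percolator API as plain Python data, without sending them: registering a query under the `.percolator` type, then percolating a document with `_percolate`. The index name `my-index`, the type `my-type`, the `message` field, and both helper functions are hypothetical. The response in the final comment is the general shape documented for 1.x, not output from the talk.

```python
import json


def register_query_request(index, query_id, query):
    """Request that stores a query in the index's .percolator type (ES 1.x)."""
    return {
        "method": "PUT",
        "path": f"/{index}/.percolator/{query_id}",
        "body": json.dumps({"query": query}),
    }


def percolate_request(index, doc_type, doc):
    """Request that asks which stored queries match the given document (ES 1.x)."""
    return {
        "method": "GET",
        "path": f"/{index}/{doc_type}/_percolate",
        "body": json.dumps({"doc": doc}),
    }


# Hypothetical names: my-index, my-type, and the message field.
register = register_query_request(
    "my-index", "1", {"match": {"message": "bonsai tree"}}
)
match = percolate_request(
    "my-index", "my-type", {"message": "A new bonsai tree in the office"}
)

for request in (register, match):
    print(request["method"], request["path"], request["body"])

# A 1.x response lists the matching query ids, along the lines of:
# {"total": 1, "matches": [{"_index": "my-index", "_id": "1"}]}
```

Later Elasticsearch releases replaced these endpoints with a percolate query, so treat the paths above as specific to the 1.x line that was current at the time of the talk.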
Streaming Topology
• Queries are subscriptions (HTTP Streaming / WebSockets).
• Publish matches to subscribers.
• Works as is with the ES API.
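One way to picture "queries are subscriptions" is a registry that pairs each stored query with the subscriber waiting on it and pushes every newly indexed matching document to that subscriber. The sketch below is an in-process model only. `StreamingIndex`, its predicate-based queries, and the per-subscriber queues that stand in for HTTP streaming or WebSocket connections are all assumptions, not the appbase.io design.

```python
from collections import deque


class StreamingIndex:
    """In-process model: each subscription is a stored query plus an outbox."""

    def __init__(self):
        self.documents = []
        self.subscriptions = []  # list of (predicate, outbox) pairs

    def subscribe(self, predicate):
        """Register a query; the returned deque stands in for an open stream."""
        outbox = deque()
        self.subscriptions.append((predicate, outbox))
        return outbox

    def index(self, doc):
        """Store a document and publish it to every subscription it matches."""
        self.documents.append(doc)
        for predicate, outbox in self.subscriptions:
            if predicate(doc):
                outbox.append(doc)


store = StreamingIndex()
hot = store.subscribe(lambda d: d.get("temperature", 0) > 30)
kitchen = store.subscribe(lambda d: d.get("room") == "kitchen")

store.index({"room": "kitchen", "temperature": 35})
store.index({"room": "garage", "temperature": 12})

print(list(hot))
print(list(kitchen))
```

Only documents indexed after a subscription is registered reach its outbox, which mirrors how percolation matches new documents rather than existing ones.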
Streaming Topology
Streaming Topology
• Beyond Percolation, keep the document store model of ES.
• Every document is a topic, which can have references.
• When a doc is created, updated, or deleted, notify all the docs that refer to it.
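The reference model in this slide can be sketched with a reverse index from each document to the documents that refer to it. `DocumentGraph`, the `refs` argument, and the event tuples are hypothetical names for illustration; the talk does not say how references are stored or delivered.

```python
from collections import defaultdict


class DocumentGraph:
    """Every document is a topic; changes notify documents that reference it."""

    def __init__(self):
        self.docs = {}
        self.referrers = defaultdict(set)  # target id -> ids of docs referring to it
        self.events = defaultdict(list)    # doc id -> notifications received

    def _notify(self, doc_id, action):
        for referrer in sorted(self.referrers[doc_id]):
            self.events[referrer].append((action, doc_id))

    def put(self, doc_id, body, refs=()):
        """Create or update a document and record which documents it refers to."""
        action = "updated" if doc_id in self.docs else "created"
        self.docs[doc_id] = body
        for target in refs:
            self.referrers[target].add(doc_id)
        self._notify(doc_id, action)

    def delete(self, doc_id):
        """Delete a document and tell its referrers."""
        self.docs.pop(doc_id, None)
        self._notify(doc_id, "deleted")


graph = DocumentGraph()
graph.put("user/1", {"name": "Ada"})
graph.put("room/42", {"title": "Lobby"}, refs=["user/1"])  # room/42 refers to user/1
graph.put("user/1", {"name": "Ada L."})
graph.delete("user/1")

print(graph.events["room/42"])
```

Here `room/42` hears about the update and the deletion of `user/1` because it registered a reference before those changes happened.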
Streaming Workflows
• Every document has an ES Path exposed by REST.
• Endpoint -> Worker -> Push back to stream
• Topology like Apache Storm, but you can notify the workers using the entire ElasticSearch API.
How does it distribute, scale?
• Underlying Store: ES is highly available, can scale to many nodes.
• We were able to ingest 100,000 documents per second on 20 c4.2xlarge nodes (AWS).
• Eventually consistent, with a very small time lag t.
• Distributed Streaming Topology is a work in progress.
Looking forward
“The web has moved to #realtime, why shouldn’t the Backend Infrastructure too?”
“DBs are moving to having RESTful APIs; percolators and streaming interfaces are the next steps.”
Bold claim
“In three years, every modern DB will have a streaming interface.”
@siddharthlatest