Introduction to Druid

Introduction to Druid, fast distributed data store Nikita Salnikov-Tarnovski @iNikem

Why me?
• I introduced Druid to a team
• We are happy about it
• I like their design choices
  • but not their code :(

Problem
• Monitor what end-users are doing
• Monitor what servers are doing
• Put it together

Questions to ask
• What was the slowest service during last flash sale?
• What SQL query has the biggest impact on user satisfaction?
• Who are my most unhappy users this week?
• Are we getting better?

Data to collect

{
  "accountId": "XXXX",
  "transactionId": "9b6bbb93-0f64-389b-beae-ccd294f2286d",
  "jvmId": ["H0QXvFsZ"],
  "originatingJvm": "H0QXvFsZ",
  "applicationKey": "535624dd815fb8762c378ac6b15937dc",
  "rootCause": [454708],
  "problemId": ["454708:600492565"],
  "problemsDuration": 4572,
  "userId": 42,
  "transactionStart": "1493548894024",
  "transactionDuration": "5432",
  "success": "0",
  "slow": "0",
  "failed": "1",
  "status": "failed",
  "serviceId": "6d17705ebf2724d96da48cc349e6c12d",
  "jobId": null,
  "isBrowser": false,
  "browserAgent": "MSIE",
  "country": "US"
}

{
  "accountId": "XXXX",
  "jvmId": "YYY",
  "timestamp": 1493548969876,
  "allocationRate": 14968164,
  "usedMemHeap": 574524040,
  "usedMemNative": 1125826560,
  "usedPermGen": 64078208
}

Data point
• Timestamp
• Dimensions
  • who, where, what
  • means to select a subset of data
• Metrics
  • how many
  • measured values you are interested in
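A minimal sketch of that split, applied to the JVM metrics event from the "Data to collect" slide. Which fields count as dimensions and which as metrics is an assumption made for illustration, and `DataPoint` and `to_data_point` are hypothetical names, not part of Druid.

```python
from dataclasses import dataclass
from typing import Dict

# Assumed classification of the event fields (not stated on the slides).
DIMENSIONS = ("accountId", "jvmId")
METRICS = ("allocationRate", "usedMemHeap", "usedMemNative", "usedPermGen")


@dataclass
class DataPoint:
    timestamp: int              # epoch milliseconds
    dimensions: Dict[str, str]  # used to select a subset of data
    metrics: Dict[str, int]     # measured values to aggregate


def to_data_point(event: dict) -> DataPoint:
    """Split a flat event into timestamp, dimensions and metrics."""
    return DataPoint(
        timestamp=int(event["timestamp"]),
        dimensions={name: str(event[name]) for name in DIMENSIONS},
        metrics={name: int(event[name]) for name in METRICS},
    )


event = {
    "accountId": "XXXX",
    "jvmId": "YYY",
    "timestamp": 1493548969876,
    "allocationRate": 14968164,
    "usedMemHeap": 574524040,
    "usedMemNative": 1125826560,
    "usedPermGen": 64078208,
}
point = to_data_point(event)
```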

What smart people said • You need columnar DB!

Row based format
• Easy to insert new records
• Faster to read full record
http://cs-www.cs.yale.edu/homes/dna/talks/abadi-sigmod08-slides.pdf

Column based format
• More expensive inserts
• Can read only relevant data
http://cs-www.cs.yale.edu/homes/dna/talks/abadi-sigmod08-slides.pdf
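The trade-off between the two layouts can be illustrated with a toy in-memory sketch. It uses plain Python lists and dicts with made-up values; it is not how any real database stores data.

```python
# Row based: one dict per record. Inserting is a single append,
# and reading a full record is a single lookup.
rows = [
    {"timestamp": 1, "jvmId": "a", "usedMemHeap": 100},
    {"timestamp": 2, "jvmId": "b", "usedMemHeap": 300},
    {"timestamp": 3, "jvmId": "a", "usedMemHeap": 200},
]


def to_columns(records):
    """Pivot row records into one list per column."""
    columns = {}
    for record in records:
        # An insert has to touch every column: the "more expensive" part.
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Column based: an aggregate reads only the one relevant column.
max_heap = max(columns["usedMemHeap"])

# Row based: a full record comes back in one piece.
full_record = rows[1]
```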

Columnar databases we tried
• InfluxDB
• MonetDB
• Druid

Results
• InfluxDB
• MonetDB
• Druid

Why?
• Stability issues
• We have expertise neither in Go nor in C

Druid
• druid.io
• Open source
• Active community
• Open for extensions

Imply
• imply.io
• Packages and supports Druid
• Add-ons: Pivot, Plywood and PlyQL

Druid tribe https://imply.io/docs/latest/

Practical implications
+ Failure of a single node does not affect you much
- Very high operational overhead

Data storage
• All data is stored in files called “segments”
• Contains all the information for some period of time
  • including indices, dictionaries
• Immutable columnar format
• Can be further sharded
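To give a feel for what "indices, dictionaries" means for a string column, here is a toy sketch of dictionary encoding plus a per-value bitmap index. It only illustrates the idea: Python integers stand in for bitmaps, the country values are made up, and Druid's actual on-disk segment format is more involved.

```python
def dictionary_encode(values):
    """Replace each string by its position in a sorted dictionary."""
    dictionary = sorted(set(values))
    ids = {value: i for i, value in enumerate(dictionary)}
    return dictionary, [ids[value] for value in values]


def build_bitmaps(values):
    """For each distinct value, set bit N if row N holds that value."""
    bitmaps = {}
    for row, value in enumerate(values):
        bitmaps[value] = bitmaps.get(value, 0) | (1 << row)
    return bitmaps


country = ["US", "EE", "US", "DE"]  # made-up column values

dictionary, encoded = dictionary_encode(country)
bitmaps = build_bitmaps(country)

# A filter such as country = 'US' becomes a bitmap lookup instead of a scan.
us_rows = [row for row in range(len(country)) if (bitmaps["US"] >> row) & 1]
```

Because a segment is immutable, such structures can be built once at indexing time and never need to be maintained afterwards.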

Practical implications
+ Easy to distribute
- Cannot update individual records. Have to rebuild and replace the whole segment.

Data distribution
• Segments are held in deep storage
  • HDFS, S3, Azure, Google Cloud, Cassandra, etc.
• Coordinator tells each historical node what to load
• Historicals can be organised in tiers

Distributed query

Practical implications
+ Every single historical can die without any impact
+ Coordinator can die with very little impact
+ Separate hot and cold data
+ Trade money for speed
- None for historical :)
- Broker is a single point of failure!

Queries
• SQL-like PlyQL
• JSON over HTTP
• We have built a small DSL over that JSON format

Timeseries
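The query shown on this slide is not preserved in the transcript. As a stand-in, here is a sketch of what a native timeseries query posted as JSON over HTTP might look like. The data source name `jvm_metrics`, the interval, the filter value and the broker address are assumptions for illustration; the request is built but not sent, so the sketch runs without a cluster.

```python
import json
import urllib.request

# Assumed broker address; adjust to your own cluster.
BROKER_URL = "http://localhost:8082/druid/v2/"

# Max heap usage per minute for one JVM over one day.
timeseries_query = {
    "queryType": "timeseries",
    "dataSource": "jvm_metrics",  # hypothetical data source name
    "granularity": "minute",
    "intervals": ["2017-04-30T00:00:00Z/2017-05-01T00:00:00Z"],
    "filter": {"type": "selector", "dimension": "jvmId", "value": "YYY"},
    "aggregations": [
        {"type": "longMax", "name": "maxHeap", "fieldName": "usedMemHeap"},
    ],
}

request = urllib.request.Request(
    BROKER_URL,
    data=json.dumps(timeseries_query).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)
# urllib.request.urlopen(request) would send the query to the broker.
```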

GroupBy
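The original query on this slide is also missing from the transcript. The sketch below shows how a question like "what was the slowest service during the last flash sale?" might be phrased as a native groupBy query. The data source name `transactions` and the interval are assumptions, and it further assumes `transactionDuration` was ingested as a numeric metric.

```python
import json

# Top 5 services by total transaction duration within an assumed sale window.
group_by_query = {
    "queryType": "groupBy",
    "dataSource": "transactions",  # hypothetical data source name
    "granularity": "all",
    "intervals": ["2017-04-30T10:00:00Z/2017-04-30T12:00:00Z"],  # assumed window
    "dimensions": ["serviceId"],
    "aggregations": [
        {"type": "count", "name": "rows"},
        {
            "type": "longSum",
            "name": "totalDuration",
            "fieldName": "transactionDuration",
        },
    ],
    "limitSpec": {
        "type": "default",
        "limit": 5,
        "columns": [{"dimension": "totalDuration", "direction": "descending"}],
    },
}

# The body that would be posted to the broker as JSON over HTTP.
group_by_body = json.dumps(group_by_query, indent=2)
```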

Practical implications
+ None :)
- No joins
- JavaScript functions or custom extensions

Benchmarks are lies!
• They are all requests to our Druid for some period
• It says nothing about performance in your case

Data ingestion
• Files/Hadoop
• Stream push via Tranquility
• Stream poll via Kafka

http://druid.io/docs/0.10.0/design/indexing-service.html

Kafka indexing
• m4.xlarge, 16GB RAM, 4vCPU
• 20-60K/sec per partition, ~1.5-5B/day
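As a quick sanity check, the per-second rate converts to the daily figure as follows. This assumes the rate is sustained for a full 24 hours, which the slide does not state.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def per_day(rate_per_second: int) -> int:
    """Events per day at a constant per-second rate."""
    return rate_per_second * SECONDS_PER_DAY


low = per_day(20_000)   # lower bound of the quoted per-partition rate
high = per_day(60_000)  # upper bound of the quoted per-partition rate

print(f"{low / 1e9:.2f}B to {high / 1e9:.2f}B per day")
```

The result is roughly in line with the rounded ~1.5-5B/day quoted on the slide.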

Querying recent data

Practical implications
+ “Guaranteed” exactly-once delivery
+ New data is available immediately
- Complex machinery which easily breaks

Roll-up
• We collect data every 5 seconds
• But query with granularity 1 minute
• We can aggregate data during indexing
  • usedMemHeap -> max(usedMemHeap)
  • allocationRate -> avg(allocationRate)

{
  "accountId": "XXXX",
  "jvmId": "YYY",
  "timestamp": 1493548969876,
  "allocationRate": 14968164,
  "usedMemHeap": 574524040,
  "usedMemNative": 1125826560,
  "usedPermGen": 64078208
}
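The roll-up described above can be sketched in plain Python. This only illustrates the idea of aggregating 5-second samples into 1-minute rows and is not Druid's implementation; `roll_up` is a hypothetical helper, the sample values are made up, and the average is kept as a sum and a count.

```python
MINUTE_MS = 60_000


def roll_up(events):
    """Aggregate events into one row per (minute, accountId, jvmId)."""
    buckets = {}
    for event in events:
        # Truncate the timestamp to the start of its minute.
        minute = event["timestamp"] - event["timestamp"] % MINUTE_MS
        key = (minute, event["accountId"], event["jvmId"])
        bucket = buckets.setdefault(
            key, {"count": 0, "maxHeap": 0, "sumAlloc": 0}
        )
        bucket["count"] += 1
        bucket["maxHeap"] = max(bucket["maxHeap"], event["usedMemHeap"])
        bucket["sumAlloc"] += event["allocationRate"]
    return [
        {
            "timestamp": key[0],
            "accountId": key[1],
            "jvmId": key[2],
            "count": buckets[key]["count"],
            "usedMemHeap": buckets[key]["maxHeap"],  # max(usedMemHeap)
            "allocationRate": buckets[key]["sumAlloc"] / buckets[key]["count"],  # avg
        }
        for key in sorted(buckets)
    ]


# Twelve made-up samples, 5 seconds apart, all inside the same minute.
base = 1493548920000
events = [
    {
        "accountId": "XXXX",
        "jvmId": "YYY",
        "timestamp": base + i * 5_000,
        "usedMemHeap": 100 + i,
        "allocationRate": 10 * i,
    }
    for i in range(12)
]
rolled = roll_up(events)
```

Twelve input records collapse into one stored row, which is where the size savings come from. The individual samples can no longer be recovered.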

Roll-up results
• 1B records
• 484G in Kafka, uncompressed JSON
• 185G with unique ids
• 9.18G rolled-up data without ids
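The size reduction implied by these figures can be worked out directly. The sketch assumes all three numbers describe the same 1B records and use the same unit.

```python
raw_json_gb = 484.0   # uncompressed JSON in Kafka
with_ids_gb = 185.0   # stored with unique ids (no effective roll-up)
rolled_up_gb = 9.18   # rolled-up data without ids

ratio_vs_raw = raw_json_gb / rolled_up_gb
ratio_vs_ids = with_ids_gb / rolled_up_gb

print(f"vs raw JSON: {ratio_vs_raw:.1f}x smaller")
print(f"vs data with ids: {ratio_vs_ids:.1f}x smaller")
```

Keeping a unique id on every record makes every row distinct, so nothing can be merged. That is why dropping the ids matters so much for the final size.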

Practical implications
+ Huge savings in size
- Lost individuality

Take away
• We are quite happy with it :)
• A good tool for a quite narrow problem

Pros
• Works OK even without much tuning
• Extensible
• Easy to change “schema”

Cons
• Has operational overhead
• Effectively non-updatable
• Good only for quite specific queries

Solving performance problems is hard. We don’t think it needs to be.
@JavaPlumbr/@iNikem http://plumbr.eu
