Not all Nodes are Created Equal - Scaling Elasticsearch

Slide 1

Slide 1 text

Scaling Elasticsearch Not all Nodes are Created Equal Boaz Leskes @bleskes

Slide 2

Slide 2 text

Copyright Elasticsearch 2014. Copying, publishing and/or distributing without written permission is strictly prohibited Elasticsearch • Real time Search and Analytics Engine • Schema-free, REST & JSON based document store • Distributed and horizontally scalable • Open Source: Apache License 2.0 • Zero configuration • Written in Java, extensible

Slide 3

Slide 3 text

Slide 4

Slide 4 text

Copyright Elasticsearch 2014. Copying, publishing and/or distributing without written permission is strictly prohibited Big Numbers Are No Fun they cost money, time and hairTM

Slide 5

Slide 5 text

Copyright Elasticsearch 2014. Copying, publishing and/or distributing without written permission is strictly prohibited so we want to know that…. • that money needs to be spent • but also that we’re safe

Slide 6

Slide 6 text

Copyright Elasticsearch 2014. Copying, publishing and/or distributing without written permission is strictly prohibited So how do we go from • I need to index 500GB (or 500MB) per day of application data • I need to serve 10.000 (or 3) requests per seconds

Slide 7

Slide 7 text

Copyright Elasticsearch 2014. Copying, publishing and/or distributing without written permission is strictly prohibited To… • I need 20 nodes (or 2). • With SSDs (or maybe spinning disks are fine) • With 64GB (8GB) each.

Slide 8

Slide 8 text

the essentials How Elasticsearch Works

Slide 9

Slide 9 text

Each node: • stores data: • indexes, stores and searches   data  • can become master: • performs cluster   administration  • receives requests: • coordination and response merging 3 nodes node node node

Slide 10

Slide 10 text

data  node data  node data  node data  node data  node data  node data  node data  node master  node master  node master  node role separation data  node data  node client  node client  node client  node client  node

Slide 11

Slide 11 text

Lots of data == Sharding index shard 3 shard 4 shard 1 shard 2 node node shard 3 shard 1 shard 4 shard 2 node node copy 1 copy 4 copy 3 copy 2

Slide 12

Slide 12 text

More nodes, less sharing node node copy 1 copy 3 node node node node node node shard 1 shard 3 copy 4 copy 2 shard 4 shard 2 shard 3 copy 4 copy 2 shard 4

Slide 13

Slide 13 text

indexing single doc shard 4 shard 1 # curl -XPUT localhost:9200/index1/type/id -d { f: 1 } any node shard 2 shard 3

Slide 14

Slide 14 text

bulk indexing shard 4 shard 1 # curl -XPUT localhost:9200/index1/type/_bulk -d …… any node shard 2 shard 3 more shards == scaling without sharing resources

Slide 15

Slide 15 text

search shard 4 shard 1 any node shard 2 shard 3 # curl localhost:9200/index/_search?q=something

Slide 16

Slide 16 text

search - more replicas shard 4 shard 1 any node shard 2 shard 3 # curl localhost:9200/index/_search?q=something shard 4 shard 1 shard 2 shard 3 # curl localhost:9200/index/_search?q=something more replicas == scaling without sharing resources

Slide 17

Slide 17 text

search - single shard, single request shard nosql 128 New York lat=6.9 lon=50 F 2 6 8 48 112 379 6 9 10 48 11 13 14 134 207 6 9 2 4 9 36 103 310 search time ~ max length of lists ~ number of docs in shard

Slide 18

Slide 18 text

in short • indexing: • # shards → higher throughput • searching: • # shards → more data (ﬁxed latency) • # replicas → higher throughput • and: • shard capacity is the base metric

Slide 19

Slide 19 text

Sizing a shard measurement

Slide 20

Slide 20 text

shard size shard node single indexer single searcher doc mix query mix search takes 160ms data time shard size

Slide 21

Slide 21 text

shard throughput (version 1) shard node scale indexers ﬁxed multiple searchers query mix known size max docs/sec

Slide 22

Slide 22 text

shard throughput (version 2) shard node scale searchers known size ﬁxed multiple indexers doc mix max q/sec

Slide 23

Slide 23 text

what did we learn? • Max Shard Capacity • dictated by latency • (Max) Shard/Node Throughput • indexing • searching • Resources needed to support a shard   under required load • CPU • Memory • IO

Slide 24

Slide 24 text

what does it tell us? • How many shards we need • to ﬁt the data • to support our indexing/searching requirement • How many shards we can put on and node • if we didn’t max out resources • How many nodes we need

Slide 25

Slide 25 text

do more with less? • Tweak Queries / data structures • measure the effect • Invest in your real bottleneck • faster storage • more memory • more cores • Use your resources efﬁciently • dedicated nodes for hot indices • shared nodes for old data • Sometimes, you just need those nodes

Slide 26

Slide 26 text

keep it simple • a few nodes can take you a surprising long way • defaults == predictable • Dedicated master nodes • as soon as you start to grow • Even simple experiments teach your a lot

Slide 27

Slide 27 text

thank you! http://elasticsearch.com/support @elasticsearch , @bleskes http://elasticsearch.org/resources