1/10 of a version, 10x the punch - coming features in ES 1.0

Boaz Leskes @bleskes
1/10 of a version, 10x the punch: coming features in 1.0

1.0 RC1

So… what’s coming?
• Aggregations (best thing since Lego blocks)
• _cat API (feline love for the dev op)
• Distributed Percolation (put some nitro in your coffee)
• Snapshot & Restore (point in time, API driven backup)
• Federated search (get your results from multiple clusters)
• many, many more (memory circuit breaker, geo points compression, major improvement in allocation decision speed, …)

Aggregations

What’s wrong with facets? Nothing, it’s just that we want more…

John’s report card

curl -X GET 'localhost:9200/scores/_search/' -d '{
  "query" : { "match" : { "student" : "john" } },
  "facets" : {
    "subjects" : { "terms" : { "field" : "subject" } }
  }
}'

curl -X GET 'localhost:9200/scores/_search/' -d '{
  "query" : { "match" : { "student" : "john" } },
  "facets" : {
    "scores" : { "statistical" : { "field" : "score" } }
  }
}'

John’s report card

curl -X GET 'localhost:9200/scores/_search/?search_type=count&pretty' -d '{
  "query" : { "match" : { "student" : "john" } },
  "facets" : {
    "scores-per-subject" : {
      "terms_stats" : { "key_field" : "subject", "value_field" : "score" }
    }
  }
}'

"facets" : {
  "scores-per-subject" : {
    "_type" : "terms_stats",
    "missing" : 0,
    "terms" : [
      { "term" : "math", "count" : 1, "total_count" : 1, "min" : 85.0, "max" : 85.0, "total" : 85.0, "mean" : 85.0 },
      ...
    ]
  }
}

John’s report card, agg style

curl -X GET 'localhost:9200/scores/_search/' -d '{
  "query" : { "match" : { "student" : "john" } },
  "aggs" : {
    "scores-per-subject" : {
      "terms" : { "field" : "subject" },
      "aggs" : {
        "avg_score" : { "avg" : { "field" : "score" } }
      }
    }
  }
}'

"aggregations" : {
  "scores-per-subject" : {
    "terms" : [
      { "term" : "math", "doc_count" : 1, "avg_score" : { "value" : 85.0 } },
      ...
    ]
  }
}
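To make the bucket-plus-metric idea concrete, here is a minimal Python sketch of what a terms aggregation with an avg sub-aggregation computes. It runs locally and does not talk to Elasticsearch. The function name terms_with_avg and the sample documents are assumptions made for illustration, and the output only mimics the shape of the response shown on the slide.

```python
from collections import defaultdict

def terms_with_avg(docs, bucket_field, value_field, agg_name="avg_score"):
    """Group docs by bucket_field, then average value_field inside each bucket."""
    buckets = defaultdict(list)
    for doc in docs:
        buckets[doc[bucket_field]].append(doc[value_field])
    # One entry per bucket, largest buckets first, ties broken by term.
    ordered = sorted(buckets.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return [
        {"term": term, "doc_count": len(values),
         agg_name: {"value": sum(values) / len(values)}}
        for term, values in ordered
    ]

# Hypothetical sample data, not taken from the talk.
johns_scores = [
    {"student": "john", "subject": "math", "score": 85.0},
    {"student": "john", "subject": "history", "score": 70.0},
    {"student": "john", "subject": "history", "score": 90.0},
]
report = terms_with_avg(johns_scores, "subject", "score")
```

The bucketing step (terms) and the metric step (avg) are independent, which is why they can be combined freely.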

John has graduated…

curl -X GET 'localhost:9200/scores/_search/' -d '{
  "query" : { "match" : { "student" : "john" } },
  "aggs" : {
    "scores-per-subject" : {
      "terms" : { "field" : "subject" },
      "aggs" : {
        "avg_score_by_year" : {
          "date_histogram" : { "field" : "date", "interval" : "year", "format" : "yyyy" },
          "aggs" : {
            "avg_score" : { "avg" : { "field" : "score" } }
          }
        }
      }
    }
  }
}'

"aggregations" : {
  "scores-per-subject" : {
    "terms" : [
      { "term" : "math", "doc_count" : 1, "avg_score_by_year" : [ { "key_as_string" : "2013", "avg_score" : { "value" : 85.0 } }, … ] },
      ...
    ]
  }
}
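The nesting in this request follows a regular pattern: each level is an aggregation that carries the next level under its own "aggs" key. A small helper can build such a body from a flat list of levels. This is only a sketch; nest_aggs is a hypothetical name, not part of any Elasticsearch client.

```python
import json

def nest_aggs(*levels):
    """Nest (name, type, params) levels; each becomes a sub-aggregation of the previous one."""
    body = None
    for name, agg_type, params in reversed(levels):
        agg = {agg_type: params}
        if body is not None:
            agg["aggs"] = body  # attach the deeper levels built so far
        body = {name: agg}
    return body

aggs = nest_aggs(
    ("scores-per-subject", "terms", {"field": "subject"}),
    ("avg_score_by_year", "date_histogram",
     {"field": "date", "interval": "year", "format": "yyyy"}),
    ("avg_score", "avg", {"field": "score"}),
)
request_body = json.dumps(
    {"query": {"match": {"student": "john"}}, "aggs": aggs}, indent=2
)
```

Passing request_body to curl with -d should be equivalent to the hand-written request, assuming the cluster accepts the same fields as on the slide.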

_cat API

What’s wrong with JSON? Nothing, it’s just that we are not smart enough to read it.

Who is the master?

curl "localhost:9200/_cluster/state?pretty&filter_metadata=true&filter_routing_table=true"

{
  "cluster_name" : "elasticsearch",
  "master_node" : "GNf0hEXlTfaBvQXKBF300A",
  "blocks" : { },
  "nodes" : {
    "ObdRqLHGQ6CMI5rOEstA5A" : { "name" : "Triton", … },
    "4C7pKbfhTvu0slcSy_G4_w" : { "name" : "Kid Colt", … },
    "GNf0hEXlTfaBvQXKBF300A" : { "name" : "Lang, Steven", … }
  }

Who is the master? _cat style

boaz-air:elasticsearch$: curl localhost:9200/_cat/master
GNf0hEXlTfaBvQXKBF300A 10.0.1.13 Lang, Steven
boaz-air:elasticsearch$:
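The _cat output is meant for humans, but it is also easy to script against. The sketch below splits the line shown on the slide into its three columns. It assumes exactly the layout printed here (node id, IP, node name); other versions may print a different set of columns. parse_cat_master is a hypothetical helper name.

```python
def parse_cat_master(line):
    """Split one _cat/master line into id, ip and name (the name may contain spaces)."""
    node_id, ip, name = line.strip().split(None, 2)
    return {"id": node_id, "ip": ip, "name": name}

master = parse_cat_master("GNf0hEXlTfaBvQXKBF300A 10.0.1.13 Lang, Steven\n")
```

Limiting the split to two cuts keeps a node name such as "Lang, Steven" in one piece.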

Distributed Percolation

curl -XPUT "localhost:9200/twitter/.percolator/es-tweets" -d '{
  "query" : { "match" : { "body" : "elasticsearch" } }
}'

$ curl -XGET "localhost:9200/twitter/_percolate" -d '{
  "doc" : {
    "body" : "#elasticsearch is awesome",
    "nick" : "@imotov",
    "name" : "Igor Motov",
    "date" : "2013-11-03"
  }
}'

{
  …
  "matches" : [
    { "_index" : "twitter", "_id" : "es-tweets" }
  ]
}
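Percolation turns search around: queries are stored first, and each incoming document is checked against them. The toy model below illustrates that idea in plain Python. It is a sketch under strong assumptions: only single-term match queries are supported, tokenization is a simple lowercase word split, and register and percolate are hypothetical names, not Elasticsearch APIs.

```python
import re

registered = {}  # query id -> (field, term)

def register(query_id, field, term):
    """Store a single-term match query under an id."""
    registered[query_id] = (field, term.lower())

def percolate(doc):
    """Return the ids of all stored queries that match the document."""
    matches = []
    for query_id, (field, term) in sorted(registered.items()):
        tokens = re.findall(r"\w+", str(doc.get(field, "")).lower())
        if term in tokens:
            matches.append(query_id)
    return matches

register("es-tweets", "body", "elasticsearch")
hits = percolate({"body": "#elasticsearch is awesome", "nick": "@imotov"})
```

Here hits contains "es-tweets", mirroring the "matches" array in the response above.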

Huh? Why is it useful? What our users do:
• Alerting
• Price pointing
• Contextual advertisement
• Classifications

So what’s in distribution?
• Highlighting
• Sorting
• Multi-Index support
• Aggregations
• Multi-Percolate

Snapshot & Restore

Backup, 0.90 style
1. disable flush
2. find all primary shard locations (optional)
3. copy files from primary shards (rsync)
4. enable flush

Backup, 1.0 style

curl -XPUT "localhost:9200/_snapshot/my_backup/snapshot_20140101"

Register a repository

curl -XPUT "localhost:9200/_snapshot/my_backup" -d '{
  "type" : "fs",
  "settings" : { "location" : "/mnt/es-test-repo" }
}'

Creating a Snapshot

curl -XPUT "localhost:9200/_snapshot/my_backup/snapshot_20140101" -d '{
  "indices" : "+test_*,-test_4"
}'
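The "indices" expression reads left to right: "+test_*" adds every index matching the wildcard, and "-test_4" then removes that one. The Python sketch below models this with shell-style wildcards. It is a simplified illustration, not Elasticsearch's actual resolution logic, and resolve_indices plus the index names are made up for the example.

```python
from fnmatch import fnmatchcase

def resolve_indices(expression, existing):
    """Apply +pattern / -pattern parts in order and return the selected index names."""
    selected = set()
    for part in expression.split(","):
        part = part.strip()
        if not part:
            continue
        exclude = part.startswith("-")
        pattern = part[1:] if part[0] in "+-" else part
        hits = {name for name in existing if fnmatchcase(name, pattern)}
        selected = selected - hits if exclude else selected | hits
    return sorted(selected)

# Hypothetical index names.
chosen = resolve_indices("+test_*,-test_4", ["test_1", "test_2", "test_4", "prod_1"])
```

With these names, chosen holds test_1 and test_2: prod_1 never matches and test_4 is excluded.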

Restore, 0.90 style
1. close the index (shut down the cluster)
2. find all existing index shards
3. replace all index shards with data from backup
4. open the index (start the cluster)

Restore, 1.0 style

curl -XPOST "localhost:9200/test_*/_close"

curl -XPOST "localhost:9200/_snapshot/my_backup/snapshot_20140101/_restore" -d '{
  "indices" : "test_*"
}'

thanks!
