Slide 1

Slide 1 text

Elasticsearch 6.0 is coming
Jun Ohtani (@johtani), 2017/09/11

Slide 2

Slide 2 text


Slide 3

Slide 3 text

About
• Me, Jun Ohtani / Technical Advocate
  ‒ lucene-gosen committer
  ‒ Translator of the Japanese edition of "ElasticSearch Server"
  ‒ http://blog.johtani.info
• Elasticsearch, founded in 2012
  ‒ Products: Elasticsearch, Logstash, Kibana, Beats, X-Pack, Elastic Cloud
  ‒ Professional services: Support & development subscriptions
  ‒ Trainings, Consulting, SaaS

Slide 4

Slide 4 text

Become an Elastic Pioneer
1. Download 6.0 preview release
2. Provide feedback via GitHub or Discuss forum
3. Get limited edition Pioneer swag

Slide 5

Slide 5 text

Elastic Pioneer Program
1. Download 6.0 preview release (alpha, beta, etc.)
2. Provide feedback via GitHub or Discuss forum
3. Get limited edition Pioneer swag
We want your feedback!

Slide 6

Slide 6 text

Elasticsearch

Slide 7

Slide 7 text

Brand new upgrade experience
Upgrades just got so much simpler
Upgrading to 2.x / Upgrading to 5.x / Upgrading to 6.x

Slide 8

Slide 8 text

Brand new upgrade experience
Upgrades just got so much simpler
• New Upgrade Assistant (UI & API)
• Zero downtime upgrades
  ‒ Rolling restarts from latest 5.x to 6.x
  ‒ Cross-cluster search across major versions

Slide 9

Slide 9 text

Space-saving columnar store
Tapping into Lucene 7 goodness (sparse doc values)
• Better for storing sparse fields
• Save on disk space & file system cache

user    first  middle  last    age  phone
johns   Alex           Smith
jrice   Jill   Amy     Rice         508.567.1211
mt123   Jeff           Twain   56
sadams  Sue            Adams
adoe    Amy            Doe     31
lp12    Liz            Potter
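
The saving for sparse fields can be pictured with a toy model. The sketch below is not Lucene's actual doc-values format; it only contrasts a dense column, which reserves a slot for every document, with a sparse one that records just the documents that have a value. The function names are made up for illustration, and the data is the age column from the table on this slide.

```python
from typing import List, Optional, Tuple


def dense_column(values: List[Optional[int]]) -> List[Optional[int]]:
    """Dense layout: one slot per document, even when the field is missing."""
    return list(values)


def sparse_column(values: List[Optional[int]]) -> Tuple[List[int], List[int]]:
    """Sparse layout: keep only the doc ids that have a value, plus the values."""
    doc_ids = [doc_id for doc_id, v in enumerate(values) if v is not None]
    present = [v for v in values if v is not None]
    return doc_ids, present


# "age" for johns, jrice, mt123, sadams, adoe, lp12 (None = field not set)
ages = [None, None, 56, None, 31, None]

dense = dense_column(ages)
doc_ids, present = sparse_column(ages)
print(len(dense))        # number of slots the dense layout reserves
print(doc_ids, present)  # documents that have an age, and those ages
```

With mostly empty columns, the sparse form grows with the number of values rather than the number of documents, which is the intuition behind the disk and cache savings claimed here.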

Slide 10

Slide 10 text

Much speedier sorted queries
Tapping into Lucene 7 goodness (index sorting)
Sort at index time vs. query time
Optimize on-disk format for some use cases
Improve query performance at the cost of index performance

5.x: Query for top 3 player scores
  Player 1     Score: 600
  Player 2     Score: 0
  Player 3     Score: 200
  Player 4     Score: 700
  Player 5     Score: 300
  ...
  Player 1907  Score: 800

6.x: Query for top 3 player scores
  Player 1907  Score: 800
  Player 4     Score: 700
  Player 1     Score: 600
  Player 5     Score: 300
  Player 3     Score: 200
  Player 2     Score: 0
  ...
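
A rough sketch of why sorting at index time helps, using only the six players shown on this slide. The `index_body` dictionary uses the `index.sort.field` and `index.sort.order` settings; the `score` field name and the `doc` type are assumptions for illustration. The two helper functions are a toy model of the query side, not Elasticsearch internals.

```python
import heapq
from typing import List, Tuple

Player = Tuple[str, int]  # (name, score)

# Assumed index creation body; "score" and the "doc" type are illustrative.
index_body = {
    "settings": {"index": {"sort.field": "score", "sort.order": "desc"}},
    "mappings": {"doc": {"properties": {"score": {"type": "integer"}}}},
}


def top_n_query_time(docs: List[Player], n: int) -> Tuple[List[Player], int]:
    """5.x style: every document is visited to find the top n."""
    top = heapq.nlargest(n, docs, key=lambda p: p[1])
    return top, len(docs)


def top_n_index_sorted(sorted_docs: List[Player], n: int) -> Tuple[List[Player], int]:
    """6.x style: documents are already stored by descending score, so stop after n."""
    top = sorted_docs[:n]
    return top, len(top)


players = [
    ("Player 1", 600), ("Player 2", 0), ("Player 3", 200),
    ("Player 4", 700), ("Player 5", 300), ("Player 1907", 800),
]
# Sorting happens once, at index time, which is the extra indexing cost.
players_sorted = sorted(players, key=lambda p: p[1], reverse=True)

top_5x, visited_5x = top_n_query_time(players, 3)
top_6x, visited_6x = top_n_index_sorted(players_sorted, 3)
print(top_5x == top_6x, visited_5x, visited_6x)
```

Both paths return the same top three, but the pre-sorted path touches three documents instead of all of them.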

Slide 11

Slide 11 text

Large Improvements to Replication
New operation-based approach to recovery (sequence numbers)
• Limit syncs to only changed documents (instead of file-based recovery)
• Fast replica recovery after temporary unavailability (network issues, etc.)
• Re-sync on primary failure
• Laying foundation for future big league features
  ‒ Cross-datacenter replication
  ‒ Changes API (tbd)
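
The idea of operation-based recovery can be sketched as follows. This is a simplified model, assuming operations are applied in order with no gaps; the `Op` and `Shard` classes and `ops_to_replay` are hypothetical names, not Elasticsearch classes. A replica that was briefly offline only needs the operations whose sequence number is above its last applied one, instead of a copy of the files.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Op:
    seq_no: int
    doc_id: str
    source: Optional[dict]  # None models a delete


@dataclass
class Shard:
    docs: Dict[str, dict] = field(default_factory=dict)
    local_checkpoint: int = -1  # highest sequence number applied so far

    def apply(self, op: Op) -> None:
        if op.source is None:
            self.docs.pop(op.doc_id, None)
        else:
            self.docs[op.doc_id] = op.source
        self.local_checkpoint = op.seq_no


def ops_to_replay(history: List[Op], replica_checkpoint: int) -> List[Op]:
    """Return only the operations the replica has not seen yet."""
    return [op for op in history if op.seq_no > replica_checkpoint]


history = [
    Op(0, "a", {"v": 1}), Op(1, "b", {"v": 1}), Op(2, "a", {"v": 2}),
    Op(3, "b", None), Op(4, "c", {"v": 1}),
]

primary = Shard()
for op in history:
    primary.apply(op)

replica = Shard()
for op in history[:3]:  # replica drops off the network after seq_no 2
    replica.apply(op)

missing = ops_to_replay(history, replica.local_checkpoint)
for op in missing:
    replica.apply(op)

print(len(missing), replica.docs == primary.docs)
```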

Slide 12

Slide 12 text

Breaking changes
Because a major release is the time for major cleanup
• Improved tools to handle breaking changes
  ‒ Deprecation logging
  ‒ Upgrade Assistant (UI & APIs)
• Refer to Release Notes for complete list
• Test, test, test

Slide 13

Slide 13 text

Simpler data models with type removal
Say goodbye to _type confusion
• Breaking change
• Gradual migration path
  ‒ 6.0 indices can be created with only one type
  ‒ Existing 5.x indices using _type will continue to function
• Introducing new APIs for type-less operations
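
One possible way to fold a multi-type index into a single type is to keep the old type name in an ordinary field and prefix the document id with it so ids stay unique. This is an assumption for illustration, not something stated on the slide; `to_single_type` and the `type` field name are hypothetical.

```python
from typing import Dict, Tuple


def to_single_type(doc_type: str, doc_id: str, source: Dict) -> Tuple[str, Dict]:
    """Map a (type, id, source) triple onto a single-type index.

    The old type is kept as a plain field, and the id is prefixed with it
    so that documents from different types cannot collide.
    """
    new_source = dict(source)  # do not mutate the caller's document
    new_source["type"] = doc_type
    return f"{doc_type}-{doc_id}", new_source


new_id, new_doc = to_single_type("user", "kimchy", {"name": "Shay"})
print(new_id, new_doc)
```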

Slide 14

Slide 14 text

Some interesting changes
• Rename template to index_patterns in _template
• Content-Type detection disabled
  ‒ Set explicit Content-Type in request header
• Deprecation of _all
  ‒ _all can no longer be configured for indices in 6.0
  ‒ Use all_fields in query
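
The first two changes can be sketched together. `migrate_template` is a hypothetical helper that rewrites a 5.x template body by moving the single `template` pattern into an `index_patterns` list. The request is only constructed, never sent; the URL and template name are placeholders.

```python
import json
from urllib.request import Request


def migrate_template(body: dict) -> dict:
    """Return a copy of a 5.x template body using index_patterns instead of template."""
    migrated = dict(body)
    if "template" in migrated:
        migrated["index_patterns"] = [migrated.pop("template")]
    return migrated


old = {"template": "logs-*", "settings": {"number_of_shards": 1}}
new = migrate_template(old)

# 6.0 no longer guesses the body format, so the header is set explicitly.
req = Request(
    "http://localhost:9200/_template/logs",  # placeholder host and template name
    data=json.dumps(new).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="PUT",
)
print(new)
print(req.get_method(), req.header_items())
```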

Slide 15

Slide 15 text

Some interesting changes
• <= 2.x indices need to be reindexed
  ‒ Re-index into 5.x or 6.0 cluster
• Deprecate Groovy, Python, JavaScript lang plugins
  ‒ Rewrite scripts in Painless
• Java High Level REST Client
  ‒ Starting from version 5.6.0, a new Java client has been released.
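
For moving old indices into a new cluster, one option is the reindex API with a remote source. The sketch below only builds the request body that would be sent to `_reindex` on the new cluster; the host and index names are placeholders, and `reindex_from_remote_body` is a hypothetical helper. The remote host would also need to be allowed via `reindex.remote.whitelist` on the destination cluster.

```python
import json


def reindex_from_remote_body(remote_host: str, source_index: str, dest_index: str) -> dict:
    """Build a _reindex body that pulls documents from another cluster."""
    return {
        "source": {"remote": {"host": remote_host}, "index": source_index},
        "dest": {"index": dest_index},
    }


body = reindex_from_remote_body("http://old-cluster:9200", "tweets", "tweets")
print(json.dumps(body, indent=2))
```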

Slide 16

Slide 16 text

Some interesting changes
• Preventing full disks
• Search scalability
  ‒ Introduce max_concurrent_shard_requests
• Improve Synonym filter
  ‒ Use tokenizer and token filters before synonym_token
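
`max_concurrent_shard_requests` is passed as a parameter on the search request. A minimal sketch that only builds the URL, with a placeholder host and index pattern; `search_url` is a hypothetical helper and the value 3 is arbitrary.

```python
from urllib.parse import urlencode


def search_url(base: str, index: str, max_concurrent_shard_requests: int) -> str:
    """Build a _search URL that caps how many shard requests run concurrently."""
    query = urlencode({"max_concurrent_shard_requests": max_concurrent_shard_requests})
    return f"{base}/{index}/_search?{query}"


url = search_url("http://localhost:9200", "logs-*", 3)
print(url)
```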

Slide 17

Slide 17 text

Try 6.0 Beta!