Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Elasticsearch 6.0 is coming at CA.io

657aeeff3fc467567dacebf8a1ea0b23?s=47 Jun Ohtani
October 04, 2017

Elasticsearch 6.0 is coming at CA.io

「マッチングサービスを支えるElasticsearch」で登壇した時の資料になります。

657aeeff3fc467567dacebf8a1ea0b23?s=128

Jun Ohtani

October 04, 2017
Tweet

Transcript

  1. 1 Jun Ohtani 2017/09/11 @johtani Elasticsearch 6.0 is coming

  2. ‹#›

  3. about • Me, Jun Ohtani / Technical Advocate ‒ lucene-gosenίϛολʔ

    ‒ ElasticSearch Server೔ຊޠ൛ͷ຋༁ ‒ http://blog.johtani.info
 • Elasticsearch, founded in 2012 ‒ Products: Elasticsearch, Logstash, Kibana, Beats 
 X-Pack, Elastic Cloud
 Professional services: Support & development subscriptions ‒ Trainings, Consulting, SaaS 3
  4. 4 Become an Elastic Pioneer Download 6.0 preview release Provide

    feedback via GitHub or Discuss forum Get limited edition Pioneer swag 1 2 3
  5. 5 Elastic Pioneer Program Download 6.0 preview release (alpha, beta,

    etc) Provide feedback via GitHub or Discuss forum Get limited edition Pioneer swag 1 2 3 We want your feedback!
  6. 6 Elasticsearch

  7. 7 Brand new upgrade experience Upgrades just got oh so

    simpler Upgrading to 2.x Upgrading to 5.x Upgrading to 6.x
  8. 8 Brand new upgrade experience • New Upgrade Assistant (UI

    & API) • Zero downtime upgrades ‒Rolling restarts from latest 5.x to 6.x ‒Cross-cluster search across major version Upgrades just got oh so simpler
  9. 9 Space-saving columnar store • Better for storing sparse fields

    • Save on disk space & file system cache Tapping into Lucene 7 goodness (sparse doc value) user first middle last age phone johns Alex Smith jrice Jill Amy Rice 508.567.121 1 mt123 Jeff Twain 56 sadams Sue Adams adoe Amy Doe 31 lp12 Liz Potter
  10. 10 Much speedier sorted queries Tapping into Lucene 7 goodness

    (index sorting) Player 1 Score: 600 5.x Query for top 3 player scores Player 2 Score: 0 Player 3 Score: 200 Player 4 Score: 700 Player 5 Score: 300 Player 1907 Score: 800 ... Query for top 3 player scores ... Player 1907 Score: 800 Player 4 Score: 700 Player 1 Score: 600 Player 5 Score: 300 Player 3 Score: 200 Player 2 Score: 0 6.x Sort at index time vs. query time Optimize on-disk format for some use cases Improve query performance at the cost of index performance
  11. 11 Large Improvements to Replication • Limit syncs to only

    changed documents (instead of file-based recovery) • Fast replica recovery after temporary unavailability (network issues, etc.) • Re-sync on primary failure • Laying foundation for future big league features ‒Cross-datacenter replication ‒Changes API (tbd) New operation-based approach to recovery (sequence numbers)
  12. 12 Breaking changes • Improved tools to handle breaking changes

    ‒Deprecation logging ‒Upgrade Assistant (UI & APIs) • Refer to Release Notes for complete list • Test, test, test Because major releases is time for major cleanup
  13. 13 Simpler data models with type removal • Breaking change

    • Gradual migration path ‒ 6.0 indices can be created with only one type ‒ Existing 5.x indices using _type will continue to function • Introducing new APIs for type-less operations Say goodbye to _type confusion
  14. 14 Some interesting changes • Rename template to index_patterns in

    _template • Content-Type detection disabled • Set explicit Content-Type in request header • Deprecation of _all • _all can no longer be configured for indices in 6.0 • Use all_fields in query
  15. 15 Some interesting changes • <= 2.x indices need to

    be reindexed • Re-index into 5.x or 6.0 cluster • Deprecate Groovy, Python, Javascript lang plugin • Rewrite scripts in plainless • Java High Level REST Client • Starting from version 5.6.0 a new Java client has been released.
  16. 16 Some interesting changes • Preventing full disks • Search

    scalability • Introduce max_concurrent_shard_requests • Improve Synonym filter • Use tokenizer and tokenfilters before synonym_token
  17. 17 Try 6.0 Beta!