Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
You're not using Elasticsearch?
Search
Timon Vonk
March 13, 2014
Technology
2
270
You're not using Elasticsearch?
An introduction to Elasticsearch 1.0 and how Tolq uses it for its translation memory
Timon Vonk
March 13, 2014
Tweet
Share
Other Decks in Technology
See All in Technology
解析の定理証明実践@Lean 4
dec9ue
1
190
BrainPadプログラミングコンテスト記念LT会2025_社内イベント&問題解説
brainpadpr
1
170
LangChain Interrupt & LangChain Ambassadors meetingレポート
os1ma
2
200
MySQL5.6から8.4へ 戦いの記録
kyoshidaxx
1
290
Geminiとv0による高速プロトタイピング
shinya337
0
170
プロダクトエンジニアリング組織への歩み、その現在地 / Our journey to becoming a product engineering organization
hiro_torii
0
130
Model Mondays S2E03: SLMs & Reasoning
nitya
0
230
Oracle Cloud Infrastructure:2025年6月度サービス・アップデート
oracle4engineer
PRO
2
310
Amazon Bedrockで実現する 新たな学習体験
kzkmaeda
2
650
【5分でわかる】セーフィー エンジニア向け会社紹介
safie_recruit
0
26k
フィンテック養成勉強会#54
finengine
0
200
自律的なスケーリング手法FASTにおけるVPoEとしてのアカウンタビリティ / dev-productivity-con-2025
yoshikiiida
0
220
Featured
See All Featured
Practical Orchestrator
shlominoach
188
11k
Docker and Python
trallard
44
3.4k
The Power of CSS Pseudo Elements
geoffreycrofte
77
5.8k
Documentation Writing (for coders)
carmenintech
72
4.9k
A Modern Web Designer's Workflow
chriscoyier
694
190k
Adopting Sorbet at Scale
ufuk
77
9.4k
Scaling GitHub
holman
459
140k
Writing Fast Ruby
sferik
628
62k
Why You Should Never Use an ORM
jnunemaker
PRO
58
9.4k
Building Flexible Design Systems
yeseniaperezcruz
328
39k
YesSQL, Process and Tooling at Scale
rocio
173
14k
Thoughts on Productivity
jonyablonski
69
4.7k
Transcript
You’re not using ElasticSearch? Timon Vonk (@timonvonk)
About me • CTO @ Tolq, zero effort website translations
• Freelance hacker and consultant
None
The battlefield • Ferret • Xapian • Solr • Postgresql’s
TSVectors • Sphinx • Many more
What is ElasticSearch? • Search engine • Cloud in mind
• JSON API • Scriptable • Nosql • Great for things different than search
Why use it? • Cloud setup out of the box
• Fast indexing • Easy API • On the fly mappings • Very customisable
OMG WHAT IS LUCENE • Java library for search •
Only handles the search bit • Terms based vector algorithm
Making a query • Just send JSON: GET localhost:9200/example/peanuts/_search {
‘query’: { text: { ‘my_field’: ’many search terms’ }}} { took: 5, timed_out: false, _shards: { total: 5, successful: 5, failed: 0 }, hits: [ { _index: “example”, _type: “peanuts”, _score: 0.9, _source: { …data } } ] } }
Other types of queries • Terms, full text, boolean, fuzzy,
geolocation and lots more variants • Filters, aggregations, percolation, suggestions
Analysing • Pre-index and pre-search • This is when scoring
happens • Remove stop words, stemming, other normalisations • You can create your own analysers
Aggregations { “query”: … } { “aggregations”: { “rubyist_stats”: {
“stats”: { “field”: “meetup_visits” } } } } Aggregations are an upgrade over < 1.0 facets
Different kinds • min, max, avg, sum • stats, extended_stats
(all of the above + stdev/ mean) • percentile • counts • … all scriptable!
Also buckets! • Create subsets based on conditions • Nest
aggregations • … aaand scriptable
(Ruby) libraries • Good old Tire deprecated • Stretcher! •
elasticsearch-ruby • Also libraries for Go, Node, Javascript, etc • … it’s just json
How Tolq uses ElasticSearch • Suggest better translations for translators
• Fast access to text and translations for general search
Other fun stuff • Kibana • Hadoop • Hosted, autoscaling
providers
Also, it works great for text search! Thanks! We’ve launched!
http://www.tolq.com And hiring!