$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
You're not using Elasticsearch?
Search
Timon Vonk
March 13, 2014
Technology
2
280
You're not using Elasticsearch?
An introduction to Elasticsearch 1.0 and how Tolq uses it for its translation memory
Timon Vonk
March 13, 2014
Tweet
Share
Other Decks in Technology
See All in Technology
Bedrock AgentCore Evaluationsで学ぶLLM as a judge入門
shichijoyuhi
2
250
AI with TiDD
shiraji
1
280
[Neurogica] 採用ポジション/ Recruitment Position
neurogica
1
120
「もしもデータ基盤開発で『強くてニューゲーム』ができたなら今の僕はどんなデータ基盤を作っただろう」
aeonpeople
0
240
Oracle Database@Azure:サービス概要のご紹介
oracle4engineer
PRO
2
200
特別捜査官等研修会
nomizone
0
570
Oracle Database@Google Cloud:サービス概要のご紹介
oracle4engineer
PRO
1
760
フルカイテン株式会社 エンジニア向け採用資料
fullkaiten
0
9.9k
MariaDB Connector/C のcaching_sha2_passwordプラグインの仕様について
boro1234
0
1k
AWS re:Invent 2025~初参加の成果と学び~
kubomasataka
0
190
Microsoft Agent Frameworkの可観測性
tomokusaba
1
110
SREが取り組むデプロイ高速化 ─ Docker Buildを最適化した話
capytan
0
140
Featured
See All Featured
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
26
3.3k
Odyssey Design
rkendrick25
PRO
0
440
Automating Front-end Workflow
addyosmani
1371
200k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.6k
Deep Space Network (abreviated)
tonyrice
0
21
A Tale of Four Properties
chriscoyier
162
23k
Balancing Empowerment & Direction
lara
5
820
Accessibility Awareness
sabderemane
0
24
Art, The Web, and Tiny UX
lynnandtonic
304
21k
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
Leo the Paperboy
mayatellez
0
1.3k
Designing for Performance
lara
610
69k
Transcript
You’re not using ElasticSearch? Timon Vonk (@timonvonk)
About me • CTO @ Tolq, zero effort website translations
• Freelance hacker and consultant
None
The battlefield • Ferret • Xapian • Solr • Postgresql’s
TSVectors • Sphinx • Many more
What is ElasticSearch? • Search engine • Cloud in mind
• JSON API • Scriptable • Nosql • Great for things different than search
Why use it? • Cloud setup out of the box
• Fast indexing • Easy API • On the fly mappings • Very customisable
OMG WHAT IS LUCENE • Java library for search •
Only handles the search bit • Terms based vector algorithm
Making a query • Just send JSON: GET localhost:9200/example/peanuts/_search {
‘query’: { text: { ‘my_field’: ’many search terms’ }}} { took: 5, timed_out: false, _shards: { total: 5, successful: 5, failed: 0 }, hits: [ { _index: “example”, _type: “peanuts”, _score: 0.9, _source: { …data } } ] } }
Other types of queries • Terms, full text, boolean, fuzzy,
geolocation and lots more variants • Filters, aggregations, percolation, suggestions
Analysing • Pre-index and pre-search • This is when scoring
happens • Remove stop words, stemming, other normalisations • You can create your own analysers
Aggregations { “query”: … } { “aggregations”: { “rubyist_stats”: {
“stats”: { “field”: “meetup_visits” } } } } Aggregations are an upgrade over < 1.0 facets
Different kinds • min, max, avg, sum • stats, extended_stats
(all of the above + stdev/ mean) • percentile • counts • … all scriptable!
Also buckets! • Create subsets based on conditions • Nest
aggregations • … aaand scriptable
(Ruby) libraries • Good old Tire deprecated • Stretcher! •
elasticsearch-ruby • Also libraries for Go, Node, Javascript, etc • … it’s just json
How Tolq uses ElasticSearch • Suggest better translations for translators
• Fast access to text and translations for general search
Other fun stuff • Kibana • Hadoop • Hosted, autoscaling
providers
Also, it works great for text search! Thanks! We’ve launched!
http://www.tolq.com And hiring!