Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
You're not using Elasticsearch?
Search
Timon Vonk
March 13, 2014
Technology
2
270
You're not using Elasticsearch?
An introduction to Elasticsearch 1.0 and how Tolq uses it for its translation memory
Timon Vonk
March 13, 2014
Tweet
Share
Other Decks in Technology
See All in Technology
AWS Lambda のトラブルシュートをしていて思うこと
kazzpapa3
2
200
SREが投資するAIOps ~ペアーズにおけるLLM for Developerへの取り組み~
takumiogawa
2
740
Terraform Stacks入門 #HashiTalks
msato
0
360
Platform Engineering for Software Developers and Architects
syntasso
1
530
Storybook との上手な向き合い方を考える
re_taro
5
1.4k
[CV勉強会@関東 ECCV2024 読み会] オンラインマッピング x トラッキング MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping (Chen+, ECCV24)
abemii
0
230
データプロダクトの定義からはじめる、データコントラクト駆動なデータ基盤
chanyou0311
3
350
New Relicを活用したSREの最初のステップ / NRUG OKINAWA VOL.3
isaoshimizu
3
650
AGIについてChatGPTに聞いてみた
blueb
0
130
rootlessコンテナのすゝめ - 研究室サーバーでもできる安全なコンテナ管理
kitsuya0828
3
390
OS 標準のデザインシステムを超えて - より柔軟な Flutter テーマ管理 | FlutterKaigi 2024
ronnnnn
1
310
静的解析で実現した効率的なi18n対応の仕組みづくり
minako__ph
1
150
Featured
See All Featured
The Pragmatic Product Professional
lauravandoore
31
6.3k
How to Think Like a Performance Engineer
csswizardry
20
1.1k
Java REST API Framework Comparison - PWX 2021
mraible
PRO
28
8.2k
VelocityConf: Rendering Performance Case Studies
addyosmani
325
24k
Docker and Python
trallard
40
3.1k
Fashionably flexible responsive web design (full day workshop)
malarkey
405
65k
StorybookのUI Testing Handbookを読んだ
zakiyama
27
5.3k
Agile that works and the tools we love
rasmusluckow
327
21k
Practical Orchestrator
shlominoach
186
10k
Rebuilding a faster, lazier Slack
samanthasiow
79
8.7k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
506
140k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
126
18k
Transcript
You’re not using ElasticSearch? Timon Vonk (@timonvonk)
About me • CTO @ Tolq, zero effort website translations
• Freelance hacker and consultant
None
The battlefield • Ferret • Xapian • Solr • Postgresql’s
TSVectors • Sphinx • Many more
What is ElasticSearch? • Search engine • Cloud in mind
• JSON API • Scriptable • Nosql • Great for things different than search
Why use it? • Cloud setup out of the box
• Fast indexing • Easy API • On the fly mappings • Very customisable
OMG WHAT IS LUCENE • Java library for search •
Only handles the search bit • Terms based vector algorithm
Making a query • Just send JSON: GET localhost:9200/example/peanuts/_search {
‘query’: { text: { ‘my_field’: ’many search terms’ }}} { took: 5, timed_out: false, _shards: { total: 5, successful: 5, failed: 0 }, hits: [ { _index: “example”, _type: “peanuts”, _score: 0.9, _source: { …data } } ] } }
Other types of queries • Terms, full text, boolean, fuzzy,
geolocation and lots more variants • Filters, aggregations, percolation, suggestions
Analysing • Pre-index and pre-search • This is when scoring
happens • Remove stop words, stemming, other normalisations • You can create your own analysers
Aggregations { “query”: … } { “aggregations”: { “rubyist_stats”: {
“stats”: { “field”: “meetup_visits” } } } } Aggregations are an upgrade over < 1.0 facets
Different kinds • min, max, avg, sum • stats, extended_stats
(all of the above + stdev/ mean) • percentile • counts • … all scriptable!
Also buckets! • Create subsets based on conditions • Nest
aggregations • … aaand scriptable
(Ruby) libraries • Good old Tire deprecated • Stretcher! •
elasticsearch-ruby • Also libraries for Go, Node, Javascript, etc • … it’s just json
How Tolq uses ElasticSearch • Suggest better translations for translators
• Fast access to text and translations for general search
Other fun stuff • Kibana • Hadoop • Hosted, autoscaling
providers
Also, it works great for text search! Thanks! We’ve launched!
http://www.tolq.com And hiring!