Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Evolution of e-commerce search @ shopping24
Search
Torsten Bøgh Köster
November 19, 2014
Technology
0
1.2k
Evolution of e-commerce search @ shopping24
Held at the first Search Technology Meetup in Hamburg on November, 19th.
Torsten Bøgh Köster
November 19, 2014
Tweet
Share
More Decks by Torsten Bøgh Köster
See All by Torsten Bøgh Köster
🔪 How we cut our AWS costs in half
tboeghk
0
120
Shared Nothing Logging Infrastructure
tboeghk
0
79
Beyond Cloud: A road trip into AWS and back to bare metal
tboeghk
1
56
Shared Nothing Logging Infrastructure
tboeghk
0
1k
Kubernetes the ❤️ way
tboeghk
0
840
Beyond Cloud: A road trip into AWS and back to bare metal
tboeghk
0
74
Open-Source-Logging und -Monitoring (W-JAX 2017)
tboeghk
0
87
Beyond Cloud (W-JAX 2017)
tboeghk
0
110
Open Source Logging & Monitoring (code.talks 2017)
tboeghk
0
82
Other Decks in Technology
See All in Technology
ワールドカフェI /チューターを改良する / World Café I and Improving the Tutors
ks91
PRO
0
120
KubeCon EU 2024 Recap “Kubernetes Policy Time Machine: Where to Next?”
ryysud
0
220
開発パフォーマンスを最大化するための開発体制
ham0215
2
450
APIファーストなプロダクトマネジメントの実践 〜SaaSus Platformでの例〜 / "Practicing API-First Product Management - An Example with SaaSus Platform
oztick139
0
110
LangSmith入門―トレース/評価/プロンプト管理などを担うLLMアプリ開発プラットフォーム
os1ma
3
380
Vertex AI を中心に 生成AIのアップデートを共有します
kaz1437
0
310
Cypress or Playwright?
rainerhahnekamp
0
110
Google Cloud Next '24でブログを10本書いた方法と勉強会を沸かせた方法
yasumuusan
0
310
いつか使うかも貯金してたらめちゃめちゃ機能が増えてた話
riyaamemiya
0
370
エンジニア候補者向け資料2024.04.24.pdf
macloud
0
3.3k
20240418_Google ColabにLLMが搭載されたようなのでPython x データ分析の勉強方法を考えてみる
doradora09
0
140
Cloud Native Java with Spring Boot (CNCF Aarhus, April 2024)
thomasvitale
1
180
Featured
See All Featured
Teambox: Starting and Learning
jrom
128
8.4k
Writing Fast Ruby
sferik
621
60k
jQuery: Nuts, Bolts and Bling
dougneiner
59
7.1k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
221
21k
Rails Girls Zürich Keynote
gr2m
91
13k
XXLCSS - How to scale CSS and keep your sanity
sugarenia
241
1.2M
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
6
1.5k
The Language of Interfaces
destraynor
151
23k
Build The Right Thing And Hit Your Dates
maggiecrowley
24
2k
Build your cross-platform service in a week with App Engine
jlugia
225
17k
Put a Button on it: Removing Barriers to Going Fast.
kastner
58
3.1k
The Pragmatic Product Professional
lauravandoore
25
5.8k
Transcript
Evolution of e-commerce search @ shopping24 Search Technology Meetup Hamburg
Torsten Bøgh Köster (Shopping24) 19. November 2014
Agenda Why search? Motivation & introduction Evolutionary steps taken Advanced
steps Pitfalls
@tboeghk ‣CTO shopping24 internet group ‣University of Hamburg, class of
2005 ‣Likes: search, build, delivery, code quality, road bike
None
Open Source Power. Delivered.
search system architecture overview
Fun fact: <1% visitors actually use the search bar.
Search enables automatic SEA scaling. But what about navigating afterwards?
Agenda Why search? Motivation & introduction Evolutionary steps taken Advanced
steps Pitfalls
Don’t get me started on tokenizing. Move expensive operations (synonyms,
stemming) to index time
German stemming: „Ein_ Geschicht_ voll__ Missverständniss_“: Refrain from Porter and
Snowball stemmer.
Extend recall using synonyms & subtopics, use edismax query parser
with boost terms for high precision. Consider reranking to penalize documents
3 approaches to navigating search results
use facetting to narrow a search result, use adaptive tree
structures
the direct spellchecker in Solr does a great job. Consider
word break. Avoid dictionaries, handle special cases using synonyms (+ custom code).
Use Solrs more like this. Supply terms in mlt request.
Works on >1 documents as well. Filter on gender (and categories).
remove terms from query and retry when hitting zero results.
Uses spellchecker & custom collators
Recycle Solr spellchecker infrastructure to retrieve related brands, categories &
searches.
Agenda Why search? Motivation & introduction Evolutionary steps taken Advanced
steps Pitfalls
TF/IDF ranking does not work for e-commerce search. Consider the
bmax query parser.
first impression matters: use solr grouping and expand to „fold“
similar products.
Separate data & ranking information. Retrieve ranking information from an
external data store (ExternalFileFieldType, RedisFieldType). Use boost functions to mix information retrieved. per document lookup
Agenda Why search? Motivation & introduction Evolutionary steps taken Advanced
steps Pitfalls
Visualize results for the target audience. Separate business from technical
views.
Custom code in Solr is failure by design. You will
inevitably hit garbage collection hell. GC will happen, deal with it.
Ultimate solution: issue replication slots to slaves. Perform Full GC
after cache warming.
Find us on github.com
Questions? @tboeghk developer.s24.com
[email protected]