Elasticsearch & 63 Million WordPress Sites
xyu
February 06, 2014
Overview of the Elasticsearch infrastructure that Automattic maintains to support WordPress.com.
Transcript
Elasticsearch & 63 Million WordPress Sites
Elasticsearch Boston (Feb. 2014)
Xiao Yu
Code Wrangler, Automattic
@HypertextRanch
[email protected]
xyu
VaultPress, Jetpack, Simplenote, Akismet, Polldaddy, Gravatar, VideoPress, IntenseDebate, Simperium, Code Poet, Cloudup
Elasticsearch + WordPress.com

Cluster Stats
• 63M Sites
• 743M Documents
• 12TB Primary + Replicas
• 51M Query Ops / Day
• 15M Index Ops / Day

2 Major Use Cases
• Global Search
• Local Search
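To put the daily figures above in perspective, here is a minimal Python sketch that converts them to average per-second rates and documents per site. The constants are the numbers quoted on the slide; the helper name `per_second` is made up for illustration, and the results are plain daily averages, not peak load.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

# Figures quoted on the slide.
SITES = 63_000_000
DOCUMENTS = 743_000_000
QUERY_OPS_PER_DAY = 51_000_000
INDEX_OPS_PER_DAY = 15_000_000


def per_second(ops_per_day):
    """Average rate over a full day (hypothetical helper; peaks will be higher)."""
    return ops_per_day / SECONDS_PER_DAY


avg_queries_per_sec = per_second(QUERY_OPS_PER_DAY)
avg_index_ops_per_sec = per_second(INDEX_OPS_PER_DAY)
docs_per_site = DOCUMENTS / SITES

print(f"queries/s:   {avg_queries_per_sec:.0f}")    # about 590
print(f"index ops/s: {avg_index_ops_per_sec:.0f}")  # about 174
print(f"docs/site:   {docs_per_site:.1f}")          # about 11.8
```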
Infrastructure Layout
[Diagram. Labels: PHP, REST API, Internal API, Cache, Stats; Cluster A (Node 1, Node 2); Cluster B (Node 1, Node 2, Node 3, Node n)]
Documents & Types

/index/post
{
  blog_id: 123,
  post_id: 456,
  title: "Search!",
  content: "…",
  blog: { lang: "en", … },
  …
}

/index/blog
{
  blog_id: 123,
  url: "www.xyu.io",
  follower_ids: [ 789, … ],
  lang: "en",
  indexable: true,
  …
}
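As a rough illustration of how the post fields above could be queried, the Python sketch below builds a search request body. It is not taken from the talk: the function name `build_search_body` is hypothetical, and the body uses the `bool` / `multi_match` / `term` query syntax of current Elasticsearch releases, which differs from what a 2014 cluster would have accepted. Only the field names (`title`, `content`, `blog_id`) come from the slide.

```python
import json


def build_search_body(text, blog_id=None):
    """Build a query body matching `text` against post title and content.

    Hypothetical helper. When blog_id is given, results are restricted
    to that one site; otherwise posts from every site can match.
    """
    query = {
        "bool": {
            "must": [
                {"multi_match": {"query": text, "fields": ["title", "content"]}}
            ]
        }
    }
    if blog_id is not None:
        # Exact-match filter on the site ID stored in each post document.
        query["bool"]["filter"] = [{"term": {"blog_id": blog_id}}]
    return {"query": query}


# Search across all sites, then within the example site from the slide.
print(json.dumps(build_search_body("search"), indent=2))
print(json.dumps(build_search_body("search", blog_id=123), indent=2))
```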
Indices & Shards

Storage Strategy
• Grow Number of Indices (10M Sites / Index)
• 25 Shards / Index (400K Sites / Shard)
• 3 Copies of Data (1 Primary + 2 Replicas)

2 Major Use Cases
• Global Search
  • Query All Shards
• Local Search
  • Query One Shard
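The storage numbers above can be worked through with a small Python sketch. Two assumptions are labeled in the code: the index count is derived by assuming each index is filled to 10M sites before the next is created (the slide does not state how many indices exist), and the `routing` parameter keyed on `blog_id` is one plausible way to get "query one shard" behavior, not something the slide confirms. The function names are hypothetical.

```python
import math

# Per-index figures quoted on the slide.
SITES_PER_INDEX = 10_000_000
SHARDS_PER_INDEX = 25
COPIES_OF_DATA = 3  # 1 primary + 2 replicas


def storage_layout(total_sites):
    """Derive index and shard counts for a given number of sites.

    Assumption: indices are filled to SITES_PER_INDEX before a new one
    is added, so the index count is a ceiling division.
    """
    indices = math.ceil(total_sites / SITES_PER_INDEX)
    primary_shards = indices * SHARDS_PER_INDEX
    return {
        "indices": indices,
        "sites_per_shard": SITES_PER_INDEX // SHARDS_PER_INDEX,
        "primary_shards": primary_shards,
        "total_shard_copies": primary_shards * COPIES_OF_DATA,
    }


def search_params(blog_id=None):
    """URL parameters for a search request.

    Assumption: local search routes on blog_id so that a single shard
    is queried; global search sends no routing and fans out to all shards.
    """
    return {} if blog_id is None else {"routing": str(blog_id)}


layout = storage_layout(63_000_000)
print(layout)
print(search_params())     # global search
print(search_params(123))  # local search for one site
```

Under these assumptions, 63M sites work out to 7 indices, 175 primary shards, and 525 shard copies in total.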
Nodes & Clusters
Warning, YMMV!
Monitoring Cluster Health
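The transcript does not say how cluster health is monitored here. As a generic sketch, Elasticsearch exposes a `_cluster/health` endpoint whose JSON response includes fields such as `status`, `number_of_nodes`, and `unassigned_shards`. The Python below fetches that response and flags anything that is not green. The function names, the expected node count, and the sample response are made up for illustration.

```python
import json
from urllib.request import urlopen


def fetch_health(base_url):
    """GET <base_url>/_cluster/health and return the parsed JSON body."""
    with urlopen(f"{base_url}/_cluster/health", timeout=5) as resp:
        return json.load(resp)


def summarize_health(health, expected_nodes=None):
    """Return {"ok": bool, "problems": [...]} for a health response.

    Hypothetical helper: which checks matter is a judgment call.
    """
    problems = []
    status = health.get("status", "unknown")
    if status != "green":
        problems.append(f"status is {status}")
    unassigned = health.get("unassigned_shards", 0)
    if unassigned > 0:
        problems.append(f"{unassigned} unassigned shards")
    nodes = health.get("number_of_nodes")
    if expected_nodes is not None and nodes != expected_nodes:
        problems.append(f"{nodes} nodes, expected {expected_nodes}")
    return {"ok": not problems, "problems": problems}


# Made-up sample response, not data from the talk.
sample = {"status": "yellow", "number_of_nodes": 3, "unassigned_shards": 25}
print(summarize_health(sample, expected_nodes=4))
```

Against a live cluster this would be called as `summarize_health(fetch_health("http://localhost:9200"))`, where the URL is a placeholder.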
Thanks!
@HypertextRanch
[email protected]
xyu