Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Elasticsearch & 63 Million WordPress Sites
Search
xyu
February 06, 2014
Technology
1
260
Elasticsearch & 63 Million WordPress Sites
Overview of the Elasticsearch infrastructure that Automattic maintains to support WordPress.com.
xyu
February 06, 2014
Tweet
Share
More Decks by xyu
See All by xyu
Improving Search With One Weird Trick
xyu
0
140
Better Performance & Uptime Through Async
xyu
0
180
Dealing With Time Travelers in Analytics
xyu
0
140
Scaling WordPress Queries With Elasticsearch
xyu
1
13k
Scale Out Don't Scale Up
xyu
2
290
Understanding Language and Fixing WP Search
xyu
1
220
Jetpack Related Posts for Power Users
xyu
1
800
WP ❤️ Elasticsearch
xyu
1
470
Cloud Powered WordPress for Everyone
xyu
0
190
Other Decks in Technology
See All in Technology
サイボウズ 開発本部採用ピッチ / Cybozu Engineer Recruit
cybozuinsideout
PRO
10
76k
Physical AI on AWS リファレンスアーキテクチャ / Physical AI on AWS Reference Architecture
aws_shota
1
170
大規模ECサイトのあるバッチのパフォーマンスを改善するために僕たちのチームがしてきたこと
panda_program
1
400
Phase01_AI座学_基礎
overflowinc
0
4.3k
やさしいとこから始めるGitHubリポジトリのセキュリティ
tsubakimoto_s
3
1.8k
SSoT(Single Source of Truth)で「壊して再生」する設計
kawauso
2
380
The essence of decision-making lies in primary data
kaminashi
0
120
昔話で振り返るAWSの歩み ~S3誕生から20年、クラウドはどう進化したのか~
nrinetcom
PRO
0
100
韓非子に学ぶAI活用術
tomfook
3
1k
TUNA Camp 2026 京都Stage ヒューリスティックアルゴリズム入門
terryu16
0
560
Change Calendarで今はOK?を仕組みにする
tommy0124
1
120
QA組織のAI戦略とAIテスト設計システムAITASの実践
sansantech
PRO
1
200
Featured
See All Featured
Lightning talk: Run Django tests with GitHub Actions
sabderemane
0
150
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.8k
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
160
Exploring anti-patterns in Rails
aemeredith
2
290
Bioeconomy Workshop: Dr. Julius Ecuru, Opportunities for a Bioeconomy in West Africa
akademiya2063
PRO
1
76
A better future with KSS
kneath
240
18k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
133
19k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
[RailsConf 2023] Rails as a piece of cake
palkan
59
6.4k
The SEO identity crisis: Don't let AI make you average
varn
0
420
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.1k
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
80
Transcript
Elasticsearch & 63 Million WordPress Sites Elasticsearch Boston (Feb. 2014)
Xiao Yu Code Wrangler — Automattic @HypertextRanch
[email protected]
xyu
VaultPress Jetpack Simplenote Akismet Polldaddy Gravatar VideoPress IntenseDebate
Simperium Code Poet Cloudup
Cluster Stats • 63M Sites • 743M Documents • 12TB
Primary + Replicas • 51M Query Ops / Day • 15M Index Ops / Day 2 Major Use Cases • Global Search ! • Local Search Elasticsearch + WordPress.com
Infrastructure Layout Internal API Cache REST API PHP Node 1
Node 2 Cluster A Node 1 Node 2 Node 3 Node n Cluster B Stats
Documents & Types /index/post { blog_id: 123, post_id: 456, title:
"Search!", content: "…", blog: { lang: "en", … }, … } /index/blog { blog_id: 123, url: "www.xyu.io", follower_ids: [ 789, … ], lang: "en", indexable: true, … }
Storage Strategy • Grow Number of Indices (10M Sites /
Index) • 25 Shards / Index (400K Sites / Shard) • 3 Copies of Data (1 Primary + 2 Replicas) 2 Major Use Cases • Global Search • Query All Shards • Local Search • Query One Shard Indicies & Shards
Nodes & Clusters Warning, YMMV! !
Monitoring Cluster Health
Monitoring Cluster Health
Monitoring Cluster Health
Monitoring Cluster Health
Thanks! @HypertextRanch
[email protected]
xyu