Cloudstack design decisions
Pierre-Yves Ritschard
June 10, 2014
Cloud operations at scale
Transcript
CLOUDSTACK DESIGN DECISIONS: CLOUD OPERATIONS AT SCALE
SHORT BIO
  Pierre-Yves Ritschard
  CTO @ exoscale - The safe home for your cloud applications
  Open Source Developer - pithos, cyanite, riemann, collectd, openbsd
  Architect of several cloud platforms - paper.li
  Recovering Operations Engineer
Simple and efficient cloud hosting platform
Full compatibility with automation tools
Hosted in a safe jurisdiction
CLOUD BUILDING BLOCKS
  service
  infrastructure
  software
  people
SERVICE: SIMPLICITY AND SCALABILITY
  Cloudstack based
  Basic networking
  Local storage
  KVM hypervisor: SmartOS inspired
CLOUDSTACK
  Great extensibility, easy to plug into.
BASIC NETWORKING
  One IP per VM.
  Security groups are hypervisor-controlled layer 2 firewall rules.
  Provides all the flexibility of a traditional firewall, completely API controlled.
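The way a security group behaves like a firewall can be sketched in a few lines. This is an illustration only, not Cloudstack's implementation: the `Rule` class, the `allowed` function and the `web_group` rule set are hypothetical names, and a default-deny ingress policy is assumed.

```python
import ipaddress
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One ingress rule of a (hypothetical) security group."""
    protocol: str    # "tcp" or "udp"
    port_start: int
    port_end: int
    cidr: str        # source network admitted by this rule


def allowed(rules, protocol, port, source_ip):
    """Return True if any rule admits the packet; otherwise deny (assumed default)."""
    addr = ipaddress.ip_address(source_ip)
    for rule in rules:
        if (rule.protocol == protocol
                and rule.port_start <= port <= rule.port_end
                and addr in ipaddress.ip_network(rule.cidr)):
            return True
    return False


# Example group: HTTP open to everyone, SSH only from one documentation network.
web_group = [
    Rule("tcp", 80, 80, "0.0.0.0/0"),
    Rule("tcp", 22, 22, "192.0.2.0/24"),
]
```

Because the rules are plain data, they can be created and changed entirely through an API, which is the property the slide emphasises.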
LOCAL STORAGE
  Fast I/O, persistent disks.
KVM HYPERVISOR
  Best in class hypervisor.
  Diskless and netboot approach.
  Avoids resource waste, facilitates upgrades.
INFRASTRUCTURE: THE GOOD CITIZEN CONTRACT
  Configuration management
  Visibility
  Build factory
  Remote execution
THE GOOD CITIZEN CONTRACT
  new machines have roles
  a role defines the converged configuration as the sum of its components
  each component has an expected normal state and reports it
  no local intervention needed
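The contract above can be sketched as data plus a comparison. The role names, component names and state values below are hypothetical, chosen only to show a role's configuration being built as the sum of its components and checked against what a node reports.

```python
# Hypothetical roles: each is just a list of components.
ROLES = {
    "hypervisor": ["base", "kvm", "collectd"],
    "logserver": ["base", "logstash", "elasticsearch"],
}

# Hypothetical expected normal state of each component.
EXPECTED = {
    "base": {"ntp": "running", "sshd": "running"},
    "kvm": {"libvirtd": "running"},
    "collectd": {"collectd": "running"},
    "logstash": {"logstash": "running"},
    "elasticsearch": {"elasticsearch": "running"},
}


def converged_config(role):
    """Build a role's expected state as the sum (merge) of its components."""
    config = {}
    for component in ROLES[role]:
        config.update(EXPECTED[component])
    return config


def report(role, observed):
    """Return {service: (expected, observed)} for every deviation from normal."""
    deviations = {}
    for service, want in converged_config(role).items():
        have = observed.get(service, "missing")
        if have != want:
            deviations[service] = (want, have)
    return deviations
```

An empty report means the node honours its contract; anything else is a deviation that can be surfaced centrally rather than fixed by hand on the machine.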
CONFIGURATION MANAGEMENT
  code is a great way to define infrastructure
  ensures homogeneity
  ability to iterate fast
  great source of change tracking
  avoids fear of change
OVER 3000 COMMITS
CONFIGURATION MANAGEMENT: PUPPET
  battle tested tool
  simple declarative DSL to express configuration
  fits our component approach well
VISIBILITY: FROM THE MAP TO THE TERRITORY
  logs
  metrics
  alerts
WHY FOCUS ON VISIBILITY
  distributed systems with lots of moving parts, high node volatility
LOGS
  all application and system logs sent over the wire
  logstash dissects and extracts metadata
  elasticsearch indexes for easy retrieval
  simple correlation
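What "dissecting" a log line means can be shown with a small Python analogue. This is not logstash configuration: the `dissect` function and the regular expression are assumptions for illustration, handling only a classic syslog-style line.

```python
import re

# Assumed line shape: "<timestamp> <host> <program>[<pid>]: <message>"
LINE = re.compile(
    r"(?P<timestamp>\w{3}\s+\d+ \d{2}:\d{2}:\d{2}) "
    r"(?P<host>\S+) "
    r"(?P<program>[^\[:]+)(?:\[(?P<pid>\d+)\])?: "
    r"(?P<message>.*)"
)


def dissect(line):
    """Split a raw log line into named fields ready for indexing."""
    match = LINE.match(line)
    if match is None:
        # Keep unparseable lines, but tag them so they can be found later.
        return {"message": line, "tags": ["unparsed"]}
    return {key: value for key, value in match.groupdict().items()
            if value is not None}


event = dissect("Jun 10 14:02:11 web01 sshd[4242]: Accepted publickey for deploy")
```

Once every line carries fields such as `host` and `program`, correlating events across machines becomes a matter of querying the index on those fields.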
METRICS
  all application and system metrics sent over the wire by collectd
  graphite's carbon aggregates and produces appropriate roll-ups
  if it moves, graph it. if it doesn't, graph it in case it starts moving.
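As a rough sketch of the two ideas on this slide: carbon's plaintext protocol takes one `path value timestamp` line per data point, and a roll-up reduces fine-grained points to coarser ones. The function names and the metric path are hypothetical, and averaging is assumed as the aggregation method.

```python
def carbon_line(path, value, timestamp):
    """Format one data point in carbon's plaintext line format."""
    return f"{path} {value} {int(timestamp)}\n"


def rollup(points, width):
    """Average (timestamp, value) points into buckets of `width` seconds."""
    buckets = {}
    for timestamp, value in points:
        start = timestamp - timestamp % width   # beginning of the bucket
        buckets.setdefault(start, []).append(value)
    return [(start, sum(values) / len(values))
            for start, values in sorted(buckets.items())]


# Three 10-second samples reduced to one-minute resolution.
samples = [(0, 1.0), (10, 3.0), (60, 5.0)]
per_minute = rollup(samples, 60)
```

Roll-ups are what keep long retention affordable: recent data stays at full resolution while older data is kept only as coarser averages.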
ALERTS
  unbounded stream of log and metric data
  passive approach copes well with node volatility
  riemann takes decisions based on stream content
  ability to extract meaningful information
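The passive approach can be sketched in Python. Nodes push events carrying a time-to-live, and silence is detected when an event expires instead of by polling each node. The field names follow riemann's event model (`host`, `service`, `metric`, `time`, `ttl`, `state`), but the `Index` class and `classify` function are illustrative assumptions, not riemann's API.

```python
class Index:
    """Keep the latest event per (host, service) and expire stale ones."""

    def __init__(self):
        self.events = {}

    def update(self, event):
        self.events[(event["host"], event["service"])] = event

    def expired(self, now):
        """Drop events whose TTL has elapsed and return them marked 'expired'."""
        stale = [e for e in self.events.values()
                 if now - e["time"] > e["ttl"]]
        for event in stale:
            del self.events[(event["host"], event["service"])]
        return [{**event, "state": "expired"} for event in stale]


def classify(event, warning, critical):
    """Decide a state from the metric carried by the event itself."""
    metric = event["metric"]
    if metric >= critical:
        state = "critical"
    elif metric >= warning:
        state = "warning"
    else:
        state = "ok"
    return {**event, "state": state}
```

A node that disappears simply stops sending events and shows up as expired, so nothing needs to know the list of nodes in advance.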
BUILD FACTORY
  continuous integration
  package repositories
CONTINUOUS INTEGRATION
  over 60 build jobs
  ties into our code hosting platform
  handled by jenkins
PACKAGE REPOSITORIES
  generates valid and signed Debian repositories
  ensures fast upgrades
  simplifies configuration management
REMOTE EXECUTION
  a simple pubsub system
  recurrent commands stored as scenarios
  command line, HTTP and IRC interaction
A SIMPLE PUBSUB SYSTEM
  each node runs an agent responsible for carrying out commands.
  commands are sent to groups of nodes (by predicates such as role).
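A minimal in-process sketch of predicate-based dispatch follows. The `Agent` and `Bus` classes are hypothetical, a real system would publish over a message broker, and `handle` only returns a string instead of executing anything.

```python
class Agent:
    """Runs on a node and carries out the commands addressed to it."""

    def __init__(self, name, facts):
        self.name = name
        self.facts = facts   # e.g. {"role": "web"}

    def matches(self, predicate):
        """True when every key in the predicate equals the agent's fact."""
        return all(self.facts.get(key) == value
                   for key, value in predicate.items())

    def handle(self, command):
        # Placeholder for real execution.
        return f"{self.name}: ran {command}"


class Bus:
    """Deliver each published command to the agents matching its predicate."""

    def __init__(self):
        self.agents = []

    def subscribe(self, agent):
        self.agents.append(agent)

    def publish(self, predicate, command):
        return [agent.handle(command)
                for agent in self.agents if agent.matches(predicate)]


bus = Bus()
bus.subscribe(Agent("web01", {"role": "web"}))
bus.subscribe(Agent("web02", {"role": "web"}))
bus.subscribe(Agent("db01", {"role": "db"}))
replies = bus.publish({"role": "web"}, "uptime")
```

Addressing by predicate instead of by hostname means a command reaches whichever nodes currently hold the role, which suits an environment where nodes come and go.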
RECURRENT COMMANDS STORED AS SCENARIOS
  intricate workflows can be expressed through a simple DSL
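The slides do not show the DSL itself, so the following is only a guess at the general shape: a scenario as an ordered list of (predicate, command) steps that stops at the first failure. `run_scenario`, `ROLLING_RESTART` and the `dispatch` callback are all hypothetical.

```python
def run_scenario(steps, dispatch):
    """Run steps in order; stop at the first step where any node fails.

    `dispatch(predicate, command)` is assumed to return {node: succeeded}.
    Returns (completed_steps, failed_step_or_None).
    """
    completed = []
    for predicate, command in steps:
        results = dispatch(predicate, command)
        if not all(results.values()):
            return completed, (predicate, command)
        completed.append((predicate, command))
    return completed, None


# A stored scenario is just named data that can be replayed on demand.
ROLLING_RESTART = [
    ({"role": "web"}, "drain"),
    ({"role": "web"}, "restart nginx"),
    ({"role": "web"}, "undrain"),
]
```

Keeping scenarios as data is what lets the same workflow be triggered from a command line, an HTTP call or a chatroom without rewriting it.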
COMMAND LINE, HTTP AND IRC INTERACTION
  most of our production environment can be controlled through our chatroom
SOFTWARE: FILLING IN THE GAPS
  Customer management
  Real-time metering and billing
  Integrated console
  A few other things
CUSTOMER MANAGEMENT
  Keeping track of our users
  Support services (ticket management, coupons, emails)
REAL-TIME METERING AND BILLING
  can't be tied to a Cloudstack-only solution
  Cloudstack emits useful data
  ties into our customer management
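One way to picture metering from emitted data is to fold start and stop events into running seconds per resource. The event shape below (`type`, `account`, `resource`, `time`) is a hypothetical simplification and not Cloudstack's actual event format.

```python
from collections import defaultdict


def meter(events, now):
    """Sum running seconds per (account, resource) from start/stop events."""
    started = {}
    usage = defaultdict(int)
    for event in sorted(events, key=lambda e: e["time"]):
        key = (event["account"], event["resource"])
        if event["type"] == "start":
            started[key] = event["time"]
        elif event["type"] == "stop" and key in started:
            usage[key] += event["time"] - started.pop(key)
    # Resources still running are metered up to the current time.
    for key, since in started.items():
        usage[key] += now - since
    return dict(usage)


events = [
    {"type": "start", "account": "acme", "resource": "vm-1", "time": 0},
    {"type": "stop", "account": "acme", "resource": "vm-1", "time": 3600},
    {"type": "start", "account": "acme", "resource": "vm-2", "time": 1800},
]
usage = meter(events, now=7200)
```

Because the input is just a stream of events, the same metering logic could consume data from sources other than Cloudstack, which matches the first point on the slide.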
INTEGRATED CONSOLE
  integrated experience across our services
  hides complexity and Cloudstack specifics
  exposes exoscale-specific features
A FEW OTHER THINGS
  pithos
  cyanite
  fleet
  collectd add-ons
PEOPLE: EFFICIENT WORK, QUIET NIGHTS
  Small SRE team
  Avoiding deploy anxiety
SMALL SRE TEAM
  Our platform must be simple to operate; additional moving parts must provide business value or help operations
AVOIDING DEPLOY ANXIETY
  Our software and infrastructure give us good tools to ensure quiet nights and easily caught errors
LOOKING BACK
  Cloudstack is a solid foundation for an IaaS platform
  There's a bit more to it than just installing Cloudstack
  Building a sustainable and scalable platform on top of Cloudstack is possible
QUESTIONS?