Just Enough Ops for Developers

Peter Baumgartner
DjangoCon US 2022
@ipmb lincolnloop.com

About Me
• Founder at Lincoln Loop (lincolnloop.com)
• Co-author of High Performance Django (highperformancedjango.com)
• Building AppPack (apppack.io)

Prepping your project for production
Watch my talk at DjangoCon 2019

Why just enough Ops?

PaaS & Managed Services are really good

Forget about 😴
• System security/hardening
• Routing/networking
• Secrets management
• Deployments
• Scaling (how to, not when to)
• Process management (systemd, docker, etc.)
• Hardware failures

But! You need to understand the basics
• CPU
• RAM
• I/O

CompSci 101

Disk (HDD, SSD)
Persistent File Storage
• Disk access is slow
• Usually ephemeral

CPU (processor)
Code execution

How many CPUs do I need?
It depends

1 request/1 process/1 CPU

Cook = CPU
Cashier = Python process
Customer = Request
https://fastapi.tiangolo.com/async/

How many cooks do I need?
It depends

Rule of 👍
More Users → More CPU

2 requests/1 process/1 CPU

2 requests/2 processes/2 CPUs

How do I minimize cost?

How do I maximize CPU usage?
Multiple Processes per CPU

2 requests/2 processes/1 CPU

gunicorn \
  --workers=3 \
  …

Tuning Worker Count
Each app is different
• Start at double your CPU count
• You can experiment with adding workers until:
  • Not enough memory
  • Response times plateau or degrade
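The starting point above can be sketched in a few lines of Python. This only illustrates the rule of thumb: `starting_worker_count` is a hypothetical helper, not part of gunicorn, and the right number still has to be found by watching memory and response times.

```python
import os


def starting_worker_count(cpu_count: int, multiplier: int = 2) -> int:
    """Return a first guess for gunicorn's --workers: double the CPU count."""
    if cpu_count < 1:
        raise ValueError("cpu_count must be at least 1")
    return cpu_count * multiplier


if __name__ == "__main__":
    # os.cpu_count() can return None, and inside a container it may report
    # more CPUs than the platform actually allots, so treat it as a hint.
    cpus = os.cpu_count() or 1
    print(f"{cpus} CPUs -> start with --workers={starting_worker_count(cpus)}")
```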

How do I maximize CPU usage?
Improve Application Performance

1 CPU / 1 minute
100 ms response time = 600 requests
1 s response time = 60 requests
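The arithmetic behind this slide is the 60,000 ms in a minute divided by the response time. A minimal sketch, assuming each CPU runs one process that handles requests back to back with no idle time (`requests_per_minute` is a name made up for this example):

```python
def requests_per_minute(response_time_ms: int, cpus: int = 1) -> int:
    """Requests completed per minute when each CPU serves one request at a time."""
    if response_time_ms <= 0:
        raise ValueError("response_time_ms must be positive")
    # 60,000 ms per minute, divided by the time each request occupies a CPU.
    return cpus * (60_000 // response_time_ms)


if __name__ == "__main__":
    print(requests_per_minute(1_000))  # 60
    print(requests_per_minute(250))    # 240
```

Cutting the response time to a quarter lets the same CPU serve four times the traffic, which is why application performance shows up directly in the bill.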

⏳ Slow responses
⏱ More CPU time
💰 Higher cost

CompSci 101
I/O

I/O (Input/Output) Examples
• Reading/writing files
• Database queries
• Object storage (S3)
• Search index queries
• Third-party APIs

Everything waits for input/output

Synchronous I/O is blocking 🤓

🤓 Asynchronous I/O is non-blocking
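To make the blocking/non-blocking distinction concrete, here is a small sketch using only the standard library. `asyncio.sleep` stands in for a database query or API call; `fake_io` and the 0.2 second delays are invented for the example.

```python
import asyncio
import time


async def fake_io(name: str, seconds: float) -> str:
    # While this coroutine waits, the event loop is free to run other work.
    await asyncio.sleep(seconds)
    return name


async def main() -> tuple[list[str], float]:
    start = time.perf_counter()
    # Both waits overlap, so the total is close to the longest wait,
    # not the sum of the two.
    results = await asyncio.gather(fake_io("db", 0.2), fake_io("api", 0.2))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    results, elapsed = asyncio.run(main())
    print(results, f"{elapsed:.2f}s")
```

With blocking calls such as `time.sleep`, the same two waits would run one after the other and take roughly twice as long.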

Async Django
Views since Django 3.1 / ORM since Django 4.1

Asynchronous programming is hard 🤯

“Hardware is cheap, Programmers are expensive”
Jeff Atwood

How do I maximize CPU usage?
gevent
🪄 magic

2 workers with gevent

gunicorn \
  --worker-class=gevent \
  --worker-connections=50 \
  …
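The same flags can also live in a gunicorn config file, which is plain Python. A sketch, assuming the file is named `gunicorn.conf.py` and the `gevent` package is installed; the worker count is taken from the "2 workers with gevent" slide.

```python
# gunicorn.conf.py (assumed filename; gunicorn reads module-level names as settings)
workers = 2               # "2 workers with gevent"
worker_class = "gevent"   # same as --worker-class=gevent; requires gevent
worker_connections = 50   # same as --worker-connections=50, applied per worker
```

With these values, each worker can hold up to 50 simultaneous connections, so the two workers together can have up to 100 requests in flight while they wait on I/O.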

Not Enough CPU

Backlog

Error Codes
• 502 Bad Gateway
• 503 Service Unavailable
• 504 Gateway Timeout

Memory (RAM)
Ephemeral cache
• “Warm” code cache
• Python objects
• File processing/manipulation
• Network sockets
• File descriptors

Memory (RAM)
Ephemeral cache
• Fast
• Limited

Memory usage for a typical Django app
128 - 512+ MB per process

👷 4 gunicorn workers
📊 512 MB per process
🟰 2 GB of memory
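The sizing arithmetic on this slide works in both directions. A minimal sketch with hypothetical helper names: how much memory a given worker count needs, and how many workers fit under a given limit.

```python
def total_memory_mb(workers: int, per_worker_mb: int) -> int:
    """Memory needed when every worker uses per_worker_mb."""
    return workers * per_worker_mb


def max_workers(memory_limit_mb: int, per_worker_mb: int) -> int:
    """Largest worker count that stays within memory_limit_mb."""
    return memory_limit_mb // per_worker_mb


if __name__ == "__main__":
    print(total_memory_mb(4, 512))  # 2048 MB, i.e. 2 GB
    print(max_workers(1024, 512))   # 2
```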

Over 100% memory → ☠

Not enough memory?
• ⬆ Increase allowed memory
• ⬇ Reduce processes/workers

Memory usage should be stable

Stable Memory Usage

Variable (but Stable) Memory Usage

Django memory tips
• Don’t read huge files into a string/byte object
• Don’t process a huge queryset
• Use Model.objects.iterator()
• Use .values() to avoid creating a model instance
• Use .only() to avoid loading large text fields
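The first tip can be shown without a Django project. The sketch below hashes a file in fixed-size chunks instead of calling `read()` on the whole thing; `sha256_of_file` and the 64 KB chunk size are choices made for this example, not something from the talk.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 64 * 1024) -> str:
    """Hash a file of any size while holding only one chunk in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        # iter() with a sentinel keeps calling f.read until it returns b"".
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Memory use stays near `chunk_size` whether the file is 1 MB or 10 GB. The queryset tips follow the same idea: `iterator()` streams rows rather than caching every result on the queryset.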

Memory Leak

Memory Leak Causes
• Bug in C extension (no garbage collection)
• Leaving file descriptors or network sockets open (use context managers)
• Global objects (sometimes accidental)

gunicorn \
  --max-requests=10000 \
  --max-requests-jitter=500 \
  …

Databases

Same rules apply

Rule of 👍
Database should fit in RAM
Remember: disk is slow 🌋

Scaling

Horizontal Vertical

Autoscaling is great 🎉

…but it’s not magic 🪄

Serverless

Serverless
1 request/1 process/1 CPU

Serverless
Pricing
Per request + Allocated resources per millisecond of response time
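The pricing model can be written as a formula: a flat charge per request plus a charge for the resources held while each request runs. The sketch below assumes "allocated resources" means memory in GB and uses placeholder prices that are not any provider's real rates.

```python
def serverless_cost(
    requests: int,
    avg_response_ms: float,
    memory_gb: float,
    price_per_request: float,
    price_per_gb_second: float,
) -> float:
    """Per-request charge plus memory held for the duration of each response."""
    request_charge = requests * price_per_request
    gb_seconds = requests * (avg_response_ms / 1000) * memory_gb
    return request_charge + gb_seconds * price_per_gb_second


if __name__ == "__main__":
    # Placeholder prices, chosen only to keep the arithmetic easy to follow.
    prices = {"price_per_request": 0.000001, "price_per_gb_second": 0.00002}
    fast = serverless_cost(1_000_000, 100, 0.5, **prices)
    slow = serverless_cost(1_000_000, 1_000, 0.5, **prices)
    print(f"{fast:.2f}")  # about 2.00
    print(f"{slow:.2f}")  # about 11.00
```

Under these assumptions the per-request charge is fixed, so a tenfold slower response multiplies only the duration part of the bill.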

Serverless
Forget about 😴
• Scaling
• CPU allocation
• Application server (workers, gevent, etc.)

Serverless
Worry about 😥
• Budgeting/variable costs
• Cold starts
• Database connections
• Limitations (upload size, max duration)
• Remote shell access

Final Thoughts 🧐
• Get to know your application (“observability”)
  • CPU usage
  • Memory usage
  • Response times
  • Error rates/uptime

Thanks! 👋 @ipmb lincolnloop.com apppack.io
