Creating Scalable APIs & API Performance

–Guy Kawasaki “I’ve never seen a startup die because it
couldn’t scale fast enough. I’ve seen hundreds of startups die because people refused to embrace their product.”

Creating Scalable APIs & API Performance Patrick Heneise

Patrick Heneise Software Architect at blended technologies S.L. MSc in
Media Technology BSc in Computer Science in Media Startup Mentor Co-Organiser MediterráneaJS, BarcelonaJS, NodeBCN & CoreOS Barcelona

–Wikipedia “A set of routines, protocols, and tools for building
software applications.” Application Programming Interface

APIs • could send a lot of emails • could
send a lot of other notiﬁcations • could do background jobs • could do heavy computation • don’t render web content (HTML)

If your API takes more than 50ms for these things…

you might be on the wrong path.

To the topic.

Where to start with scalability?

Measure! You don't know what's going on if you don't
have the numbers.

Unit tests Core Billing #subscribe ✓ fail to upgrade subscription
if no subscription exists ✓ create inactive subscription in database ✓ modify inactive subscription in database ✓ fail to subscribe with an invalid period ✓ fail to subscribe to a non-existing plan 1) create inactive subscription in database 2) modify inactive subscription in database ✓ create active subscription in database and stripe (1358ms) ✓ fail to subscribe if active subscription already exists

End-to-End (E2E) Tests Billing POST /billing/subscribe ✓ should fail to
subscribe to a non existing plan ✓ should pre-subscribe user to basic yearly plan

Response Time and Analytics

Load testing / stress tests

Ok, got the numbers. Now?

Measure again.

Inspect Find the bottleneck of your system. One at a
time.

Performance 101

Data transfer, optimise in- and output • Use JSON •
Minimise transfer data ({ dont_use_excessive_attribute_names: true; }) • Smart endpoints instead of REST-only endpoints (give users the data they need in one call)

Data storage, use the right database for the job •
MySQL for session data? • NoSQL for relational data?

Node distance • use nodes within the same subnet •
use 'Private Networking' / internal network interfaces • try to have the nodes physically close to each other. The best database doesn't help if it's in China and your API server in Barcelona.

Async it! • Does the user really need to know
(and wait for it) if that email has been sent?

Ruby on Rails Scaling Trick #1

––https://blog.engineyard.com/2009/5-tips-to-scale-your-ror- application “Cache, cache, cache and more cache.”

Seriously?

Scaling 101

Request and response optimisation • Provide consumers with the data
they need (to reduce requests)  Example Twitter: Timeline + user info, no need to get every users details separately. Example Instagram: Notify consumers when there is new content instead of letting them poll every X seconds (and generate request)

Data optimisation • Implement caching layers where possible and feasible
• Cache when it makes sense!  Example: Doing complex geo-location searches over a big ElasticSearch index? Cache the result in Redis.

Components and Services

It’s 2015 Don’t build monolithic applications.

Identifying API components

Remember all that data you measured? • E-Mail, notiﬁcations and
other things that involve 3rd party services • background jobs • computation jobs • cron jobs • anything else that takes more than 25ms to respond

Microservices • Anything that doesn't require immediate response to the
user can be done asynchronously by a micro-service. • Example: Using a 3rd party service can add precious time to a response. Instead of waiting, respond with HTTP 102 or 202 (request accepted, processing pending).

–Mike Krieger, Co-Founder @ Instagram “Scaling - replacing all components
of your a car while driving it at 100mph”

The larger the components, the harder they are to replace.
It's easier to change a tyre than the motor.

Should scaling be your concern?

18M requests/week. 22ms response time. 1 API. 1 machine.

ROAAAAR!!

Scaling Pieces

Software • Database • Clustering • Sharding • find the
right database • Web Server • find the optimal web server or proxy (nginx, Apache, haproxy, ...) • API platform • find the optimal platform (node.js, Python, Ruby on Rails, ...) • Assets • Amazon S3 • CDN

“Your users around the world don't care that you wrote
your own DB” –Mike Krieger, Co-Founder @ Instagram

“Your users around the world don't care what tech you
use.”

Stay nimble and focused; choose your tools wisely and don't
reinvent the wheel.

Vertical Hardware Scaling • Add more memory • Add more
computation power • Reboot • Double capacity != double scale / speed

Horizontal Hardware Scaling • Add Database nodes • Add API
platform nodes • Add Computation nodes • Add node locations (cross-datacenter, distance to the consumer) Building a Barcelona startup with a cluster in US-WEST and wondering why your API response time is high?

Horizontal vs. Vertical Hardware Scaling + Cheaper + Faster to
implement + Adds node redundancy & security - Requires extra nodes for load balancing

1. Measure. Find the bottleneck 2. Can performance be improved
with a software ﬁx? 3. Can you exchange the component? 4. Can you scale vertically? 5. Scale horizontally.

Thank you.

Patrick Heneise @PatrickHeneise // @blendedio [email protected]

Creating Scalable APIs & API Performance

Creating Scalable APIs & API Performance

More Decks by PatrickHeneise

Other Decks in Technology

Featured

Transcript