ISTA 2019 - Migrating data-intensive microservices from Python to Go

Migrating data-intensive microservices from Python to Go Nikolay Stoitsev Engineering
Manager @ Uber

Early years Dispatch API Storage

Early years Dispatch API Storage Python Node.js

Invoice Generation Service

Background Legal document

Background Legal document Vary by country

Background Legal document Vary by country Vary by business line

Background Legal document Vary by country Vary by business line
Triggered after every trip or food delivery

Sample architecture Money System Cassandra Kafka Preprocess Render Kafka Consumer
Object Store

More than 30 upstream systems Large Scale

More than 30 upstream systems More than 100 TBs of
data stored Large Scale

data stored Running on 400 containers in multiple DCs Large Scale

data stored Running on 400 containers in multiple DCs Running for 5 years Large Scale

data stored Running on 400 containers in multiple DCs Running for 5 years 99.999% availability for last 6 months Large Scale

data stored Running on 400 containers in multiple DCs Running for 5 years 99.999% availability for last 6 months Implemented in Python Large Scale

Sample architecture Money System Cassandra Kafka Preprocess Render Kafka Consumer
Object Store Web API Hive

Building blocks http://ﬂask.pocoo.org

Flask Example

Flask Usage

MVCS Controller Mapper Service Entities External Services Database

Building blocks https://uwsgi-docs.readthedocs.io/

uWSGI uwsgi python python python

Building blocks http://www.celeryproject.org/

Celery celery-worker celery-worker celery-worker kafka consumer Redis

“Use the right tool for the job”

It hurts velocity at some point

What we need for each language? Training / Best practices
/ Documentation / Experts

/ Documentation / Experts Project template / Bootstrapping

/ Documentation / Experts Project template / Bootstrapping Configuration

/ Documentation / Experts Project template / Bootstrapping Configuration Debuggers

/ Documentation / Experts Project template / Bootstrapping Configuration Debuggers Profilers

/ Documentation / Experts Project template / Bootstrapping Configuration Debuggers Profilers Building, Packaging, Deploying

We picked Go and Java

Why Go?

Broad applicability

High performance

Static typing

Has momentum

From Python to Go

EAFP versus LBYL

Dependency injection https://github.com/uber-go/fx

Cadence instead of Celery https://github.com/uber/cadence

Cadence Cadence DB queue Timers invoice service worker worker worker
worker

MVCS translates nicely

How to migrate? Money System Invoice Generation Storage Python

Option #1 - Big Bang Rewrite Money System Invoice Generation
Storage Python Invoice Generation Go

Option #1 - Big Bang Rewrite Money System Storage Invoice
Generation Go

No visibility on regressions

No visibility on performance degradation

No visibility on feature parity

Option #2 - Do it iteratively

Invoice Generation Storage Kafka

Storage Kafka Preprocess Render

Storage Kafka Preprocess Render Preprocess Go

Storage Kafka Preprocess Render Preprocess Go Compare

Storage Kafka Preprocess Render Preprocess Go Compare Toggle

Volume?

m3 DB https://www.m3db.io

Tally - stats collection in Go https://github.com/uber-go/tally

Tally - stats collection in Go

Measure processing time p95, p99

Storage Kafka Preprocess Render Preprocess Go Compare m3 Grafana

Correctness?

Storage Kafka Preprocess Render Preprocess Go Compare Kafka ELK

Structured logging

Zap https://github.com/uber-go/zap

Benefits of iterative approach Verify regressions Verify performance problems Verify
feature parity

Lessons learned

Spend time to learn the new language

Spend time to read code in the new language

Do a rollout plan and stick to it

Python can scale and is reliable

Q&A Thank you! Nikolay Stoitsev, stoitsev@uber.com

ISTA 2019 - Migrating data-intensive microservi...

ISTA 2019 - Migrating data-intensive microservices from Python to Go

More Decks by Nikolay Stoitsev

Other Decks in Technology

Featured

Transcript