Building for 100x scale by Simon Kelly

Building for 100x Scale Simon Kelly www.dimagi.com @snopoke

• Who is Dimagi • A story of scale •
Rethinking the system • Implementation • Learnings

About Dimagi Delivering open and innovative technology to underserved communities
everywhere

What we do?

• 500 projects • 60 countries • 20,000 active users
Global footprint

A story of Scale

Senegal National Informed Push for Supply Chain Guatemala Scaling maternal
health, malaria & nutrition app to 9,000 users Ghana Supply Chain, Community Health Worker Expansion Burkina Faso Clinical tools used in 25% of all national clinics India Scaling app to 100,000 Community Health Workers Myanmar Scaling to 12,000 midwives Tanzania Nationally scaling supply chain project Mozambique National Community Health Worker app rollout The story of scale

System Model Mobile Users Cases Data Elements Transactions { "type":
"person", "name": "Mary", "gender": "F", "dob": "1985-04-12" } { "next_visit": "2016-11-04", } • 100 - 20 000 per user • Sharing

Expected System Load (2017) 150 thousand 4 million / month
600 % 600 % Mobile Users Cases 4 billion / month 2500 % Data Elements

Data growth 5 Year timeline • 10 billion cases •
1 trillion data points • 5 petabytes of data

System architecture Redis (cache) Elasticsearch Django Django Django nginx Celery
Celery Celery Stream processing PostgreSQL CouchDB CouchDB Cluster

Rethinking the system

System principles ◦ Good technology fit ◦ Open Source ◦
Cost ◦ Control ◦ Lock in ◦ Mature ◦ Well supported ◦ Reasonable upgrade paths ◦ Good tooling ◦ Horizontally scalable

Principles applied • Cost • Control • Lock in •
Design Horizontal scalability Open Source Maturity Technology Fit Redis (cache) Elasticsearch Django Celery nginx Stream processing PostgreSQL CouchDB Cluster

Rethinking our data High Volume Primary Data Cases, Forms Low
Volume Primary Data Users, Groups, Apps etc. Analytics Data Binary Data Attachments, Multimedia ✓

Evaluating options • Technology fit • Horizontal scalability • Open
Source • Project maturity • Transactional properties / consistency model • Speed / transaction throughput • Secondary index support • Ease of implementation • Maintenance burden

Identifying solutions • Options ◦ Cassandra, HBase, Riak, Elasticsearch, Redis,
PostgreSQL, MongoDB, CouchDB

Identifying solutions • Short list ◦ PostgreSQL ◦ MongoDB ◦
CouchDB • Prototype • Benchmark ◦ Tsung ◦ Variety of workloads Flask PostgreSQL CouchDB mongoDB

Evaluation results • PostgreSQL ◦ Benchmarks ◦ Flexibility of SQL
◦ Mature product ◦ Already in our toolset • Gaining insights ◦ Optimize for reads ◦ Scaling factors ◦ Scale limitations

Implementation • Foundations • Data Model • Interfaces • Sharding

Foundations: Tests • Test suite ◦ Good coverage of code
and use cases ◦ Run on both backends @run_with_all_backends def test_parent_and_child_cases(self): …. run_with_all_backends = functools.partial( run_with_multiple_configs, run_configs=[ RunConfig(settings={'USE_SQL_BACKEND': True}, post_run=self.tearDown()), RunConfig(settings={'USE_SQL_BACKEND': False}, pre_run=self.setUp()), ] )

Foundations: Branching if should_use_sql_backend(project): # SQL specific else: # CouchDB
specific • Code branching mechanism ◦ Override in tests ◦ In production def should_use_sql_backend(project): local_override = get_local_sql_backend_override(project) if local_override is not None: return local_override if settings.UNIT_TESTING: return _should_use_sql_backend_in_tests(project) return USE_SQL_BACKEND_FLAG.enabled(project)

Data Model • Nested JSON objects → Multiple SQL tables
{ "doc_type": "case", "relationships": [{"case_id": "a"}, {"case_id": "b"}], "transactions": [{"id": "1"}, {"id": "2"}], …. } Case CaseRelationship CaseTransaction 1 1 0..* 0..*

Data Access Interface CaseAccessors(project).get_case(case_id) class CaseAccessors(object): def __init__(self, project=None): self.project
= project @property def _db_accessor(self): if should_use_sql_backend(self.project): return CaseAccessorSQL else: return CaseAccessorCouch def get_case(self, case_id): return self._db_accessor.get_case(case_id)

Data Access Implementation class CaseAccessorSQL(AbstractCaseAccessor): @staticmethod def get_case(case_id): return CaseSQL.objects.get(case_id=case_id)
class CaseAccessorCouch(AbstractCaseAccessor): @staticmethod def get_case(case_id): return CaseCouch.get(case_id) class AbstractCaseAccessor(six.with_metaclass(ABCMeta)): @abstractmethod def get_case(case_id): raise NotImplementedError

Implementation • Sharding

Sharding • App layer VS proxy layer Django DB1 DB2
DB3 proxy DB1 DB2 DB3 Django

Sharding “PL/Proxy is a PostgreSQL procedural language handler that allows
you to do remote procedure calls between PostgreSQL databases, with optional sharding.” PL/Proxy DB DB 1 DB 2 DB 3 2N Logical shards mapped to Y databases

PL/Proxy: RUN ON hash() DB 2 DB 1 DB 3
SELECT * FROM get_case_by_id(case_id) Stub function Function implementation PL/Proxy DB get_case_by_id(case_id) get_case_by_id(case_id) hash(case_id) & (2N-1)

PL/Proxy DB get_cases_for_user(user_id) PL/Proxy: RUN ON ALL SELECT * FROM
get_cases_for_user(user_id) DB 1 DB 2 DB 3

PL/Proxy DB get_cases_by_ids(case_ids) PL/Proxy: SPLIT SELECT * FROM get_cases_by_ids(case_ids) DB
1 DB 2 DB 3 split(case_ids)

SQL Functions Actual Function Proxy Function CREATE FUNCTION get_case_by_id(p_case_id TEXT)
RETURNS SETOF case_table AS $$ CLUSTER 'commcare'; RUN ON hash_string(case_id); $$ LANGUAGE plproxy; CREATE FUNCTION get_case_by_id(p_case_id TEXT) RETURNS SETOF case_table AS $$ BEGIN RETURN QUERY SELECT * FROM case_table where case_id = p_case_id; END; $$ LANGUAGE plpgsql;

Un-sharded Environment Final State Sharded Environment unsharded proxy p1 pN
Django unsharded Django PL/Proxy functions SQL functions SQL functions

Running queries from Python • Fetching Django objects • Queries
that don’t return objects CaseSQL.objects.get(case_id=case_id) CaseRelationship.objects.filter(case_id=case_id).values_list('referenced_id') CaseSQL.objects.raw('SELECT * from get_case_by_id(%s)', [case_id])[0] with get_cursor(CaseSQL) as cursor: cursor.execute('SELECT referenced_id FROM get_parent_case_ids(%s)', [case_id]) results = fetchall_as_namedtuple(cursor) return [result.referenced_id for result in results]

Writing data CREATE FUNCTION save_case( case case_table ) AS $$
BEGIN INSERT INTO case_table (case_id, type, properties) VALUES ( case.case_id, case.type, case.properties ); END $$ SELECT save_case(ROW('123','farmer','{"name": "Jo"}')::case_table); cursor.execute('SELECT save_case(%s)', [case]) psycopg2.extensions.register_adapter(CaseSQL, case_adapter) https://github.com/dimagi/commcare-hq/blob/4375b4a1e4107616abe686550fb13ed73542d054/corehq/form_processor/utils/sql.py

App Layer (Django) • Raw queries ◦ Disable Django ORM
queries class DisabledDbMixin(object): def save(self, *args, **kwargs): raise AccessRestricted('Direct object save disabled.') class RestrictedManager(models.Manager): def get_queryset(self): raise AccessRestricted('Only "raw" queries allowed')

App Layer (Django) • DB Router ◦ Queries ◦ Migrations
$ ./manage.py migrate --database=proxy • Tools for managing PL/Proxy cluster $ ./manage.py migrate_multi $ ./manage.py configure_pl_proxy_cluster

New architecture Elasticsearch Django Django Django nginx Redis (cache) Stream
processing Celery Celery Celery CouchDB Cluster proxy p1 pN Kafka RiakCS PostgreSQL

Learnings

Gotchas • Transactions ◦ Django’s connection is with the proxy
◦ ‘proxy’ issues autocommit transactions to shard DB’s ◦ Even if Django rolls back the effects to the shard DB’s persist • Returning results from multiple databases ◦ count_case_in_domain (RUN ON ALL) ◦ one result from each shard DB returned ◦ SELECT sum(c) AS count FROM count_cases_in_domain('x') as t(c); • Limiting / sorting results ◦ ‘limit’ and ‘sort’ operations happens on shard DB’s

Learnings • Talk to others in the industry • Planning

Learnings • Validate decisions with data • But don’t try
too hard

➔ PL/Proxy ➔ Django Multi-Database support ➔ Horizontal scaling with
PL/Proxy ➔ RiakCS ➔ Kafka ➔ www.dimagi.com ➔ www.commcarehq.org ➔ github.com/dimagi/commcare-hq

Building for 100x scale by Simon Kelly

Building for 100x scale by Simon Kelly

More Decks by Pycon ZA

Other Decks in Programming

Featured

Transcript