Building distributed systems with OSS

Building massively distributed systems with OSS Mateusz ‘Serafin’ Gajewski allegro.tech
meeting v8 2015

Web Scale source: reactivemanifest.org

Distributed top-down architecture, computing, messaging, databases (No/New SQL), data processing,
file systems, resource management, infrastructure.

Distributed toolbox dynamic flow control, rate limiting, exponential back-offs, automatic
failover, hinted-handoffs, data scrubbing, CRDTs, backpressure, circuit breakers, bulk heads vector clocks, two-phase commit, consensus algorithms, gossip protocols, leader election, distributed coordination, eventual consistency, data replication, OCC, MVCC...

Thesis: Building distributed & correct systems is very hard. Proof
through: Jepsen :)

Thesis: Most of our problems/needs can be addressed using existing
Open Source Software. Proof through: a lot of companies i.e. Allegro ;)

Just four OSS examples with concepts behind them

Apache Cassandra · 2008

Architecture

SSTable

Read/write path

Will it scale?

Yes it will!

Apache Kafka · 2011

Architecture

Partition structure source: kafka.apache.org

Will it scale?

Apache Spark · 2009

Components source: spark.apache.org

RDD abstraction

Architecture source: spark.apache.org

Does it scale? source: databricks.com

Apache Mesos · 2009

Mesos architecture source: mesos.apache.org

Offers source: mesos.apache.org

Mesos ecosystem source: mesosphere.com

Does it scale?

All you need is... Scalable system = Cassandra as data
storage + Spark as data processing engine + Mesos as resource scheduler + Kafka as core messaging.

Good news: we use it all!

but... OSS cons & pros for your consideration

OSS cons • immature (not production-ready), • bugs, • poor
or misleading documentation, • learning curve, • few or no experts on the market, • slow adoption rate, • dependencies on other OSS, • (sometimes) lack of support

OSS pros • “there is OSS for that” ;) •
licensing, • sources, • speeds up time-to-market • helps recruiting

OSS tips • stay up-to-date, • don’t trust docs -
deep dive instead, • engage with community, • remove OSS barriers - contribute back, • release your software - share, • grow experts in your company - educate, • evaluate-hold-adopt cycle - experiment, • know your hardware & OS - tune, • be patient ;)

Thank you!

Key facts • partitioned, nested, sorted map, • AP system
(with tunable C), • masterless architecture (p2p) with gossip protocol, • multi dc (a)synchronous replication, • consistent hashing (with virtual nodes), • support CQL (query language similar to SQL), • modeled after Dynamo, BigTable.

Key facts • general purpose, distributed data-processing engine, • extends
Map/Reduce & Dryad data flow programming models, • fault tolerance via RDDs, • supports iterative algorithms, map/reduce, stream processing, relational queries & hybrid models, • partial DAG execution

Key facts • distributed, fault tolerant resource scheduler, • provides
performance isolation, • leader election with ZooKeeper, • master maintains soft-state.

Key facts • partitioned, immutable, linearizable append-only log, • CA
system (can lost data during partition), • (a)synchronous replication (tunable), • at-least-once delivery semantics, • ZooKeeper for partition leader election, • ISR (in-sync-replicas set) concept, • relies heavily on OS caches.

Building distributed systems with OSS

Building distributed systems with OSS

More Decks by Mateusz Gajewski

Other Decks in Programming

Featured

Transcript