Modelling prototypes to critical systems with Cassandra

Mistakes were made… Modelling prototypes to critical systems with Cassandra
Matt Heath, Monzo

Hi, I’m Matt

@mattheath

Nov 2015 Mar 2019

Nov 2015 Mar 2019 Fastest growing UK Bank 1.6M Customers

“construct a highly agile and highly available service from ephemeral
and assumed broken components” - Adrian Cockcroﬅ

How does Monzo operate? - AWS, GCP, and physical data
centres - Cloud Native technologies:  Kubernetes, Docker, Calico, Cassandra, Kafka, NSQ, Etcd, Prometheus, Jaeger, Envoy, Elasticsearch… - Go based micro services  ⛅

Feb 2015 Mar 2019 1000 services

Why Cassandra? - High Availability - Tuneable Consistency - Linear
Scalability - Geographic Replication 

eu-west-1a eu-west-1b eu-west-1c

CREATE KEYSPACE account WITH replication = { 'class': 'NetworkTopologyStrategy', 'eu-west-1':
'3' }

eu-west-1a eu-west-1b eu-west-1c

Data Modelling

@jrecursive

Column B Column C Key Column A

Column B Column C ID Column A

Key Value Value Value Column A Column B Column C
Timestamp Timestamp Timestamp

CREATE TABLE IF NOT EXISTS account ( id text, userid
text, created timestamp, currency text, country text, description text, type text, PRIMARY KEY ((id)) );

Key Value Value Value Column A Column B Column C
Timestamp Timestamp Timestamp

acc_as8d… 2018-xx-xx Matt’s account user_007xUi8… “created" “description” “userid” 1526917782000 1526917782000
1526917782000

CREATE TABLE IF NOT EXISTS accounts_by_userid ( id text, userid
text, created timestamp, currency text, country text, description text, type text, PRIMARY KEY ((userid), id) );

Partition Key Row 1 Row 2 Row 3

Partition  Key Row 1 Row 2 Row 3

Partition Key

Partition  Key Row 1 Row 2 Row 3

Partition  Key Row 1 Row inﬁnity

User ID Account 1 Account 2 Account 3 No-one has
inﬁnite accounts!

CREATE TABLE IF NOT EXISTS transaction ( id text, accountid
text, created timestamp, currency text, amount bigint, description text, PRIMARY KEY ((id)) );

Column B Column C Transaction  ID Column A

Column B Column C Transaction  ID Column A Must know
primary key — can’t iterate

CREATE TABLE IF NOT EXISTS transaction_by_account ( id text, accountid
text, created timestamp, currency text, amount bigint, description text, PRIMARY KEY ((accountid), id) );

Account  ID Transaction 1 Transaction  inﬁnity

Timeseries: Partition by Time ⏱

Partition 2 3 4 5 7

Bucket 2 3 4 5 7 Composite Partition Key (Time
range and Account ID) PRIMARY KEY ((accountid, timebucket), created, id)

Day 1 2 3 4 5 7 Composite Partition Key
(Time range and Account ID) PRIMARY KEY ((accountid, timebucket), created, id)

Day 1 2 3 4 5 7 9 12 15
16 17 18 19 20 Day 2 21 22 24 23

Day 1 2 3 4 5 7 9 12 15
16 17 18 19 20 Day 2 21 22 24 23 “Return last 100 transactions”

9 12 15 16 17 18 19 20 Day 3
21 22 24 23 Day 2 Day 1 2 3 4 5 7 No data in this time period

9 12 15 16 17 18 19 20 Day 1000
21 22 24 23 Day 999 Day 1 2 3 4 5 7 Day 998 Day 3 Day 2 … No data in this time period

9 12 15 16 17 18 19 20 Day 1000
21 22 24 23 Day 999 Day 1 Day 998 Day 3 Day 2 … Did the data ever exist?! When do we stop…?

Is your data predictable? How do you choose your bucket
size?

Day 1 2 3 4 5 7 9 12 15
16 17 18 19 20 Day 2 21 22 24 23

Day 1 2 3 4 5 7 9 12 15
16 17 18 19 20 Day 2 21 22 24 23 Hot Partition Key

eu-west-1a eu-west-1b eu-west-1c Partition Key = Day 1

eu-west-1a eu-west-1b eu-west-1c Partition Key = Day 2

Is your data predictable? How do you choose your bucket
size?  Ensure correct partitioning!

Flakeseries: Partition by Time ⏱ Retrieve by ID

Bucket 2 3 4 5 7 Composite Partition Key (Time
range and Account ID) PRIMARY KEY ((accountid, timebucket), created, id)

Bucket 2 3 4 5 7 Time range: We need
to know the timestamp to read PRIMARY KEY ((accountid, timebucket), created, id)

Bucket 2 3 4 5 7 Time range: We need
to know the timestamp to read PRIMARY KEY ((accountid, timebucket), created, id) accountid = acc_00009Wd3Yeh2O329bFTVHF

accountid = acc_00009Wd3Yeh2O329bFTVHF

accountid = acc_00009Wd3Yeh2O329bFTVHF Flake IDs = Time based lexically sortable
IDs

IDs Base62 encoded 128bit Int  eg 26341991268378369512474991263748

IDs Base62 encoded 128bit Int  eg 26341991268378369512474991263748 64 bits - Time in ms since epoch  48 bits - Worker ID 16 bits - Sequence ID

accountid = acc_00009Wd3Yeh2O329bFTVHF

accountid = acc_00009Wd3Yeh2O329bFTVHF transactionid = tx_00009gEBzyFoAtFYllr9Qf

accountid = acc_00009Wd3Yeh2O329bFTVHF transactionid = tx_00009gEBzyFoAtFYllr9Qf PRIMARY KEY ((bucket), flake_created,
transactionid)

accountid = acc_00009Wd3Yeh2O329bFTVHF transactionid = tx_00009gEBzyFoAtFYllr9Qf PRIMARY KEY ((bucket), flake_created,
transactionid) PRIMARY KEY ((accountid, bucket), flake_created, transactionid)

TTLing your data ⏱

TTLing your data

Key 2 3 4 5 7 9 12 15 16
17 18 19 20 Key 21 22 24 23

Key 2 3 4 5 7 9 12 15 16
17 18 19 20 Key 21 22 24 23 Mixed TTLs across same data set 

Key 2 3 4 5 7 9 12 15 16
17 18 19 20 Key 21 22 24 23 Mixed TTLs across same data set  Compacted into same sstables

Key 2 4 5 7 9 12 15 17 18
19 20 Key 21 24 23 First set of rows expire

Key 2 4 7 9 12 15 17 Key 21
24 23 Second set of rows expire

No data is removed until compaction occurs Key 2 3
4 5 7 9 12 15 16 17 18 19 20 Key 21 22 24 23

Time Window Compaction Strategy Create sstables per time range Bucket
1 Bucket 2 9 12 15 16 17 18 19 20 Bucket 3 21 22 24 23 2 3 4 5 7 9 12 15 16 17 18 19 20 21 22 24 23

Time Window Compaction Strategy sstables dropped once all data has
expired Bucket 1 Bucket 2 9 12 15 16 17 18 19 20 Bucket 3 21 22 24 23 2 3 4 5 7 9 12 15 16 17 18 19 20 21 22 24 23

Data modelling take aways - Correct data modelling is incredibly
important! - Wide rows are ok to a point - Repairs on wide rows are problematic - Make Timeseries buckets predictable - Watch for Hot Keys! - TTLs don’t always mean your data is deleted

Nov 2015 Mar 2019 What works here Might not work
here

monzo.com/careers

Modelling prototypes to critical systems with C...

Modelling prototypes to critical systems with Cassandra

More Decks by mattheath

Other Decks in Programming

Featured

Transcript