
Redis Cluster for Write Intensive Workloads


NDC London 2021 (Remote)

When you are working with Redis for write-intensive workloads, Redis Cluster is your friend. It gives you a built-in way to partition your data across instances so that you can scale your writes without being bound to how much load a single instance can handle. However, data partitioning is always a challenge, and Redis Cluster's approach is no exception. At Deliveroo, we are using Redis Cluster in anger for handling write-intensive workloads (e.g. one use case has 10K writes per second while simultaneously handling 300K reads per second). During the time we have been using Redis Cluster, we have learned how the basics of Redis sharding work, how upfront design choices can have a tremendous impact on your performance, and how resharding is handled both on the Redis Cluster side and by Redis clients. In this session, I would like to share those invaluable learnings based on our battle-tested, real-world experience. By the end of the session, you should have a far better idea of how you can scale Redis for your write-intensive workloads, and what kind of surprises might be waiting for you.

https://ndc-london.com/agenda/redis-cluster-for-write-intensive-workloads-0xcp/0e9ytrmxsf1


Tugberk Ugurlu

January 29, 2021

Transcript

  1. Redis Cluster for Write Intensive Workloads

  2. Hello, I’m Tugberk!

  3. None
  4. Sign up and get £10! https://roo.it/tugberku-dgfd

  5. careers.deliveroo.co.uk We’re Growing! Unique challenges, amazing people and great food!

  6. None
  7. Deliveroo Home Feed • Dense areas with a lot of

    restaurants • Making it hard for users to choose from the large selection • Each user's needs are different
  8. Jonny Brooks-Bartlett - How Deliveroo improved the ranking of restaurants -

    PyData London 2019 youtube.com/watch?v=bG95RmVOn0E • Already algorithmically ranking the Restaurant List through a rudimentary Linear Regression model • Desire to personalize this ranking for each user's needs • Predicting which restaurant a user is more likely to order from
  9. • Access to the aggregated user-specific data from the

    ranking service on production • Costly to aggregate on production • Needs to be in sync with the training pipeline and model serving • Need a way to retrieve this data in optimal time for millions of users, while sustaining >1K rps, and to keep this data up to date within a reasonable data consistency lag.
  10. None
  11. The diagram on this slide walks through the ingestion flow:

    • Canoe (aggregating user features): the Canoe pipeline kicks in, aggregates the data for each user, and serializes the data based on a protobuf format in 20-user bundles. • S3 (storing protobuf{ed} features): from the Canoe pipeline, we pick up files which have protobuf data for 20 users and upload them to S3. • SQS (queuing the work for each S3 file): the mapping between S3 and SQS allows us to queue messages into SQS whenever there is a file upload on the S3 bucket. • Lambda (indexing each user's features into Redis Cluster): a Lambda is kicked off by the event source mapping between SQS and the Lambda, and performs the writes. • Redis Cluster (storing the data for O(1) access per user): the Redis Cluster is available to serve reads and writes with 3 primary shards, each having 1 replica. • Access (reading the data from the Redis Cluster): on production, we can access the user-specific features by issuing an O(1) query to the Redis Cluster.
  12. • Data aggregation pipeline bundles 50 records per proto file,

    and uploads them to a known S3 bucket • S3 object creation notification is enqueued to SQS • Lambda instances dequeue from SQS, and write to Redis Cluster
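The glue between those steps is the S3 object-created notification that lands in SQS. A minimal sketch of how a Lambda might pull the bucket and key out of an SQS message body, using only the standard library; the bucket name and object key below are placeholders, not the deck's real values:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// s3Event mirrors the subset of the S3 event notification JSON that
// the Lambda needs: the bucket name and object key for each record.
type s3Event struct {
	Records []struct {
		S3 struct {
			Bucket struct {
				Name string `json:"name"`
			} `json:"bucket"`
			Object struct {
				Key string `json:"key"`
			} `json:"object"`
		} `json:"s3"`
	} `json:"Records"`
}

// objectsFromSQSBody extracts (bucket, key) pairs from an SQS message
// body that carries an S3 object-created notification.
func objectsFromSQSBody(body string) ([][2]string, error) {
	var ev s3Event
	if err := json.Unmarshal([]byte(body), &ev); err != nil {
		return nil, err
	}
	var out [][2]string
	for _, r := range ev.Records {
		out = append(out, [2]string{r.S3.Bucket.Name, r.S3.Object.Key})
	}
	return out, nil
}

func main() {
	// Placeholder bucket/key, standing in for the real feature bundles.
	body := `{"Records":[{"s3":{"bucket":{"name":"user-features"},"object":{"key":"bundles/0001.pb"}}}]}`
	objs, err := objectsFromSQSBody(body)
	if err != nil {
		panic(err)
	}
	fmt.Println(objs[0][0], objs[0][1]) // user-features bundles/0001.pb
}
```

Each extracted object would then be downloaded, decoded from protobuf, and written to Redis Cluster by the Lambda handler.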
  13. None
  14. Allows you to scale the writes as well as the

    reads, which is especially good for unpredictable write workloads Allows you to increase the capacity with zero downtime by adding new shard(s) and performing online resharding Reduces your blast radius, i.e. when a shard goes down, it only affects that portion of your data surface until a failover happens
  15. • Redis installation where data is sharded across multiple Redis

    nodes • These nodes still have the same capabilities as a normal Redis node, and they can have their own replica sets • Redis assigns "slot" ranges (a.k.a. hash slots) for each master node within the cluster
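To make the slot-range idea concrete, here is a sketch of carving the 16384 slots into contiguous ranges for n primary nodes. This is one even split, written for illustration; `redis-cli --cluster create` produces similar contiguous ranges, though its exact boundaries may place the remainder slots differently:

```go
package main

import "fmt"

// slotRanges splits the 16384 hash slots into n contiguous
// [start, end] ranges, distributing any remainder across the
// first ranges. Illustrative only.
func slotRanges(n int) [][2]int {
	const slots = 16384
	out := make([][2]int, 0, n)
	start := 0
	for i := 0; i < n; i++ {
		size := slots / n
		if i < slots%n {
			size++ // spread the leftover slots one per range
		}
		out = append(out, [2]int{start, start + size - 1})
		start += size
	}
	return out
}

func main() {
	// With 3 primary shards, as in the deck's setup:
	fmt.Println(slotRanges(3)) // [[0 5461] [5462 10922] [10923 16383]]
}
```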
  16. tugberkugurlu/redis-cluster usage https://github.com/tugberkugurlu/redis-cluster

  17. • Redis comes with some out-of-the-box commands

    to help you manage your cluster setup
  18. • For a given Redis key, the hash slot for

    that key is the result of CRC16(key) modulo 16384, where CRC16 here is the implementation of the CRC16 hash function • Redis clients can query which node is assigned to which slot range by using the CLUSTER SLOTS command
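The CRC16 named above is the CRC16-CCITT (XMODEM) variant. A minimal sketch of the slot formula, not taken from any particular client library:

```go
package main

import "fmt"

// crc16 implements CRC16-CCITT (XMODEM): polynomial 0x1021,
// initial value 0x0000 — the variant Redis Cluster uses.
func crc16(data []byte) uint16 {
	var crc uint16
	for _, b := range data {
		crc ^= uint16(b) << 8
		for i := 0; i < 8; i++ {
			if crc&0x8000 != 0 {
				crc = crc<<1 ^ 0x1021
			} else {
				crc <<= 1
			}
		}
	}
	return crc
}

// hashSlot maps a Redis key to one of the 16384 cluster hash slots.
func hashSlot(key string) uint16 {
	return crc16([]byte(key)) % 16384
}

func main() {
	// The standard CRC16/XMODEM check value for "123456789" is 0x31C3,
	// so that key's hash slot is 0x31C3 % 16384 = 12739.
	fmt.Printf("%#x -> slot %d\n", crc16([]byte("123456789")), hashSlot("123456789"))
}
```

Clients cache the output of CLUSTER SLOTS and run this computation locally to pick the right node for each key.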
  19. None
  20. None
  21. None
  22. https://www.tugberkugurlu.com/archive/redis-cluster-benefits-of-sharding-and-how-it-works

  23. None
  24. • Gives a managed support for Redis Cluster mode (e.g.

    you don't need to worry about operational handling for resharding, failover, etc.) • Integrates well with our existing infrastructure stack at Deliveroo (e.g. AWS, Terraform, etc.)
  25. https://docs.aws.amazon.com/AmazonElastiCache/latest/red-ug/Replication.Redis-RedisCluster.html

  26. None
  27. None
  28. • READONLY command enables read queries for a connection to

    a Redis Cluster replica node. • RouteRandomly config option allows routing read-only commands to a random master or replica node. • These configurations allow us to distribute the read load across the master and all replicas in a random way, at the cost of a potentially increased data consistency gap.
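Assuming the go-redis client (which the later `redis.Nil` check on slide 33 suggests), the two options above map onto `ClusterOptions` roughly like this; the node addresses are placeholders. A configuration sketch, not the deck's production setup:

```go
package main

import "github.com/go-redis/redis/v8"

// newClusterClient is a minimal configuration sketch assuming
// go-redis v8; the addresses below are placeholders.
func newClusterClient() *redis.ClusterClient {
	return redis.NewClusterClient(&redis.ClusterOptions{
		Addrs:         []string{"node-1:6379", "node-2:6379"},
		ReadOnly:      true, // issue READONLY on replica connections
		RouteRandomly: true, // route read-only commands to a random node
	})
}
```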
  29. • Having tight timeouts allows us to reduce the impact

    of potential issues with Redis on the rest of the application • If we know what to expect from the Redis Cluster in terms of response time, we can tune the timeout to fail early, allowing the rest of the application to keep executing in case of potential issues. • Timeout tuning is a half scientific and half finger-in-the-air process...
  30. None
  31. None
  32. • Simple Redis Set command • The client knows which

    node to send this write request to thanks to its Redis Cluster knowledge
  33. • Simple Redis Get command • The contract between the write

    and read side is the userID • Checking whether the Redis error is of type "redis.Nil", which indicates absence of the key.
  34. None
  35. None
  36. None
  37. None
  38. None
  39. None
  40. Restaurant features

  41. • Multi-command operations such as MGET can only succeed if

    all of the keys belong to the same slot https://www.tugberkugurlu.com/archive/redis-cluster-benefits-of-sharding-and-how-it-works#hash-tags
  42. • Hash tags allow us to force certain keys to

    be stored in the same hash slot. • When the Redis key contains a "{...}" pattern, only the substring between { and } is hashed in order to obtain the hash slot. https://www.tugberkugurlu.com/archive/redis-cluster-benefits-of-sharding-and-how-it-works#hash-tags
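The hash-tag extraction rule can be sketched in a few lines of Go. The `{city:42}`-style key format is illustrative, not the deck's actual key schema:

```go
package main

import (
	"fmt"
	"strings"
)

// hashTag returns the part of a Redis key that Redis Cluster hashes:
// if the key contains a "{...}" pattern with a non-empty tag, only
// the substring between the first "{" and the next "}" counts;
// otherwise the whole key is hashed.
func hashTag(key string) string {
	start := strings.IndexByte(key, '{')
	if start == -1 {
		return key
	}
	end := strings.IndexByte(key[start+1:], '}')
	if end <= 0 { // no "}" after "{", or an empty "{}" tag
		return key
	}
	return key[start+1 : start+1+end]
}

func main() {
	// Keys tagged with the same City ID land in the same hash slot.
	fmt.Println(hashTag("{city:42}:user:1001")) // city:42
	fmt.Println(hashTag("{city:42}:user:2002")) // city:42
	fmt.Println(hashTag("plain-key"))           // plain-key
}
```

Because both tagged keys hash to the same slot, multi-key operations and pipelines over them stay on a single node.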
  43. • None of our access patterns required us

    to go across a city boundary • Therefore, we used the City ID as the hash tag value
  44. • Same as on the write side, we use the City ID

    as the hash tag here to influence shard selection and route us to the same node • Bundling all Redis Get commands within a single TCP connection improves performance by saving on round trips • Pipelined requests run in order, but unlike MGET they do not block other connections
  45. • ~850-1K queries per second • ~9.72ms max p95

    latency for the entire pipeline query
  46. None
  47. None
  48. • Increasing the number of node groups for your ElastiCache

    cluster will kick off an online resharding operation • The new node group(s) will inherit the same number of replicas as the existing node groups
  49. None
  50. • You can increase/decrease the replica count independently of the

    shard count • Note that there was a bug in the Terraform AWS provider regarding this, but it has been fixed; see github.com/hashicorp/terraform-provider-aws/issues/6184 https://docs.aws.amazon.com/AmazonElastiCache/latest/APIReference/API_IncreaseReplicaCount.html
  51. None
  52. None
  53. https://docs.aws.amazon.com/AmazonElastiCache/latest/red-ug/AutoFailover.html#auto-failover-test

  54. None
  55. None

  56. Software Engineer - Mid, Senior, Staff-level • Engineering Manager • Senior

    Software Engineer, Infrastructure • Machine Learning Engineer - Mid, Senior, Staff-level • Data Engineer • Data Scientist - Mid, Senior, Staff-level • Data Science Manager • Locations: London, Remote UK, Remote Poland • See the complete list at https://careers.deliveroo.co.uk/ !
  57. None
  58. None