Distributed ID Generation

Distributed ID Generation @nathankleyn

or, how to make your ID generation less like this:

and more like this:

Requirements

3 main requirements

1 an ID for every event

2 IDs must be unique

3 it must scale well

Theory

Time is hard. Really hard. (Diamond. Hard. See what I
did there?)

Time oracles could solve this problem.

(No, not that oracle)

Time oracles could solve this problem.

NTP tries to solve this problem for the “rest of
us”.

(No, not that NTP?)

NTP tries to solve this problem for the “rest of
us”.

However, expect ±10ms. (at least)

That’s ±10ms per machine.

Hey Bob, what’s the time? It’s 1970! Back in my
day... Fucking Bob always thinks it’s 1970. It’s clearly 1980. Look at my hair! No, Karen. It’s 1990. I’m not wearing this antiquated Disney shirt for kicks.

Time can move backwards or forwards.

1s ≠ 1000ms

Representing Numbers In Binary

Java has many numeric data types.

However it has no unsigned variants.* This is no longer
strictly true in Java 8, however it’s a trick: they’re still signed types, but now there’s a bunch of functions to do unsigned operations on them (eg. unsignedDivide).

So in Java the MSB is for the sign: 1000000000000
1 when negative and 0 when positive. So the above is a short and is -4096.

Some languages make it hard to use the signed bit.

A Java long is 64-bits. So we have 63 usable
bits.

The Epoch

An epoch is a marker of time relative to true
time.

Unix time is an epoch measured relative to 00:00:00.000 1/1/1970.

We can deﬁne our own epoch.

So in 1 year it will be 31,536,000. Unix time
will be ~1,456,348,092,000.

Why is this useful? Because it allows us to compress
the storage of time.

Redis is awesome, fast and stable.

It supports scripting via Lua.

We can create a Lua script to make an ID
inside Redis.

Redis is not distributed.* Redis clustering will arrive in v3.

We can round-robin between a bunch of Redis servers to
achieve distribution.

k-sorting

k-sorting = “roughly sorting”

IDs should provide only k- sorting guarantees.

How It’s Done

Format

Our 64-bit IDs look like this:

ABBBBBBBBBBBBBBBB BBBBBBBBBBBBBBBBB BBBBBBBBCCCCCCCCC CDDDDDDDDDDDD

ABBBBBBBBBBBBBBBBBBBBBBBBB BBBBBBBBBBBBBBBBCCCCCCCCCC DDDDDDDDDDDD A is the reserved signed bit of
a Java long (1 bit).

ABBBBBBBBBBBBBBBBBBBBBBBBB BBBBBBBBBBBBBBBBCCCCCCCCCC DDDDDDDDDDDD B is the timestamp in milliseconds since
custom epoch bits (41 bits).

ABBBBBBBBBBBBBBBBBBBBBBBBB BBBBBBBBBBBBBBBBCCCCCCCCCC DDDDDDDDDDDD C is the logical shard ID (10
bits).

ABBBBBBBBBBBBBBBBBBBBBBBBB BBBBBBBBBBBBBBBBCCCCCCCCCC DDDDDDDDDDDD D is the sequence (12 bits).

The Timestamp

Represented in 41 bits using a custom epoch.

This allows ~69 years of continuous ID generation.

Note this is the ﬁrst part of the ID, so
it has the most bearing on sorting.

Sorting IDs sorts by time strictly, remainder of ID roughly
(ie. k-sorted).

Logical Shard ID

We want to be able to have many Redis servers.

We allow 10 bits for this ID, so we can
have up to 1024 ID generation machines.

We give a ﬁxed ID to each Redis server and
it stamps its IDs with this ever after.

The Sequence

What happens if you ask the same Redis server to
generate multiple IDs in a millisecond?

The sequence ensures IDs are never duplicated when this happens.

We rotate a 12-bit number.

We roll back to 0 when it reaches 4905.

That means a maximum of 4096 IDs per node per
millisecond.

If the sequence rolls over twice in the same millisecond,
we block until the time changes.

Distributing The Load

Simple round-robin between the Redis servers.

Retry 5 times before failing.

Fin Questions?

Distributed ID Generation

Distributed ID Generation

More Decks by Nathan Kleyn

Other Decks in Programming

Featured

Transcript