snarks <3 hash functions

Slide 1

Slide 1 text

snarks <3 hash functions Rebekah Mercer O(1) Labs

Slide 2

Slide 2 text

Roadmap • The state of the art for arithmetic circuit-friendly hash functions. • What do we need to keep in mind when choosing hash functions for use inside snarks? • Which hash functions best satisfy these requirements?

Slide 3

Slide 3 text

Using SHA2

Slide 4

Slide 4 text

Using SHA2 • Lack of algebraic structure • Working with bytes and words is nice and fast in normal software but doesn’t quite translate to speed in snarks

Slide 5

Slide 5 text

Using SHA2 • SHA2 has 27,446 constraints for a 512 bit message, and proof creation takes ~1.5 seconds. • This means 53.6 constraints per bit processed.

Slide 6

Slide 6 text

Commitments • Commitments only require second pre-image resistance, or potentially collision resistance. • Constructing a low arithmetic complexity hash function that achieves only collision resistance might be simpler!

Slide 7

Slide 7 text

Merkle trees

Slide 8

Slide 8 text

Merkle trees

Slide 9

Slide 9 text

Pedersen • Windowing: hardcoding parameters and using lookups (which are cheaper) to replace exponentiations. • Leads to around 1.7 constraints per bit processed.  ~896 constraints to hash a 512-bit message. • Provable security from DL assumption. With n generators Gi, and message m made of bits mi, we compute: m1G1 + m2G2 + … + mnGn

Slide 10

Slide 10 text

Indifferentiability from a random oracle • Schnorr signatures: using the Fiat-Shamir transform requires the hash function to be modelled as a random oracle. • Proof composition with universal-setup SNARKs (these often use the Fiat-Shamir transform to make the protocols non-interactive without relying on heavier cryptographic machinery).

Slide 11

Slide 11 text

RO hashing to bitstrings • To replace SHA with its full functionality, various hash functions (or pairs of hash functions) have been recently suggested: • MPC-inspired low-complexity hash functions: • [AGRRT16] : MIMChash • [GRRSS16], [AGP+19] : symmetric key primitives for low arithmetic circuit complexity

Slide 12

Slide 12 text

RO hashing to bitstrings • To replace SHA with its full functionality, various hash functions (or pairs of hash functions) have been recently suggested: • HADESMIMC-inspired hash functions: • [AABDS19] : Vision/Rescue • [GKK+19] : STARKAD/POSEIDON  

Slide 13

Slide 13 text

Poseidon

Slide 14

Slide 14 text

Poseidon • Combines full rounds and partial rounds to increase efficiency. • Operations are defined to occur in the field, so have low arithmetic circuit complexity. • Gives 228 constraints in MNT4/6’s case, so 0.3 constraints per bit processed.

Slide 15

Slide 15 text

S-box structure • Poseidon uses xα as its s-box, with α = 3 if x3 is a bijection (which happens only if r != 3k + 1, which leaves only   r = 2 mod 3). • It otherwise recommends 1/α, but you can also choose α as any prime with r != αk + 1 (i.e. α doesn’t divide r - 1) • We choose α = 11.

Slide 16

Slide 16 text

• Rescue uses both xα and x1/α as s-boxes. These give the same complexity in terms of constraints, but x1/α is slow to compute outside of the snark. • Poseidon also uses partial rounds, removing a lot of constraints due to invocations of the s-box itself. • A more thorough comparison coming soon! S-box structure

Slide 17

Slide 17 text

Indifferentiable hashing to elliptic curves: motivation • Veriﬁable random functions (VRFs) • Password authenticated key exchange (PAKE) • BLS signatures

Slide 18

Slide 18 text

• Proof of stake protocols can rely on the existence of a VRF, and the most eﬃcient of these uses a hash function H’ that maps a message into an elliptic curve point H’(m). • A proposer wins the right to propose a block if, for   (hx, hy) = sk H’(m), hx > target value. • Can provide a proof as before, with base H(‘m) instead of base g. Veriﬁable random functions

Slide 19

Slide 19 text

Slide 20

Slide 20 text

• Proof of stake protocols can rely on the existence of a VRF, and the most eﬃcient of these uses a hash function H’ that maps a message into an elliptic curve point H’(m). • A proposer wins the right to propose a block if, for   (hx, hy) = sk H’(m), hx > target value. • The proof looks just like Schnorr, but with base H’(m) instead of base g. Veriﬁable random functions

Slide 21

Slide 21 text

r q R = (rx, ry) = rH’(m) z = cx + r Veriﬁable random functions rx, z Compute c, compute R, verify   zH’(m) = ch + R c = H(R, H’(m), h) H’, m, h x, H’(m), h = H’(m)x

Slide 22

Slide 22 text

r q R = (rx, ry) = rH’(m) z = cx + r Veriﬁable random functions rx, z Compute c, compute R, verify   zH’(m) = ch + R c = H(R, H’(m), h) H’, m, h x, H’(m), h = H’(m)x

Slide 23

Slide 23 text

Snarks with private input • Simulation extractability • Ensures that if you can produce a valid transaction, then a valid witness can be extracted. • [BG18] transforms Groth16 into a simulation extractable SNARK. • It achieves this by including a message hashed to an elliptic curve point alongside the proof.

Slide 24

Slide 24 text

snark-friendly hashing to elliptic curves

Slide 25

Slide 25 text

Embedded curves With = E(q), (x, y) ∈ means (x, y) ∈ q × q Arithmetic inside the snark happens in r. So what if we can define a curve with (x, y) ∈ r × r? Then we would be able to do elliptic curve arithmetic inside a snark! efficiently! A curve built with coordinates and a group operation defined in r for this reason is referred to as an embedded curve.

Slide 26

Slide 26 text

Try and increment

Slide 27

Slide 27 text

Try and increment Finding whether or not f(x) is a quadratic residue takes an exponentiation. Square rooting (if f(x) is in QR), can be done via methods with very diﬀerent eﬃciency depending on your prime.

Slide 28

Slide 28 text

Try and increment Approximately half of x ∈ r give f(x) ∈ QR. k is a parameter set so that the probability of the algorithm terminating without giving an answer, assuming the QRs are well distributed throughout r, is 1/2k. But setting k to a reasonable parameter leads to huge circuits.

Slide 29

Slide 29 text

h(m)G: BLS signatures • pk = xG, s = xH(m), veriﬁcation is e(H(m), pk) = e(s, G). • With H(m) = h(m)G, we can produce s = xH(m) as h(m)pk and forge a signature for any m and pk. • Constraints = costRO + exponentiation (228 * log(r) * 2) s = xH(m)

Slide 30

Slide 30 text

Icart-style encodings • These maps look like H = f(h(u)). • The basic map of [Icart09] achieves one-wayness (if h is one-way). • The extension in [BCI+10] to f(h(m)) + gh(m) achieves indistinguishability from a random oracle, assuming that h(m) achieves this. (Think of gh(m) as a one-time pad.)

Slide 31

Slide 31 text

Slide 32

Slide 32 text

Icart-style encodings • The basic function of [Icart09] achieves one-wayness (if h is one-way). • The extension in [BCI+10] to f(h0(u)) + f(h1(u)) achieves indiﬀerentiability from a random oracle, assuming that h0(u) and h1(u) achieve this. (Think of f(h1(u)) as a one-time pad.) • Assumes p = 2 mod 3.

Slide 33

Slide 33 text

Slide 34

Slide 34 text

Slide 35

Slide 35 text

Skałba/SWU-style encodings • The map has the form, with g the curve equation,   g(x1) g(x2) g(x3) = u2  which guarantees that one of g(x1), g(x2), or g(x3) is a square, and so in QR, and so we can use it to construct a point (x, y) ∈ E(r).

Slide 36

Slide 36 text

Skałba/SWU-style encodings From [BCIMRT10]

Slide 37

Slide 37 text

• [BCI+10] extends and simpliﬁes the SWU encoding, moving it from just achieving one-wayness to also achieving indiﬀerentiability from a random oracle, but only with r = 3 mod 4. • [WB19] then removes this restriction and generalises, proposing a construction for ab != 0, and one that works even in this case. Skałba/SWU-style encodings

Slide 38

Slide 38 text

What’s special about hashing inside snarks? • Randomised functions! • We can make the computations faster if we can verify, rather than compute, the hash function inside of the snark. • Imagine, rather than performing a square root of α with Tonelli-Shanks, we instead give the square root of α, called x, as input, and the snark circuit checks x*x = α. • This takes us from O((log r)3) run time to O(1).

Slide 39

Slide 39 text

• t = hRO(m) • We have f(x1(t)), … f(x3(t)), and pick the ﬁrst square. • Checking squareness outside of snark: exponentiation. • Checking squareness inside of snark: the formula is simple, but is an exponentiation, so requires O(log r) constraints. Simplifying WB19 for snarks

Slide 40

Slide 40 text

• Precompute β ∈ NQR. (This is hardcoded.) • Then either: • If a ∈ QR, there exists x with x2 = a mod r, so submit x. • Or if a ∈ NQR, sqrt(aβ) exists — for any a ∈ NQR, β ∈ NQR, we have that aβ ∈ QR. Then submit x with x2 = aβ mod r. • Then we have:  x, b = witness,   assert xx = ba + (1 - b) aβ,   return b Proving a ∈ QR or a ∈ NQR

Slide 41

Slide 41 text

Summary • Hashing to elliptic curves inside snarks is a little different! We need deterministic time algorithms that are finite field friendly. • Functions for hashing to elliptic curves are already fairly finite field friendly! • Depending on the curve equation of the embedded curve, there are different algorithms with different efficiencies for you to use. • snarks <3 hash functions

Slide 42

Slide 42 text

Thanks! • Hashing with collision resistance/ second preimage resistance: use Pedersen hashes (see https:// github.com/zcash/zcash/issues/ 2234) • Hashing with indiﬀerentiability from a RO: use Poseidon • Hashing to an elliptic curve with indiﬀerentiability from a RO: use WB19 • snarks <3 hash functions

Slide 43

Slide 43 text

RO hashes [AGRRT16] : M. Albrecht, L. Grassi, C. Rechberger, A. Roy, and T. Tiessen. MiMC: Eﬃcient encryption and cryptographic hashing with minimal multiplicative complexity. https://eprint.iacr.org/2016/492 [GRRSS16] : L. Grassi, C. Rechberger, D. Rotaru, P. Scholl, and N. P. Smart. MPC- Friendly Symmetric Key Primitives. https://eprint.iacr.org/2016/542 [AGP+19] : M. R. Albrecht, L. Grassi, L. Perrin, S. Ramacher, C. Rechberger, D. Rotaru, A. Roy, and M. Schofnegger. Feistel Structures for MPC, and More. https:// eprint.iacr.org/2019/397    [AABDS19] : A. Aly, T. Ashur, E. Ben-Sasson, S. Dhooghe, and A. Szepieniec. Design of Symmetric-Key Primitives for Advanced Cryptographic Protocols. https:// eprint.iacr.org/2019/426 [GKK+19] : L. Grassi, D. Kales, D. Khovratovich, A.Roy, C. Rechberger, and M. Schofnegger. Starkad and Poseidon: New Hash Functions for Zero Knowledge Proof Systems. https://eprint.iacr.org/2019/458

Slide 44

Slide 44 text

RO hashes to elliptic curves [SvdW06] A. Shallue and C. van de Woestijne. Construction of rational points on elliptic curves over finite fields. [Ulas07] M. Ulas. Rational points on certain hyperelliptic curves over finite fields. https://arxiv.org/pdf/0706.1448.pdf [Icart09] T. Icart. How to Hash into Elliptic Curves. https://eprint.iacr.org/2009/226 [BCI+10] E. Brier, J. Coron, T. Icart, D. Madore, H. Randriam, and M. Tibouchi. Efficient Indifferentiable Hashing into Ordinary Elliptic Curves. https://eprint.iacr.org/ 2009/340 [BHKL13] D. J. Bernstein, M. Hamburg, A. Krasnova, and T. Lange. Elligator. https:// elligator.cr.yp.to [WB19] R. S. Wahby and D. Boneh. Fast and simple constant-time hashing to the BLS12-381 elliptic curve. https://eprint.iacr.org/2019/403