Table of Contents
- Overview
- Let’s get Started
- Basics of Schnorr Signatures
- Schnorr Signatures
- MuSig
- References
- Contributors
Overview
Private-public key pairs are the cornerstone of much of the cryptographic security underlying everything from secure web browsing to banking to cryptocurrencies. Private-public key pairs are asymmetric. This means that given one of the numbers (the private key), it’s possible to derive the other one (the public key). However, doing the reverse is not feasible. It’s this asymmetry that allows one to share the public key, uh, publicly and be confident that no one can figure out our private key (which we keep very secret and secure).
Asymmetric key pairs are employed in two main applications:
- in authentication, where you prove that you have knowledge of the private key; and
- in encryption, where messages can be encoded and only the person possessing the private key can decrypt and read the message.
In this introduction to digital signatures, we’ll be talking about a particular class of keys: those derived from elliptic curves. There are other asymmetric schemes, not least of which are those based on products of prime numbers, including RSA keys [1].
We’re going to assume you know the basics of elliptic curve cryptography (ECC). If not, don’t stress, there’s a gentle introduction in a previous chapter.
Let’s get Started
This is an interactive introduction to digital signatures. It uses Rust code to demonstrate some of the ideas presented here, so you can see them at work. The code for this introduction uses the tari_crypto library.
Tari makes use of the Ristretto elliptic curve in its cryptography ([Why Risretto?]).
This particular implementation has some nice features. We’ve overridden the +
(addition) and *
(multiplication)
operators so that the Rust code looks a lot more like mathematical formulae. This makes it much easier
to play with the ideas we’ll be exploring.
Basics of Schnorr Signatures
Public and Private Keys
The first thing we’ll do is create a public and private key from an elliptic curve.
On Ristretto, a private key is simply a scalar integer value between 0 and ~2256. That’s roughly how many atoms there are in the universe, so we have a big sandbox to play in.
We have a special point on the Ristretto curve called G, which acts as the “origin”. A public key is calculated by adding G on the curve to itself, \( k_a \) times. This is the definition of multiplication by a scalar, and is written as:
\[P_a = k_a G\]Let’s take an example from this post, where
it is known that the public key for 1
, when written in uncompressed format, is 6a493210f74...73a3b919
.
The following code snippet demonstrates this:
Creating a Signature
Approach Taken
Reversing ECC math multiplication (i.e. division) is pretty much infeasible when using properly chosen random values for your scalars ([5],[6]). This property is called the Discrete Log Problem, and is used as the principle behind many cryptography and digital signatures. A valid digital signature is evidence that the person providing the signature knows the private key corresponding to the public key with which the message is associated, or that they have solved the Discrete Log Problem.
The approach to creating signatures always follows this recipe:
- Generate a secret once-off number (called a nonce), $r$.
- Calculate the public version of the nonce, $R$ from $r$ (where $R = r.G$).
- Send the following to Bob, your recipient - your message ($m$), $R$, and your public key ($P = k.G$).
The actual signature is created by hashing the combination of all the public information above to create a challenge, $e$:
\[e = H(R \| P \| m)\]The hashing function is chosen so that e has the same range as your private keys. In our case, we want something that returns a 256-bit number, so SHA256 is a good choice.
Now the signature is constructed using your private information:
\[s = r + ke\]Bob can now also calculate $e$, since he already knows $m, R, P$. But he doesn’t know your private key, or nonce.
Note: When you construct the signature like this, it’s known as a Schnorr signature, which is discussed in a following section. There are other ways of constructing $s$, such as ECDSA [2], which is used in Bitcoin.
But see this:
\[sG = (r + ke)G\]Multiply out the right-hand side:
\[sG = rG + (kG)e \]Substitute \(R = rG \) and \(P = kG \) and we have: \(sG = R + Pe \)
So Bob must just calculate the public key corresponding to the signature $\text{(}s.G\text{)}$ and check that it equals the right-hand side of the last equation above $\text{(}R + P.e\text{)}$, all of which Bob already knows.
Why do we Need the Nonce?
Why do we need a nonce in the standard signature?
Let’s say we naïvely sign a message $m$ with
\[e = H(P \| m)\]and then the signature would be \(s = ek \).
Now as before, we can check that the signature is valid:
\[\begin{align} sG &= ekG \\\\ &= e(kG) = eP \end{align}\]So far so good. But anyone can read your private key now because $s$ is a scalar, so \(k = {s}/{e} \) is not hard to do. With the nonce you have to solve \( k = (s - r)/e \), but $r$ is unknown, so this is not a feasible calculation as long as $r$ has been chosen randomly.
We can show that leaving off the nonce is indeed highly insecure:
ECDH
How do parties that want to communicate securely generate a shared secret for encrypting messages? One way is called the Elliptic Curve Diffie-Hellman exchange (ECDH), which is a simple method for doing just this.
ECDH is used in many places, including the Lightning Network during channel negotiation [3].
Here’s how it works. Alice and Bob want to communicate securely. A simple way to do this is to use each other’s public keys and calculate
\[\begin{align} S_a &= k_a P_b \tag{Alice} \\\\ S_b &= k_b P_a \tag{Bob} \\\\ \implies S_a = k_a k_b G &\equiv S_b = k_b k_a G \end{align}\]For security reasons, the private keys are usually chosen at random for each session (you’ll see the term ephemeral keys being used), but then we have the problem of not being sure the other party is who they say they are (perhaps due to a man-in-the-middle attack [4]).
Various additional authentication steps can be employed to resolve this problem, which we won’t get into here.
Schnorr Signatures
If you follow the crypto news, you’ll know that that the new hotness (at the time of writing, in 2018) in Bitcoin is Schnorr Signatures.
But in fact, they’re old news! The Schnorr signature is considered the simplest digital signature scheme to be provably secure in a random oracle model. It is efficient and generates short signatures. It was covered by U.S. Patent 4,995,082, which expired in February 2008 [7].
So why all the Fuss?
What makes Schnorr signatures so interesting and potentially dangerous, is their simplicity. Schnorr signatures are linear, so you have some nice properties.
Elliptic curves have the multiplicative property. So if you have two scalars $x, y$ with corresponding points $X, Y$, the following holds:
\[(x + y)G = xG + yG = X + Y\]Schnorr signatures are of the form \( s = r + e.k \). This construction is linear too, so it fits nicely with the linearity of elliptic curve math.
You saw this property in a previous section, when we were verifying the signature. Schnorr signatures’ linearity makes it very attractive for, among others:
- signature aggregation;
- atomic swaps;
- “scriptless” scripts.
Naïve Signature Aggregation
Let’s see how the linearity property of Schnorr signatures can be used to construct a two-of-two multi-signature.
Alice and Bob want to cosign something (a Tari transaction, say) without having to trust each other; i.e. they need to be able to prove ownership of their respective keys, and the aggregate signature is only valid if both Alice and Bob provide their part of the signature.
Assuming private keys are denoted \( k_i \) and public keys \( P_i \). If we ask Alice and Bob to each supply a nonce, we can try:
\[\begin{align} P_{agg} &= P_a + P_b \\\\ e &= H(R_a \| R_b \| P_a \| P_b \| m) \\\\ s_{agg} &= r_a + r_b + (k_a + k_b)e \\\\ &= (r_a + k_ae) + (r_b + k_ae) \\\\ &= s_a + s_b \end{align}\]So it looks like Alice and Bob can supply their own $R$, and anyone can construct the two-of-two signature from the sum of the $Rs$ and public keys. This does work:
But this scheme is not secure!
Key Cancellation Attack
Let’s take the previous scenario again, but this time, Bob knows Alice’s public key and nonce ahead of time, by waiting until she reveals them.
Now Bob lies and says that his public key is \( P_b’ = P_b - P_a \) and public nonce is \( R_b’ = R_b - R_a \).
Note that Bob doesn’t know the private keys for these faked values, but that doesn’t matter.
Everyone assumes that \(s_{agg} = R_a + R_b’ + e(P_a + P_b’) \) as per the aggregation scheme.
But Bob can create this signature himself:
\[\begin{align} s_{agg}G &= R_a + R_b' + e(P_a + P_b') \\\\ &= R_a + (R_b - R_a) + e(P_a + P_b - P_a) \\\\ &= R_b + eP_b \\\\ &= r_bG + ek_bG \\\\ \therefore s_{agg} &= r_b + ek_b = s_b \end{align}\]Better Approaches to Aggregation
In the Key Cancellation Attack, Bob didn’t know the private keys for his published $R$ and $P$ values. We could defeat Bob by asking him to sign a message proving that he does know the private keys.
This works, but it requires another round of messaging between parties, which is not conducive to a great user experience.
A better approach would be one that incorporates one or more of the following features:
- It must be provably secure in the plain public-key model, without having to prove knowledge of secret keys, as we might have asked Bob to do in the naïve approach.
- It should satisfy the normal Schnorr equation, i.e. the resulting signature can be verified with an expression of the form \( R + e X \).
- It allows for Interactive Aggregate Signatures (IAS), where the signers are required to cooperate.
- It allows for Non-interactive Aggregate Signatures (NAS), where the aggregation can be done by anyone.
- It allows each signer to sign the same message, $m$.
- It allows each signer to sign their own message, \( m_i \).
MuSig
MuSig is a recently proposed ([8],[9]) simple signature aggregation scheme that satisfies all of the properties in the preceding section.
MuSig Demonstration
We’ll demonstrate the interactive MuSig scheme here, where each signatory signs the same message. The scheme works as follows:
- Each signer has a public-private key pair, as before.
- Each signer shares a commitment to their public nonce (we’ll skip this step in this demonstration). This step is necessary to prevent certain kinds of rogue key attacks [10].
- Each signer publishes the public key of their nonce, \( R_i \).
- Everyone calculates the same “shared public key”, $X$ as follows:
Note that in the preceding ordering of public keys, some deterministic convention should be used, such as the lexicographical order of the serialized keys.
- Everyone also calculates the shared nonce, \( R = \sum R_i \).
- The challenge, $e$ is $ H(R || X || m) $.
- Each signer provides their contribution to the signature as:
Notice that the only departure here from a standard Schnorr signature is the inclusion of the factor \( a_i \).
The aggregate signature is the usual summation, \( s = \sum s_i \).
Verification is done by confirming that as usual:
\[sG \equiv R + e X \\\]Proof:
\[\begin{align} sG &= \sum s_i G \\\\ &= \sum (r_i + k_i a_i e)G \\\\ &= \sum r_iG + k_iG a_i e \\\\ &= \sum R_i + X_i a_i e \\\\ &= \sum R_i + e \sum a_i X_i \\\\ &= R + e X \\\\ \blacksquare \end{align}\]Let’s demonstrate this using a three-of-three multisig:
Security Demonstration
As a final demonstration, let’s show how MuSig defeats the cancellation attack from the naïve signature scheme. Using the same idea as in the Key Cancellation Attack section, Bob has provided fake values for his nonce and public keys:
\[\begin{align} R_f &= R_b - R_a \\\\ X_f &= X_b - X_a \\\\ \end{align}\]This leads to both Alice and Bob calculating the following “shared” values:
\[\begin{align} \ell &= H(X_a \| X_f) \\\\ a_a &= H(\ell \| X_a) \\\\ a_f &= H(\ell \| X_f) \\\\ X &= a_a X_a + a_f X_f \\\\ R &= R_a + R_f (= R_b) \\\\ e &= H(R \| X \| m) \end{align}\]Bob then tries to construct a unilateral signature following MuSig:
\[s_b = r_b + k_s e\]Let’s assume for now that \( k_s \) doesn’t need to be Bob’s private key, but that he can derive it using information he knows. For this to be a valid signature, it must verify to \( R + eX \). So therefore:
\[\begin{align} s_b G &= R + eX \\\\ (r_b + k_s e)G &= R_b + e(a_a X_a + a_f X_f) & \text{The first term looks good so far}\\\\ &= R_b + e(a_a X_a + a_f X_b - a_f X_a) \\\\ &= (r_b + e a_a k_a + e a_f k_b - e a_f k_a)G & \text{The r terms cancel as before} \\\\ k_s e &= e a_a k_a + e a_f k_b - e a_f k_a & \text{But nothing else is going away}\\\\ k_s &= a_a k_a + a_f k_b - a_f k_a \\\\ \end{align}\]In the previous attack, Bob had all the information he needed on the right-hand side of the analogous calculation. In MuSig, Bob must somehow know Alice’s private key and the faked private key (the terms don’t cancel anymore) in order to create a unilateral signature, and so his cancellation attack is defeated.
Replay Attacks!
It’s critical that a new nonce be chosen for every signing ceremony. The best way to do this is to make use of a cryptographically secure (pseudo-)random number generator (CSPRNG).
But even if this is the case, let’s say an attacker can trick us into signing a new message by “rewinding” the signing ceremony to the point where partial signatures are generated. At this point, the attacker provides a different message, \( e’ = H(…||m’) \) to sign. Not suspecting any foul play, each party calculates their partial signature:
\(s'_i = r_i + a_i k_i e'\) However, the attacker still has access to the first set of signatures: \( s_i = r_i + a_i k_i e \). He now simply subtracts them:
\[\begin{align} s'_i - s_i &= (r_i + a_i k_i e') - (r_i + a_i k_i e) \\\\ &= a_i k_i (e' - e) \\\\ \therefore k_i &= \frac{s'_i - s_i}{a_i(e' - e)} \end{align}\]Everything on the right-hand side of the final equation is known by the attacker and thus he can trivially extract everybody’s private key. It’s difficult to protect against this kind of attack. One way to is make it difficult (or impossible) to stop and restart signing ceremonies. If a multi-sig ceremony gets interrupted, then you need to start from step one again. This is fairly unergonomic, but until a more robust solution comes along, it may be the best we have!
References
[1] Wikipedia: “RSA (Cryptosystem)” [online]. Available: https://en.wikipedia.org/wiki/RSA_(cryptosystem). Date accessed: 2018‑10‑11.
[2] Wikipedia: “Elliptic Curve Digital Signature Algorithm” [online]. Available: https://en.wikipedia.org/wiki/Elliptic_Curve_Digital_Signature_Algorithm. Date accessed: 2018‑10‑11.
[3] Github: “BOLT #8: Encrypted and Authenticated Transport, Lightning RFC” [online]. Available: https://github.com/lightningnetwork/lightning-rfc/blob/master/08-transport.md. Date accessed: 2018‑10‑11.
[4] Wikipedia: “Man in the Middle Attack” [online]. Available: https://en.wikipedia.org/wiki/Man-in-the-middle_attack. Date accessed: 2018‑10‑11.
[5] StackOverflow: “How does a Cryptographically Secure Random Number Generator Work?” [online]. Available: https://stackoverflow.com/questions/2449594/how-does-a-cryptographically-secure-random-number-generator-work. Date accessed: 2018‑10‑11.
[6] Wikipedia: “Cryptographically Secure Pseudorandom Number Generator” [online]. Available: https://en.wikipedia.org/wiki/Cryptographically_secure_pseudorandom_number_generator. Date accessed: 2018‑10‑11.
[7] Wikipedia: “Schnorr Signature” [online]. Available: https://en.wikipedia.org/wiki/Schnorr_signature. Date accessed: 2018‑09‑19.
[8] Blockstream: “Key Aggregation for Schnorr Signatures” [online]. Available: https://blockstream.com/2018/01/23/musig-key-aggregation-schnorr-signatures.html. Date accessed: 2018‑09‑19.
[9] G. Maxwell, A. Poelstra, Y. Seurin and P. Wuille, “Simple Schnorr Multi-signatures with Applications to Bitcoin” [online]. Available: https://eprint.iacr.org/2018/068.pdf. Date accessed: 2018‑09‑19.
[10] M. Drijvers, K. Edalatnejad, B. Ford, E. Kiltz, J. Loss, G. Neven and I. Stepanovs, “On the Security of Two-round Multi-signatures”, Cryptology ePrint Archive, Report 2018/417 [online]. Available: https://eprint.iacr.org/2018/417.pdf. Date accessed: 2019‑02‑21.