Clarify onionmessage decryption #1179

rustyrussell · 2024-07-09T12:40:33Z

I recently re-implemented this to get our code production ready, and I found the spec unclear and even misleading. This series seeks to rework it to be clearer and more coherent.

Most pointedly, it's now clear that blinding applies to the onion, not the internal encrypted blob: the spec was confused about this at various points. The onion decoding is now specified exactly, and is general. The old references to "realms" (from legacy onions) are removed.

thomash-acinq

blinding applies to the onion, not the internal encrypted blob.

That's not what's implemented in eclair and phoenix, and it contradicts the Route Blinding Requirements section.
I guess the error comes from the fact that there are two ways to use the blinding:

For the onion itself we use $B_i = HMAC256(\text{"blinded\_node\_id"}, ss_i) * N_i$
For the encrypted blob inside the onion we use $rho_i = HMAC256(\text{"rho"}, ss_i)$

thomash-acinq · 2024-07-09T13:53:56Z

04-onion-routing.md

+1. an initial introduction point (`first_node_id`)
+2. an initial tweak to modify the first node_id to decrypt the onion (`blinding`)
+3. a series of tweaked node ids (`path.blinded_node_id`)
+4. a series binary blobs encrypted to the real node ids (`path.encrypted_recipient_data`)


This is incorrect, the blob is not encrypted to the real node id, but with a key that is derived from both the blinding and the node's private key:
$ss_i = SHA256(e_i * N_i) = SHA256(k_i * E_i)$ (ECDH shared secret known only by $N_r$ and $N_i$)
$rho_i = HMAC256(\text{"rho"}, ss_i)$ (key used to encrypt the payload for $N_i$ by $N_r$)

I believe this is what we have in LDK as well, although I'm not too familiar with the notation being used. E.g., for ss_0, we compute ecdh(N_0, session_priv), which I think is equivalent to SHA256(e_0 * N_0) as in the above comment. Eclair is the most interop tested with us so far so pretty sure we're on the same page.

This is incorrect, the blob is not encrypted to the real node id, but with a key that is derived from both the blinding and the node's private key: $ss_i = SHA256(e_i * N_i) = SHA256(k_i * E_i)$ (ECDH shared secret known only by $N_r$ and $N_i$) $rho_i = HMAC256(\text{"rho"}, ss_i)$ (key used to encrypt the payload for $N_i$ by $N_r$)

Well, yes. I should spell that out, indeed you use that same blinding as the ECDH ephemeral key.

What you don't do, is $ss_i = SHA256(e_i * B_i)$. That would be a valid construction (which I would call "encrypting to the blinded node pubkey"). i.e. you don't use blinding as blinding at all here...

i.e. the onion is encrypted to the blinded node id, the inner encrypted_recipient_data is not. This is important!
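The distinction drawn above can be checked numerically. The following is a minimal sketch, not taken from any implementation: it uses a toy, non-constant-time pure-Python secp256k1, demo keys derived from fixed strings, and it assumes that ECDH hashes the compressed point and that the HMAC output is reduced modulo the group order before use as a scalar. The helper names (`add`, `mul`, `ecdh`, `demo_scalar`) are hypothetical.

```python
import hashlib
import hmac

# secp256k1 parameters: field prime, group order, generator.
P = 2**256 - 2**32 - 977
N = int("FFFFFFFF" "FFFFFFFF" "FFFFFFFF" "FFFFFFFE"
        "BAAEDCE6" "AF48A03B" "BFD25E8C" "D0364141", 16)
G = (int("79BE667E" "F9DCBBAC" "55A06295" "CE870B07"
         "029BFCDB" "2DCE28D9" "59F2815B" "16F81798", 16),
     int("483ADA77" "26A3C465" "5DA4FBFC" "0E1108A8"
         "FD17B448" "A6855419" "9C47D08F" "FB10D4B8", 16))

def add(a, b):
    """Add two affine points; None represents the point at infinity."""
    if a is None:
        return b
    if b is None:
        return a
    if a[0] == b[0] and (a[1] + b[1]) % P == 0:
        return None
    if a == b:
        lam = 3 * a[0] * a[0] * pow(2 * a[1], -1, P) % P
    else:
        lam = (b[1] - a[1]) * pow((b[0] - a[0]) % P, -1, P) % P
    x = (lam * lam - a[0] - b[0]) % P
    return (x, (lam * (a[0] - x) - a[1]) % P)

def mul(k, pt):
    """Double-and-add scalar multiplication (toy code, not constant time)."""
    acc = None
    while k:
        if k & 1:
            acc = add(acc, pt)
        pt = add(pt, pt)
        k >>= 1
    return acc

def ser(pt):
    """33-byte compressed point encoding."""
    return bytes([2 + (pt[1] & 1)]) + pt[0].to_bytes(32, "big")

def ecdh(priv, pub):
    """Assumption: the shared secret is SHA256 of the compressed ECDH point."""
    return hashlib.sha256(ser(mul(priv, pub))).digest()

def hmac256(key, msg):
    return hmac.new(key, msg, hashlib.sha256).digest()

def demo_scalar(label):
    """Deterministic demo key; real keys must be random."""
    return int.from_bytes(hashlib.sha256(label).digest(), "big") % N

k_i = demo_scalar(b"node private key")       # k_i, known only to N_i
e_i = demo_scalar(b"path ephemeral key")     # e_i, known only to N_r
N_i, E_i = mul(k_i, G), mul(e_i, G)

ss_sender = ecdh(e_i, N_i)                   # SHA256(e_i * N_i)
ss_node = ecdh(k_i, E_i)                     # SHA256(k_i * E_i)

# Blinding tweak and blinded node id B_i: the key the onion is encrypted to.
h = int.from_bytes(hmac256(b"blinded_node_id", ss_node), "big") % N
B_i = mul(h, N_i)

# rho_i: the key for encrypted_recipient_data, derived from the UNBLINDED ECDH.
rho_i = hmac256(b"rho", ss_node)

# The construction that is NOT used: ECDH against the blinded node id.
ss_alt = ecdh(e_i, B_i)
```

Both sides derive the same `ss_i`, the node can recover the blinded private key as `h * k_i`, and `ss_alt` differs from `ss_i`, which is why "encrypted to the blinded node id" describes the onion but not the inner blob.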

thomash-acinq · 2024-07-09T14:03:32Z

04-onion-routing.md

@@ -290,13 +295,11 @@ The reader:
  - If `encrypted_recipient_data` is present:
    - If `blinding_point` is set in the incoming `update_add_htlc`:
      - MUST return an error if `current_blinding_point` is present.
-      - MUST use that `blinding_point` as the blinding point for decryption.


Why did you remove this? blinding_point needs to be used as $E_i$ for the Route Blinding Requirements section.

Yes, but it's not used as a blinding point. I changed this and the next one:

    - If `blinding_point` is set in the incoming `update_add_htlc`:
      - MUST return an error if `current_blinding_point` is present.
      - MUST use `blinding_point` as $`E_i`$
    - Otherwise:
      - MUST return an error if `current_blinding_point` is not present.
      - MUST use `current_blinding_point` as $`E_i`$
      - SHOULD add a random delay before returning errors.
    - MUST return an error if `encrypted_recipient_data` does not decrypt to a valid `encrypted_data_tlv` as described in [Route Blinding](#route-blinding).

?
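As a rough illustration of the proposed reader rule, the selection of $E_i$ could look like the sketch below. The function name `select_e_i` and the `OnionError` type are hypothetical, and the sketch only covers the case where `encrypted_recipient_data` is present; the random delay and the actual decryption are left out.

```python
from typing import Optional

class OnionError(Exception):
    """Hypothetical error type standing in for 'MUST return an error'."""

def select_e_i(blinding_point: Optional[bytes],
               current_blinding_point: Optional[bytes]) -> bytes:
    """Pick E_i when encrypted_recipient_data is present in the payload.

    blinding_point comes from the incoming update_add_htlc;
    current_blinding_point comes from the onion payload itself.
    Exactly one of the two must be set.
    """
    if blinding_point is not None:
        if current_blinding_point is not None:
            raise OnionError("current_blinding_point must not be present")
        return blinding_point
    if current_blinding_point is None:
        raise OnionError("current_blinding_point must be present")
    return current_blinding_point
```

For example, `select_e_i(None, key)` returns `key` (the introduction-node case), while passing both values or neither raises `OnionError`.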

thomash-acinq · 2024-07-09T14:05:06Z

04-onion-routing.md

    - Otherwise:
      - MUST return an error if `current_blinding_point` is not present.
-      - MUST use that `current_blinding_point` as the blinding point for decryption.
+      - MUST use that `current_blinding_point` as `E_i` to derive the following blinding point.


I don't understand what you mean by "following blinding point" here.

This is wrong, it's used both to derive the ss and to derive the next blinding point. Will fix...

thomash-acinq · 2024-07-09T14:16:45Z

04-onion-routing.md

+  - if `blinding` is specified:
+    - Calculate the `blinding_ss` as ECDH(`blinding`, `node-privkey`)
+    - Tweak `public_key` by multiplying by $`HMAC256(\text{"blinded\_node\_id"}, blinding_ss)`$


This is confusing. We need to tweak the private key. And then we need to use the tweaked key instead of the public one.

This is the alternative construction, where we tweak the onion ephemeral key (which is equivalent). (CLN needs to do this because we have an HSM and we didn't want to teach it to tweak, but it's also easier in the text).

We could specify both, but we do note the private key method alternative in the rationale?
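The equivalence of the two constructions follows from the commutativity of scalar multiplication. In the sketch below the symbols are chosen for illustration only: $h$ is the tweak $HMAC256(\text{"blinded\_node\_id"}, blinding\_ss)$, $k$ is the node's private key, and $E = e \cdot G$ is the ephemeral `public_key` carried in the onion.

$$
(h \cdot k) \cdot E = (h \cdot k \cdot e) \cdot G = k \cdot (h \cdot E)
$$

The left-hand side is the ECDH point obtained by tweaking the private key; the right-hand side is the one obtained by tweaking the onion's ephemeral key and using the untweaked private key. Both yield the same shared secret, so a node whose private key lives in an HSM can apply the tweak outside of it.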

thomash-acinq · 2024-07-09T14:28:06Z

04-onion-routing.md

+  - $`E_0 = e_0 \cdot G`$
+  - For every node in the route:
+    - let $`N_i = k_i * G`$ be the `node_id` ($`k_i`$ is $`N_i`$'s private key)
+    - $`ss_i = SHA256(e_i * N_i) = SHA256(k_i * E_i)$` (ECDH shared secret known only by $`N_r`$ and $`N_i`$)


Suggested change

- $`ss_i = SHA256(e_i * N_i) = SHA256(k_i * E_i)$` (ECDH shared secret known only by $`N_r`$ and $`N_i`$)

- $ss_i = SHA256(e_i * N_i) = SHA256(k_i * E_i)$ (ECDH shared secret known only by $N_r$ and $N_i$)

Markdown rendering is broken because of a misplaced `, we should probably remove all the ` when we already have $.

Wow, I didn't even know that worked. But if we take out ` the formatting changes from monospaced to normal, which would be confusing unless we do it everywhere.

Fixed for now...

Revisiting our implementation recently, I found it difficult to follow. In particular:
1. The onion is encrypted to the blinded node id, including (slightly redundantly) the introduction point.
2. The encrypted payload is encrypted to the *unblinded* node id.
Make this clear, and give an example. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

There's currently a *description* of how to decrypt an onion, and some requirements in forwarding. But it also applies to onion messages, so:
1. Turn the description into actual enumerated requirements.
2. Ensure the description covers both payload and messaging onions.
3. Leave the actual handling of the extracted payload (payment vs messaging onion) to those specific sections (e.g. reporting failure)
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

…sages and payments. Simply tie the sections together, and put a section on how to handle blinding when decrypting onions. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

We use blinding for the onion, not the `encrypted_recipient_data`. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

1. The writer creates a `blinded_path` object, so tweak requirements to refer directly to how to populate those fields.
2. This isn't just for payments, but also onion messages, so make language clearer.
3. There are two readers: the sender, who uses the blinded path to create an onion, and the other nodes who decrypt the encrypted_recipient_data. Make separate requirements for each.
4. Put the `encrypted_data_tlv` definition into its own section, even though requirements are in onion messages / payload sections. Some rationale remains.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

This was from the legacy onion, and is no longer present. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

You can't actually generate encrypted_data_tlv until you've created the blinding points for the shared secrets (needed for both tweaking the outer onion and decrypting the `encrypted_recipient_data`). It makes the explanation more complex, but the previous one glossed over too much. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell · 2024-07-10T02:49:49Z

OK, so I rolled in Thomas' feedback, which led me to think that we should simply avoid calling the point "blinding" (as that's not all it's used for). Example commit which changes it to "path_key" at the end?

I think this is clearer. WDYT?

…ng blinding
We are handed an ephemeral key (E_i) to derive a shared secret, which derives both the blinding tweak for the onion, *and* the key to decrypt the encrypted_recipient_data. But we call it "blinding", which is confusing! How about we call it "path_key"?
1. update_add_htlc_tlv's "blinding_point.blinding" -> "blinded_path.path_key"
2. payload's "current_blinding_point" -> "current_path_key"
3. blinded_path's "blinding" -> "first_path_key".
4. encrypted_data_tlv's "next_blinding_override.blinding" -> "next_path_key_override.path_key"
This sweep found other changes:
1. Writer of the TLV `payload`: simply refer here to Route Blinding requirements which says how to use the `blinded_path` if we have one (and gets it right on the two ways to use current_blinding_point).
2. Refer concretely to `blinded_path` (a type) rather than "blinded route" (a concept).
3. Use the term `encrypted_recipient_data` everywhere for consistency, and `encrypted_data_tlv` once it's decrypted.
4. Note explicitly that you can't use the "next_path_key_override" if the prior node doesn't support route blinding.
5. Header "Blinding Ephemeral Keys" -> "Blinding Ephemeral Onion Keys" to avoid confusion with blinded paths!
6. When using `reply_path` for onion messages, simply refer to the Route Blinding requirements.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

t-bast

I stopped reviewing this half-way, when I reached the incorrect statement that Alice encrypts the onion to (Bob', Carol', Dave'). As detailed in my comment below, this is not how it works, and leads me to think that this updated version is actually less clear than the previous one.

But I do see where the confusion comes from: there is a fundamental difference between onion messages and payments, because onion messages combine two blinded paths while payments only use the recipient path. That's why I think the way it is specified currently on master makes more sense: how you decrypt the onion only depends on whether you received a path key or not (if you receive a path key, the onion is encrypted for your blinded node_id, otherwise it's encrypted to your normal node_id). Then, once you have a decrypted onion, if it contains an encrypted_data_tlv, you will use the path key you must have received somehow to derive the shared secret to decrypt it. That is a pretty simple rule to follow, so the reader requirements should be very straightforward. We may be missing some detailed writer requirements however, which is where there are different cases for payments and onion messages.

I think we can create a smaller patch to clear up those confusions than what this PR does, I'd like to try a different approach for that.

I like the renaming of blinding points to path keys though!

t-bast · 2024-07-10T13:07:32Z

02-peer-protocol.md

@@ -2040,8 +2040,10 @@ A receiving node:
  - if other `id` violations occur:
    - MAY send a `warning` and close the connection, or send an
      `error` and fail the channel.
-  - if `blinding_point` is provided:
-    - MUST use the corresponding blinded private key to decrypt the `onion_routing_packet` (see [Route Blinding](04-onion-routing.md#route-blinding))
+  - MUST decrypt `onion_routing_packet` as specified in [Onion Decryption](04-onion-routing.md#onion-decryption) using `payment_hash` as `associated_data` (and `path_key` if specified).


nit: it's a bit confusing how `path_key` is mentioned here: it reads as if we should include it in the `associated_data` (which is not the case). What about:

Suggested change

- MUST decrypt `onion_routing_packet` as specified in [Onion Decryption](04-onion-routing.md#onion-decryption) using `payment_hash` as `associated_data` (and `path_key` if specified).

- MUST decrypt `onion_routing_packet` as specified in [Onion Decryption](04-onion-routing.md#onion-decryption):
  - MUST use `path_key` if specified to derive the onion decryption key.
  - MUST use `payment_hash` as `associated_data`.

I've done that change in t-bast@c073ef8 to simplify updates.
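To illustrate why the two inputs are kept apart: `path_key` only influences which private key feeds the ECDH that produces the shared secret, whereas `payment_hash` enters the packet HMAC as associated data. The sketch below assumes the usual BOLT 4 layout (an HMAC keyed with the `mu` key, computed over the hop payloads followed by the associated data); the function name `onion_hmac_ok` and the demo values are hypothetical.

```python
import hashlib
import hmac

def hmac256(key: bytes, msg: bytes) -> bytes:
    return hmac.new(key, msg, hashlib.sha256).digest()

def onion_hmac_ok(shared_secret: bytes, hop_payloads: bytes,
                  associated_data: bytes, packet_hmac: bytes) -> bool:
    """Check the packet HMAC for one onion layer.

    shared_secret is the ECDH output; if a path_key was received, it is
    computed with the tweaked (blinded) key, which is the only place
    path_key matters here. associated_data is payment_hash for payments.
    """
    mu = hmac256(b"mu", shared_secret)
    expected = hmac256(mu, hop_payloads + associated_data)
    return hmac.compare_digest(expected, packet_hmac)

# Demo values only: a fake shared secret, payload area, and payment hash.
ss = hashlib.sha256(b"demo shared secret").digest()
payloads = b"\x00" * 1300
payment_hash = hashlib.sha256(b"demo preimage").digest()
good_hmac = hmac256(hmac256(b"mu", ss), payloads + payment_hash)
```

With these values, `onion_hmac_ok(ss, payloads, payment_hash, good_hmac)` succeeds, while the same packet checked against different associated data fails.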

t-bast · 2024-07-10T13:08:36Z

02-peer-protocol.md

+  - MUST decrypt `onion_routing_packet` as specified in [Onion Decryption](04-onion-routing.md#onion-decryption) using `payment_hash` as `associated_data` (and `path_key` if specified).
+  - If that fails, or the payload is not a valid `payload` TLV:
+    - MUST report the failure to the origin node as described in [Returning Errors](04-onion-routing.md#returning-errors)
+  - MUST follow the requirements for processing the payload under [Failure Messages](04-onion-routing.md#failure-messages)


nit: indenting is wrong:

Suggested change

- MUST follow the requirements for processing the payload under [Failure Messages](04-onion-routing.md#failure-messages)

- MUST follow the requirements for processing the payload under [Failure Messages](04-onion-routing.md#failure-messages)

t-bast · 2024-07-10T14:13:07Z

04-onion-routing.md

+Alice encrypts an onion to Bob', Carol', Dave' and gives it to Bob
+with the `first_path_key`.


When used for payments, Alice doesn't encrypt the onion to Bob', she encrypts it to Bob and includes the first_path_key in the onion. That's what allows nodes between Alice and Bob to be unaware that a blinded path is used downstream. If the onion was encrypted to Bob', Bob would have no way of obtaining the path_key to decrypt it.

Alice uses Bob' only when she combines a blinded path from herself to Bob with the blinded path from Bob to Dave (which is what is done for onion messages).

In both cases, Bob only uses that path key to decrypt the encrypted_data_tlv: it's only nodes that are not the introduction node that use the path key to decrypt the onion.

I clarified that in t-bast@c073ef8

I agree on the details, just not the framing.

I think of the broad case which I'm describing here as "encrypt to the entire blinded chain", with a special exception for the payment case.

This is spelled out clearly in the new requirements (which I just noticed did not replace blinding so uses the old terminology):

The reader of the `blinded_path`:
  - MUST create its own onion to reach the `first_node_id`
  - For the first entry in `path`:
    - if it is sending a payment:
      - MAY encrypt the first blinded path onion to `first_node_id` and include `blinding` as `current_blinding_point`
    - if it does not do that:
      - MUST encrypt the first blinded path onion to the first `blinded_node_id`.
      - MUST set `next_blinding_override` in the prior onion payload to `blinding`.
  - MUST include the first `path` `encrypted_recipient_data` in each onion payload within the blinded path.

And in the rationale:

Note that there are two ways for the sender to reach the introduction point: one is to create a normal (unblinded) payment, and place the initial blinding point in `current_path_key` along with the `encrypted_recipient_data` in the onion payload for the introduction point to start the blinded path. The second way (which is the only way for onion messages) is to create a blinded path to the introduction point, set `next_path_key_override` inside the `encrypted_data_tlv` on the hop prior to the introduction point to the `first_path_key`, so it is sent to the introduction node. However, this only works if that prior node supports blinded paths.
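The two options described in the rationale can be summarized in a small decision sketch. Everything here is illustrative: `IntroductionPlan` and `plan_introduction` are hypothetical names, and the string values merely label which field or key is involved, using the renamed `path_key` terminology.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class IntroductionPlan:
    # Key that the introduction node's onion layer is encrypted to.
    encrypt_onion_to: str
    # Value placed in the introduction node's own payload, if any.
    current_path_key: Optional[str]
    # Value placed in the encrypted_data_tlv of the hop before it, if any.
    next_path_key_override: Optional[str]

def plan_introduction(is_payment: bool, unblinded_intro: bool) -> IntroductionPlan:
    """Choose how the sender reaches the introduction point."""
    if is_payment and unblinded_intro:
        # Way 1: normal onion to first_node_id; path key travels in the payload.
        return IntroductionPlan("first_node_id", "first_path_key", None)
    # Way 2 (the only way for onion messages): blinded onion all the way;
    # the prior hop hands over the path key, so it must support blinded paths.
    return IntroductionPlan("path[0].blinded_node_id", None, "first_path_key")
```

Calling `plan_introduction(is_payment=False, unblinded_intro=True)` still yields the second plan, reflecting that onion messages have no unblinded option.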

t-bast · 2024-07-10T15:01:43Z

I think that one of the things that makes this PR hard to review is that 5983830 and 570f6fa are unrelated to route blinding and could be extracted into their own PR. That PR would be easier to review on its own, and we should reach agreement quickly on it and merge it as a first step.

rustyrussell · 2024-07-10T23:31:00Z

I think that one of the things that makes this PR hard to review is that 5983830 and 570f6fa are unrelated to route blinding and could be extracted into their own PR. That PR would be easier to review on its own, and we should reach agreement quickly on it and merge it as a first step.

Agreed. It's only when I started editing I found these.

I will open a new PR for those, then rebase on that.

thomash-acinq reviewed Jul 9, 2024


rustyrussell added 7 commits July 10, 2024 10:20

BOLT 2, 4: refer to Onion Decryption section for handling blinded mes…

86fc133

…sages and payments. Simply tie the sections together, and put a section on how to handle blinding when decrypting onions. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

BOLT 4: don't refer to using blinding points for decryption.

76579da

We use blinding for the onion, not the `encrypted_recipient_data`. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

BOLT 4: remove obsolete references to realm

570f6fa

This was from the legacy onion, and is no longer present. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell force-pushed the guilt/clarify-onionmessage-decryption branch from d3e24c6 to 465e470 Compare July 10, 2024 00:52

More fixes from @thomas-acinq.

8722986

rustyrussell force-pushed the guilt/clarify-onionmessage-decryption branch from 78fe75f to 2646637 Compare July 10, 2024 02:50

t-bast reviewed Jul 10, 2024


This was referenced Jul 11, 2024

Clarify onion spec: part 1 (the uncontroversial bits) #1181

Merged

Clarify onions part 2: a bit deeper rework #1182

Merged

rustyrussell closed this Jul 11, 2024


