Introduce 'webrtc-src'. #287

mikewest · 2018-01-17T12:39:14Z

The 'webrtc-src' directive is a proposal for handling WebRTC connections. This patch
isn't exactly finished, but it should at least give a concrete proposal that we can
discuss in #92.

Preview | Diff

The 'webrtc-src' directive is a proposal for handling WebRTC connections. This patch isn't exactly finished, but it should at least give a concrete proposal that we can discuss in #92.

mikewest · 2018-01-17T12:41:02Z

@annevk, @alvestrand, @ckerschb, @martinthomson, @murillo128, @andypaicu, WDYT?

alvestrand · 2018-01-17T13:16:35Z

index.src.html

+  <pre>
+    directive-name  = "webrtc-src"
+    directive-value = <a grammar>serialized-source-list</a>
+  </pre>


We need some more language here explaining what "servers or peers to which connections may be established" means.

Suggested text off the top of my head:

The following situations are to be checked according to this directive:

A host URL occurs in the list of RTCIceServers of an RTCConfiguration when a PeerConnection is created. In this case, the PeerConnection creation will fail.

A host URL occurs in the list of RTCIceServers of an RTCConfiguration when a PeerConnection's setConfiguration method is called. In this case, setting the configuration will fail.

An address occurs in the ip, protocol and port fields of an RTCIceCandidate created from SetRemoteDescription or AddIceCandidate. In this case, the call will be rejected.

And perhaps a note of caution, something like: "Due to the problem of listing all possible communication partners for a WebRTC application, the "*" value is likely to be the most useful value to set as the value of the "webrtc-src" directive".

note - in candidates, addresses are usually expressed as address literals (1.2.3.4 or 202b:47:9f::334) - matching a hostname against an address is a complex task, and we probably shouldn't require that it happens.

note - in candidates, addresses are usually expressed as address literals (1.2.3.4 or 202b:47:9f::334) - matching a hostname against an address is a complex task, and we probably shouldn't require that it happens.

Hrm. The matching algorithm in https://w3c.github.io/webappsec-csp/#match-hosts actually specifically excludes IP addresses. We might need to rethink that if it's the way these things are typically specified in WebRTC.

Should the SetRemoteDescription method fail if there are any non-whitelisted peers or filter them out and allow only the valid ones? I am thinking on a situation where you allow a TURN server ip address and want all the data go via it, but on the sdp you will have all the non-turn candidates.

Should the SetRemoteDescription method fail if there are any non-whitelisted peers or filter them out and allow only the valid ones? I am thinking on a situation where you allow a TURN server ip address and want all the data go via it, but on the sdp you will have all the non-turn candidates.

Ideally, the WebRTC spec would answer those questions. I'd like CSP to give that feature hooks that express a page's policy decisions, but I just don't know WebRTC's API surface or developer expectations to make helpful suggestions about when you'd want to reject promises, or what kinds of points you'd want to guard with these hooks.

I would say that it is ok to require FQDN candidates when using something different than webrtc-src:*

I would prefer that this only define * in the first patch.

It is very common for WebRTC hosts to be specified via address literals to the point that this is the dominant method. Even for STUN and TURN. For this to fail on that make it considerably less useful.

Also, hosts are typically not authenticated by their name. That makes names a little less valuable.

If the intent is to prevent exfiltration, we can't allow application control over connections at all. CSP intentionally avoids address literals so that DNS queries don't leak information, but here we need a new strategy. It's possible that we want to govern STUN and TURN differently from candidates. Using serialized-source-list seems fine for that purpose.

But using serialized-source-list might prevent us from developing a sensible strategy for handling candidates in a sensible fashion.

Can we reserve values other than * and leave this unspecified until we sort this out in more detail?

To the question of what setRemoteDescription does, I think that there is a simple answer: nothing. The same thing we do if we get given SDP with bogus candidates now, or SDP with no candidates. Instead, we would have all connectivity checks toward that destination fail without even being attempted. Browsers might avoid putting those candidates in check lists to avoid gaps, but that would be up to them (the important thing is to not send the packet).

If WebRTC is disabled entirely (i.e., webrtc-src: none or similar on the fallback path), then we might do something to indicate that sooner. That would be an amenity feature though, the security check should be when STUN messages go out.

michaelficarra · 2018-01-17T16:57:03Z

How can we possibly get away with having webrtc-src (or basically any new directives) extend from default-src and connect-src? Websites which currently define a default-src will unintentionally adopt WebRTC restrictions. I get that we could do it for preload/prefetch or anything that only affects performance, but this will undoubtedly reduce functionality for existing websites.

martinthomson · 2018-01-18T06:46:20Z

index.src.html

@@ -1370,7 +1374,7 @@ <h4 id="should-block-navigation-response" algorithm>
    in |target| be blocked by Content Security Policy?
  </h4>

-  Given a <a for="/">request</a> (|navigation request|),, a string (|type|, either
+  Given a <a for="/">request</a> (|navigation request|), a string (|type|, either


unrelated change?

martinthomson · 2018-01-18T06:50:40Z

index.src.html

+    Given a page with the following Content Security Policy:
+
+    <pre>
+      Content-Security-Policy: <a>webrtc-src</a> example.com


Missing colon.

martinthomson · 2018-01-18T06:58:25Z

index.src.html

+  <pre>
+    directive-name  = "webrtc-src"
+    directive-value = <a grammar>serialized-source-list</a>
+  </pre>


I would prefer that this only define * in the first patch.

It is very common for WebRTC hosts to be specified via address literals to the point that this is the dominant method. Even for STUN and TURN. For this to fail on that make it considerably less useful.

Also, hosts are typically not authenticated by their name. That makes names a little less valuable.

If the intent is to prevent exfiltration, we can't allow application control over connections at all. CSP intentionally avoids address literals so that DNS queries don't leak information, but here we need a new strategy. It's possible that we want to govern STUN and TURN differently from candidates. Using serialized-source-list seems fine for that purpose.

But using serialized-source-list might prevent us from developing a sensible strategy for handling candidates in a sensible fashion.

Can we reserve values other than * and leave this unspecified until we sort this out in more detail?

To the question of what setRemoteDescription does, I think that there is a simple answer: nothing. The same thing we do if we get given SDP with bogus candidates now, or SDP with no candidates. Instead, we would have all connectivity checks toward that destination fail without even being attempted. Browsers might avoid putting those candidates in check lists to avoid gaps, but that would be up to them (the important thing is to not send the packet).

If WebRTC is disabled entirely (i.e., webrtc-src: none or similar on the fallback path), then we might do something to indicate that sooner. That would be an amenity feature though, the security check should be when STUN messages go out.

alvestrand · 2018-01-18T11:47:03Z

With regard to IP matching: In peer-to-peer WebRTC scenarios, candidates are discovered by inspecting the client's local interfaces. There is no reasonable mechanism for finding a FQDN that matches the IP address. (Reverse DNS lookup is not reasonable for this case.)
What I was thinking when writing this was that when evaluating the CSP directive, the browser would resolve the hostnames in the CSP directive to addresses, and if those addresses were an exact match for the address provided, they could let them through.
This would allow people with (for instance) a small set of whitelisted media gateways or TURN servers to define a DNS name that resolved to all of those hosts' IP address, and use that in their CSP directive.
(Putting '*.something.something' in the CSP directive doesn't work either, obviously.)
Using IP literals in the CSP directive doesn't seem to be useful.

For TURN servers, hostnames could be made to work.

murillo128 · 2018-01-18T12:52:41Z

Candidates gathered by the browser won't work (either local, reflexive or server reflexive), but it is valid for an SDP generated by a gateway to add their FQDN instead of their ip address on the candidate.
and even the *.somthing will work on that case.

It won't cover the case in which you want to whitelist TURN servers tough, which, as you say, would require to resolve each host entry before doing the match.

martinthomson · 2018-01-18T22:46:54Z

@alvestrand, doing a DNS lookup would allow the site to exfiltrate data. We can't allow the network to be triggered using site-provided inputs. @murillo128 is right that there are some cases where an IP address could work and it's not the case that we have the same constraints as other CSP directives.

murillo128 · 2018-01-19T08:49:54Z

@martinthomson not sure if we are talking about the same. Let me explain myself with an example:

Imagine that you have a conference-only scenario (i.e. no P2P) when all your media is going via your sfu. It could make sense to enable CSP to prevent any data going in/out from your service.

You could add default-src: *.example.com as a CSP policy and get all your https/wss connection going to www.example.com. You could use turn.example.com normally to create the peerconnection and the SDP provided by the sfu could include a candidate like:

a=candidate:1 1 UDP 2130706431 sfu.example.com 8998 typ host

What this scenario will not allow would be P2P calls with media going via the TURN server, as they remote candidates would have the turn server IP address and not the FQDN.

alvestrand · 2018-01-29T13:42:16Z

@martinthomson I think we agree. The mechanism I was thinking of is too fragile and too complex.
The use case I was thinking of was:

example.com is specified in the CSP list.
browser resolves example.com to 1.2.3.4 (this isn't a security risk, we already trust the author of the CSP header containing example.com)
Javascript wants to connect to 1.2.3.4
browser says "OK".

but when A records can change between the loading of the page and the checking of a candidate, this is just Not Worth It - let's KISS.

lgrahl · 2018-02-17T14:58:30Z

I need to ask: Does it actually make sense to provide anything more than an on/off switch for WebRTC? We not only have the problem that WebRTC leaks data to STUN/TURN servers. If WebRTC is turned on, it can leak data to arbitrary hosts via well-crafted ICE candidates even without any servers. So, if I use webrtc-src: turn.example.com, I can still send arbitrary data to arbitrary hosts by crafting my own ICE candidates. It doesn't limit me to turn.example.com at all.

In case your answer would be Yes, I still think this is useful to the question above, then FWIW I also agree with @michaelficarra that inheriting from connect-src might provide a false sense of security since this has no meaning for ICE candidates (only for STUN/TURN servers). So, if I have set default-src: example.com, I'm not more safe than I was before because I can still send data to arbitrary hosts as pointed out above. This is something that I think will surprise a lot of people.

patrickkettner · 2018-03-05T05:29:50Z

after speaking with a few teams that use webrtc within Microsoft, the consensus is this should just be a toggle for webrtc. You could trivially use a proxy to get around any kind of host restriction, not to mention the issues brought up on the most recent call that the current resolution doesn't currently cover the more commonly used (for webrtc) IP addresses.

alvestrand · 2018-03-08T14:49:31Z

If we inherit from connect-src: and say that we only check for the single value '*', would that give the right level of granularity?

What I want to have as properties is:

Pages that don't protect against connecting to random websites don't block webrtc either
Pages that protect against connecting to random websites won't have webrtc as a bypass
Pages that protect against connecting to random websites can still enable webrtc

Separating the first 2 cases seems to indicate that we should be affected by connect-src; separating the last 2 cases seems to indicate that we need a webrtc-src: directive of some sort.

I think just having webrtc-src: * or webrtc-allowed: yes is enough complexity for the benefit it gives, it's the interaction with connect-src / default-src that needs to be clear.

benjamingr · 2018-03-14T09:47:38Z

I'm not a fan of a toggle for WebRTC in a page - namely it would be very problematic for people who are:

Users of WebRTC enabled services they want to explicitly allow.
Want to explicitly block other WebRTC connections from sources they don't trust.

Basically, what would be beneficial for these stakeholders is the ability to indicate in the CSP which scripts are allowed to initiate WebRTC requests or not. Simply blacklisting WebRTC will make it unusable for them and they will benefit greatly from finer grained filtering.

There is also adblockers who are a potential stakeholder of this feature and probably want/need fine grained control over what WebRTC to block.

For example:

Site A is a company internal communication platform of a large enterprise.
Site A uses WebRTC to deliver information in peer-to-peer between users.
It is important due to the sensitivity of information in Site A that third party scripts included can't leak information by making web requests (including RTCPeerConnections).

Alternatively:

Site B delivers video or large files to paying subscribers in p2p.
Site B uses WebRTC to deliver information in peer-to-peer between users.
It is important due to the sensitivity of information in Site B that third party scripts included can't leak information by making web requests (including RTCPeerConnections) - for example by advertisers.

Alternatively:

Site C is a web app that lets the user share files (WebTorrent for example)
Site C wants to display ads, but wants certain guarantees about what those ads can and can't do
Site C wants to blacklist WebRTC from those third party scripts, but not from the WebTorrent ones.

I also really want WebRTC blocking in the CSP to be a thing, it's something that could help large websites that utilize WebRTC and certainly those who don't who are fearful about third party scripts.

I'd love to help in the spec process (I have a little experience with whatwg, but I haven't been involved in a w3c spec yet - at least not in the last few years).

How can I get involved? @mikewest

annevk · 2018-03-14T09:53:48Z

@benjamingr the main problem is that we cannot really distinguish between the different scripts. They all operate in the same global scope. Trying to assign blame to individual scripts gets complicated very quickly and there's basically no infrastructure for that (and no interest in solving it as far as I know).

benjamingr · 2018-03-14T10:06:09Z

@benjamingr the main problem is that we cannot really distinguish between the different scripts. They all operate in the same global scope.

Thanks for clarifying, I realize this might be a bigger issue to solve - given WebRTC can always craft ICE candidates to bypass any non-global restriction and a global restriction pretty much ignores concerns of sites using WebRTC.

If my site is not using WebRTC this might be useful for me, but if my site is using WebRTC then a blanket "block all" does not help improve the security guarantees of my site at all.

As more sites will utilize P2P delivery in the future (in one way or another, the trend we're seeing) to benefit users in geographies with bad networking conditions (and users in general) this effectively means we're giving them a choice between security and performance (which is never a great thing to do).

@lgrahl do you think there is a small(ish) set of changes that can be done to the RTCPeerConnection spec that might help us put a more reasonable limitation?

lgrahl · 2018-03-28T01:04:35Z

@benjamingr I'm not saying it can't be done but everything my mind was able to come up with so far was overly complicated and I discarded it immediately. Token solutions for example are hard to get right and pretty much prevent any caching.

alvestrand · 2018-04-03T08:07:58Z

@benjamingr I think the way you establish trust boundaries between scripts is to isolate the scripts you don't trust (the third party ones) inside an iframe. At the iframe boundary you can add arguments that stript the iframe of various capabilities, such as the ability to use webrtc.
I don't think we have a way to separate permissions for scripts under a single origin in a single page.

alvestrand · 2018-05-03T09:29:41Z

My conclusion from the discussion is that dealing with hostnames is too much cost and complexity for too little gain in actual benefit, and that the pure yes/no semantics is needed and needs to get published.
I'll prepare a revised PR in line with that and ask @mikewest to review.

benjamingr · 2018-05-03T09:35:32Z

In full honesty - IMO this is a bad technical solution that does not help the primary use case at all. I feel like you're choosing the easy solution over the correct one.

annevk · 2018-05-03T09:37:43Z

What is the correct solution?

benjamingr · 2018-05-03T13:27:22Z

@annevk something that would actually address WebRTC blocking use cases and isn't a kill-switch. I don't feel like the options for addressing that use case have been exhausted at all.

Namely, being merged as is means everyone who wants to use WebRTC data channels for their own benefit on their own website would be unable to block untrusted third parties from opening WebRTC connections.

For example:

I own a streaming website
I use a third party script on my site for ads/analytics/whatever
I don't want the script to be able to open RTCPeerConnections

Now to clarify this is a real use case, we happen to have a subset of our customers (>50) with a subset of our concurrent RTCPeerConnections (>100K, probably >1M) who would use this capability. I can probably easily reach out to smaller companies who work with WebRTC CDNs and get the number of interested parties to 100 (and this is websites delivering > TBs a month with WebRTC).

This means we either don't have the capability at all (no WebRTC delivery) which is expensive and slow, or we have the capability and untrusted third parties can utilize it.

annevk · 2018-05-03T13:32:51Z

We went through that scenario in #287 (comment) and down. The token-based solution @lgrahl mentions is reasonable, but also comes at quite a bit of additional complexity. It seems that such a solution could be persued on top of a simple on-off switch advocated for here.

benjamingr · 2018-05-03T15:20:33Z

I feel like we haven't exhausted the options, I'm not very experienced with the spec and am unlikely to be able to contribute a more complete fix like @lgrahl or yourself. It is possible that this solution could pursued on top of a simple on-off switch, it's also possible it can't be or that the syntax will end up looking quirky with a correct solution because of the on-off switch.

I really can't tell because we haven't had a concrete proposal for a "correct" solution yet - I'm only objecting to adding an API we know doesn't work from the get go.

I don't have a lot of experience with web APIs but the ones I've participated in or observed had a very high standard before adding an API in. Typically use cases were discussed, consumers were talked with and edge cases were sorted out. Fetch even waited for two years for other platform APIs.

I'd expect the same level of seriousness to be exercised here with the WebRTC CSP API - this might be an expectation problem on my had in which case I apologize for the mess I'm causing.

I'm totally on board with any efforts you might have to gather use cases or discuss them. I can probably talk with potential users of the API and gather use cases or examples of scenarios users expect to work. I also don't mind reaching out to the community at large and gathering use cases there too if you think it would help.

annevk · 2018-05-03T16:33:56Z

We can do token-based on top of on-off, just look at scripts and stuff. There's precedent for that in CSP so no need to worry and no need to block things on. We need on-off as a baseline because otherwise WebRTC is an exfiltration channel in an otherwise fully controlled environment. That's the thing that started this discussion.

lgrahl · 2018-05-04T15:45:01Z

I guess the best kind of solution would be to have more fine granular control on a per-script level (as proposed earlier by @benjamingr) but I have no idea how feasible that is as they all operate in the same global scope. As an idealist, I'm puzzled that there isn't more interest in solving this because it seems to me as if this is one of the major issues of web environments since third party dependencies such as ad and analystic mixin scripts are a thing.

zenhack · 2020-12-18T16:15:26Z

I'd like to get this discussion moving again.

Re: the use case of isolating different scripts within a single page, probably something like SES is needed for that to be tractable, as otherwise the scripts all live in the same global scope and can trample one another, so a CSP directive is useless for that level of granularity. If SES becomes widely available in the browser, then access can be mediated for code running in a compartment by providing that compartment with an attenuated API, with no need for headers to be involved. Some level of separation is possible now using iframes, and I'd like to see webrtc API objcts be transferable via postMessage(), which would allow more flexible policies re: access given to child frames.

So I'm of the opinion that we should add a simple on/off switch as far as CSP is concerned.

benjamingr · 2020-12-18T16:25:35Z

2 years later I think a kill-switch is probably much better than having nothing if there is a way to add support for finer grained control in the future.

zenhack · 2021-01-15T22:08:51Z

What is needed to push this forward? A draft of another change just adding the on/off switch?

annevk · 2021-01-18T07:41:19Z

https://whatwg.org/working-mode#changes has a rough list of requirements. (While this is not a WHATWG document, the requirements are high-level enough that it doesn't really matter.)

alvestrand · 2021-01-19T06:38:06Z

To my mind, the critical part of pushing this forward is to ensure that there is someone who will ensure that the resources are available to implement it in browsers (and deal with the inevitable fallout of the effects of its introduction).
The reason it stalled in 2018 was that people stopped asking for it (as in "I'm altering your priorities" asking for it).

zenhack · 2021-05-30T19:22:46Z

This should probably be closed in favor of #457

Introduce 'webrtc-src'.

b509df0

The 'webrtc-src' directive is a proposal for handling WebRTC connections. This patch isn't exactly finished, but it should at least give a concrete proposal that we can discuss in #92.

mikewest mentioned this pull request Jan 17, 2018

WebRTC can be used for exfiltration #92

Closed

alvestrand reviewed Jan 17, 2018

View reviewed changes

martinthomson suggested changes Jan 18, 2018

View reviewed changes

uBlock-user mentioned this pull request Mar 7, 2018

... uBlockOrigin/uAssets#1677

Closed

lgrahl mentioned this pull request Mar 29, 2018

What if I like to use Threema and Threema Web completely anonymously? threema-ch/threema-web#479

Closed

gorhill mentioned this pull request Jun 27, 2018

Allow disabling webrtc leak prevention on special sites uBlockOrigin/uBlock-issues#95

Closed

Malvoz mentioned this pull request Nov 28, 2018

What happened to the webrtc feature? w3c/webappsec-permissions-policy#250

Open

Malvoz mentioned this pull request Feb 28, 2019

Support individual CSP rules Fyrd/caniuse#4751

Open

alvestrand mentioned this pull request Jul 4, 2019

Integrate CSP access control into algorithms w3c/webrtc-nv-use-cases#36

Closed

dontcallmedom mentioned this pull request Dec 18, 2020

CSP-like mechanism for disabling RTCPeerConnection API? w3c/webrtc-pc#2619

Closed

zenhack mentioned this pull request Jan 15, 2021

Ian: Push web standards to allow blocking WebRTC sandstormports/community-project#12

Open

zenhack mentioned this pull request Jan 18, 2021

Introduce 'webrtc' as a simple on/off switch #457

Merged

Base automatically changed from master to main February 16, 2021 23:21

annevk closed this May 31, 2021

annevk deleted the webrtc-src branch May 31, 2021 07:18

lgrahl mentioned this pull request Aug 19, 2021

Add a CSP check to RTCPeerConnection.addIceCandidate(). w3c/webrtc-extensions#81

Merged

Introduce 'webrtc-src'. #287

Introduce 'webrtc-src'. #287

Conversation

mikewest commented Jan 17, 2018 • edited by pr-preview bot Loading

mikewest commented Jan 17, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

michaelficarra commented Jan 17, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alvestrand commented Jan 18, 2018

murillo128 commented Jan 18, 2018

martinthomson commented Jan 18, 2018

murillo128 commented Jan 19, 2018

alvestrand commented Jan 29, 2018

lgrahl commented Feb 17, 2018

patrickkettner commented Mar 5, 2018

alvestrand commented Mar 8, 2018

benjamingr commented Mar 14, 2018

annevk commented Mar 14, 2018

benjamingr commented Mar 14, 2018 • edited Loading

lgrahl commented Mar 28, 2018 • edited Loading

alvestrand commented Apr 3, 2018

alvestrand commented May 3, 2018

benjamingr commented May 3, 2018

annevk commented May 3, 2018

benjamingr commented May 3, 2018

annevk commented May 3, 2018 • edited Loading

benjamingr commented May 3, 2018

annevk commented May 3, 2018

lgrahl commented May 4, 2018

zenhack commented Dec 18, 2020

benjamingr commented Dec 18, 2020

zenhack commented Jan 15, 2021

annevk commented Jan 18, 2021

alvestrand commented Jan 19, 2021

zenhack commented May 30, 2021

mikewest commented Jan 17, 2018 •

edited by pr-preview bot

Loading

benjamingr commented Mar 14, 2018 •

edited

Loading

lgrahl commented Mar 28, 2018 •

edited

Loading

annevk commented May 3, 2018 •

edited

Loading