Spec is inconsistent about which strings are valid CSPs #414

bakkot · 2019-11-22T04:13:16Z

The Parse a serialized CSP algorithm says that it must be given a "serialized CSP", which is an ASCII string adhering to

serialized-policy =
    serialized-directive *( optional-ascii-whitespace ";" [ optional-ascii-whitespace serialized-directive ] )
serialized-directive =
    directive-name [ required-ascii-whitespace directive-value ]
directive-name =
    1*( ALPHA / DIGIT / "-" )
directive-value =
    *( required-ascii-whitespace / ( %x21-%x2B / %x2D-%x3A / %x3C-%x7E ) )

This does not match, for example, the string @; script-src 'none'.

However, the algorithm itself does accept that string: step 2.3 is "Let directive name be the result of collecting a sequence of code points from token which are not ASCII whitespace.", which consumes @ as directive name. And indeed I would expect it a parser to have defined behavior for all strings, which may include rejecting them. It is very strange to say that the parser must be given a string which conforms to a particular grammar, especially since consumers like the HTML spec do not first check that the strings with which they are calling it conform to the grammar.

(In fact the algorithm as specified accepts all strings.)

It is not clear to me what the intended interpretation of the string @; script-src 'none' is. Browsers seem to treat the @ as an unrecognized directive and discard it as they would any other directive, and hence still enforce the script-src 'none' part. But if we read 3.1 The Content-Security-Policy HTTP Response Header Field strictly, a HTTP header named Content-Security-Policy whose value is @; script-src 'none' is not actually a Content-Security-Policy header.

There are other places this comes up: for example, the grammar for serialized-source-list

serialized-source-list =
    ( source-expression *( required-ascii-whitespace source-expression ) ) / "'none'"
source-expression =
    scheme-source / host-source / keyword-source / nonce-source / hash-source

does not match 'none' https://example.com, which by a strict reading of the definition of img-src means that img-src 'none' https://example.com is not a img-src directive. Is that the intent? (cf #411)

The general statement of the problem is that the current spec gives grammars for things which are more restrictive than the algorithms which are said to correspond to those grammars. I think this should be fixed, either by removing the relevant grammars from the normative specification, by loosing them, or by tightening the algorithms which correspond to them (which would probably be a breaking change).

The text was updated successfully, but these errors were encountered:

lucasgadani · 2020-01-03T20:09:42Z

Checking current chrome and firefox implementations, it seems that the existing implementations ignore invalid source-expressions, which means that image-src 'none' https://example.com matches https://example.com and ignores the 'none' part, though I agree with your assertion above that technically this is not a valid img-src.

Similarly, in the existing implementations something like frame-ancestors wrong_scheme: https: will ignore wrong_scheme: (due to the invalid scheme character), but still allow https: schemes. It'll also accept things like frame-ancestors * .example.com as accepting anything, as it parses "*" then ignores ".example.com".

It's important for the spec to clarify what should be the behavior in this situation. Should the invalid CSP grammar still be accepted and partially applied, based on the parsing of each individual source-expression? Or should the implementations ignore the invalid directives entirely due to them not being valid?

This was referenced Jan 17, 2020

Why does plugin-types use the empty list instead of 'none'? #420

Open

Does the meta http-equiv="Content-Security-Policy" tag allow lists of policies? whatwg/html#5102

Open

bakkot mentioned this issue Apr 23, 2020

Why do base-uri and frame-ancestors have different grammars? #431

Open

bakkot mentioned this issue May 13, 2020

Clarify/test which quote characters may be used #434

Open

bakkot mentioned this issue Mar 8, 2021

Non-ASCII characters in CSP policy. #473

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spec is inconsistent about which strings are valid CSPs #414

Spec is inconsistent about which strings are valid CSPs #414

bakkot commented Nov 22, 2019 •

edited

Loading

lucasgadani commented Jan 3, 2020

Spec is inconsistent about which strings are valid CSPs #414

Spec is inconsistent about which strings are valid CSPs #414

Comments

bakkot commented Nov 22, 2019 • edited Loading

lucasgadani commented Jan 3, 2020

bakkot commented Nov 22, 2019 •

edited

Loading