Principal extraction from nested username claim was broken #194

mstruk · 2023-06-28T19:43:29Z

When extracting a user id from a JWT token using oauth.username.claim or oauth.fallback.username.claim, extraction only worked for top-level attributes, not for nested attributes. For example, given the configuration "oauth.username.claim=auth.userid" and the JWT token:

{
    ...
    "auth": {
        "userid": "alice"
    }
}

Extraction would not find the 'userid' key under the top-level 'auth' object; instead, it looked for a top-level key named 'auth.userid'.
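The failure mode can be reproduced with a plain map lookup. This is a minimal sketch, not the project's code: `TopLevelLookupDemo` and `lookupTopLevel` are hypothetical names, and the claims are modelled as a `java.util.Map` instead of a parsed JWT payload.

```java
import java.util.Map;

final class TopLevelLookupDemo {

    // Hypothetical helper: treats the configured claim as one top-level key,
    // which is the behavior described above.
    static Object lookupTopLevel(Map<String, Object> claims, String claimName) {
        return claims.get(claimName);
    }

    public static void main(String[] args) {
        // Same shape as the example token: "userid" is nested under "auth".
        Map<String, Object> claims = Map.of("auth", Map.of("userid", "alice"));

        // There is no top-level key literally named "auth.userid".
        System.out.println(lookupTopLevel(claims, "auth.userid")); // prints null
    }
}
```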

This PR adds an option to use JsonPath to target nested attributes.
If the claim specification starts with an opening square bracket '[', it is interpreted as a JsonPath query.
Otherwise, it is interpreted as a top level attribute name.

userId                    ... use top level attribute named 'userId'
user.id                   ... use top level attribute named 'user.id'
$userid                   ... use top level attribute named '$userid'
['userInfo'].id           ... use nested attribute 'id' under 'userInfo' top level attribute
['user.info']['user.id']  ... use nested attribute 'user.id' under 'user.info' top level attribute
['user.info'].['user.id'] ... use nested attribute 'user.id' under 'user.info' top level attribute (optional dot)
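To make the dispatch rule concrete, here is a simplified sketch. It is not the implementation in this PR, which delegates to JsonPath. The class `ClaimSpecSketch`, its methods, and the regular expression are assumptions made for illustration, and the sketch only understands the forms listed in the table above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class ClaimSpecSketch {

    // One path segment: ['quoted name'] with an optional leading dot, or .bareName
    private static final Pattern SEGMENT =
            Pattern.compile("\\s*(?:\\.?\\s*\\['([^']+)'\\]|\\.([A-Za-z_][A-Za-z0-9_]*))");

    // A spec that does not start with '[' is a single top-level attribute name.
    static List<String> parse(String spec) {
        if (!spec.startsWith("[")) {
            return List.of(spec);
        }
        List<String> path = new ArrayList<>();
        Matcher m = SEGMENT.matcher(spec);
        int pos = 0;
        while (pos < spec.length()) {
            m.region(pos, spec.length());
            if (!m.lookingAt()) {
                throw new IllegalArgumentException(
                        "Unsupported claim spec: '" + spec + "' (at position " + pos + ")");
            }
            path.add(m.group(1) != null ? m.group(1) : m.group(2));
            pos = m.end();
        }
        return path;
    }

    // Walks the parsed path through nested maps; returns null if any step is missing.
    static Object resolve(Map<String, Object> claims, String spec) {
        Object current = claims;
        for (String key : parse(spec)) {
            if (!(current instanceof Map)) {
                return null;
            }
            current = ((Map<?, ?>) current).get(key);
        }
        return current;
    }

    public static void main(String[] args) {
        Map<String, Object> claims = Map.of(
                "user.id", "u1",
                "userInfo", Map.of("id", "u2"));
        System.out.println(resolve(claims, "user.id"));         // top-level key 'user.id'
        System.out.println(resolve(claims, "['userInfo'].id")); // nested key 'id'
    }
}
```

Because the decision is made on the first character alone, existing configurations such as `user.id` keep their old meaning as a literal top-level name.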

scholzj

I think this deserves some description to explain what this change does. You seem to have really just switched two parameters around without any change to the method you call. So it is hard to understand the logic behind it.

mstruk · 2023-06-29T10:04:51Z

@scholzj Thanks for pointing this out. Upon a second look I realised that the fix is not good as it potentially introduces backwards compatibility issues. I'll describe the problem in more detail, and I'm working on a proper fix.

tombentley · 2023-07-03T15:00:52Z

oauth-common/src/main/java/io/strimzi/kafka/oauth/common/PrincipalExtractor.java

+            if (spec.charAt(epos) != '.') {
+                throw new IllegalArgumentException("Failed to parse usename claim spec: '" + spec + "' (Missing '.' at position: " + epos + ")");
+            }
+            pos = spec.indexOf("[", epos + 1);


How does this handle the input [foo].bar[baz]?

This should be illegal. It should throw an exception. I'll fix it. Once you start your claim spec with '[', the characters between ']' and '[' should be limited to an arbitrary number of spaces and a single '.'.
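The separator rule described here could be expressed as a single regular expression. The following is a sketch under stated assumptions: `SeparatorCheckSketch` and `isWellFormed` are hypothetical names, not code from this PR, and the check covers bracket-to-bracket specs only.

```java
import java.util.regex.Pattern;

final class SeparatorCheckSketch {

    // A bracketed segment, followed by zero or more bracketed segments that are
    // each separated only by spaces and at most one '.'.
    private static final Pattern BRACKETED =
            Pattern.compile("\\[[^\\]]*\\](?:\\s*\\.?\\s*\\[[^\\]]*\\])*");

    static boolean isWellFormed(String spec) {
        return BRACKETED.matcher(spec).matches();
    }

    public static void main(String[] args) {
        System.out.println(isWellFormed("['a'] . ['b']"));  // spaces and one dot between segments
        System.out.println(isWellFormed("[foo].bar[baz]")); // 'bar' between ']' and '[' is not allowed
    }
}
```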

tombentley · 2023-07-03T15:02:58Z

oauth-common/src/main/java/io/strimzi/kafka/oauth/common/PrincipalExtractor.java

+            if (epos == -1) {
+                throw new IllegalArgumentException("Failed to parse username claim spec: '" + spec + "' (Missing ']')");
+            }
+            parsed.add(removeQuotesAndSpaces(spec.substring(pos, epos)));


How does this handle the input ['foo'bar']?

Currently it looks for the claim called foo'bar. I can't imagine a sane person using ', [ or ] inside an attribute name, so I don't think we have to support that. But I'm also not a fan of going out of our way to error on such strange characters beyond them interfering with parsing rules.

For example you can't have ']' in the name, but you can have '['. In the same vein I don't see a problem allowing ' in the middle of names.

I accept that this is an edge case, but I find it really surprising that anything should accept 'foo'bar' as if it were a literal containing the characters foo'bar. I can't think of any examples (e.g. in programming languages) where such input wouldn't be rejected. Parsing like this is usually based on tokenisation and the idea of paired delimiters. If people need to include ' in the middle of a literal like that, the right way to do it is to require escaping. You're violating a very common expectation here. That would be OK if there was a valid use for supporting this, but you agree that no one should want or need this flexibility.

I think it's far safer to reject inputs like this, since it's more than likely due to a user messing up their config than something intentional. And rejection would appear to be easy enough: Anything inside the quotes should match [a-zA-Z0-9.]+ or similar.

In general it's much easier to be strict from the start, and relax once it's clear that the rules are too strict for valid use cases that users have, than to try to restrict inputs later on. By that point you don't know what weird and wonderful things people might be using and expecting to continue working.
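The strict rule suggested above, where anything inside the quotes must match [a-zA-Z0-9.]+, might look like the following sketch. `QuotedNameCheckSketch` and `requireValidName` are hypothetical names used for illustration and are not part of the PR.

```java
import java.util.regex.Pattern;

final class QuotedNameCheckSketch {

    // Allowed characters inside the quotes, as proposed in the comment above.
    private static final Pattern NAME = Pattern.compile("[a-zA-Z0-9.]+");

    // Takes a token including its surrounding single quotes, e.g. 'user.id',
    // and returns the unquoted name or throws if the token is malformed.
    static String requireValidName(String quoted) {
        int len = quoted.length();
        if (len < 2 || quoted.charAt(0) != '\'' || quoted.charAt(len - 1) != '\'') {
            throw new IllegalArgumentException("Name must be enclosed in single quotes: " + quoted);
        }
        String name = quoted.substring(1, len - 1);
        if (!NAME.matcher(name).matches()) {
            throw new IllegalArgumentException("Illegal characters in name: " + quoted);
        }
        return name;
    }

    public static void main(String[] args) {
        System.out.println(requireValidName("'user.id'")); // prints user.id
        try {
            requireValidName("'foo'bar'");
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Starting from a narrow whitelist like this leaves room to admit more characters later without breaking configurations that already work.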

@tombentley I modified the implementation to use JsonPath for nested attributes. This should remove any inconsistencies in special character matching ...

mstruk · 2023-07-05T10:17:05Z

@tombentley I hope I adequately addressed the comments. WDYT?

tombentley

Thanks @mstruk

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

mstruk added this to the 0.13.0 milestone Jun 28, 2023

scholzj requested changes Jun 28, 2023

scholzj approved these changes Jun 29, 2023

scholzj requested a review from tombentley June 29, 2023 16:36

mstruk mentioned this pull request Jun 30, 2023

Map Group to User Principal using data in User Info Endpoint #192

Open

tombentley reviewed Jul 3, 2023

tombentley approved these changes Jul 7, 2023

mstruk added 6 commits July 7, 2023 14:54

Principal extraction from nested username claim was broken

d8ba636

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

Add proper support for nested claims

1ef4a38

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

Address PR comments and suggestions

05a5f71

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

Remove testsuite run using Kafka 3.3.2 from Travis build

b5eb645

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

Replace custom parsing with JsonPath already used for groups extraction

8e7b807

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

Fix javadoc issue

21f01f9

Signed-off-by: Marko Strukelj <marko.strukelj@gmail.com>

mstruk force-pushed the nested-userid-extraction branch from 00ee727 to 21f01f9 Compare July 7, 2023 12:58

mstruk merged commit 47e76b1 into strimzi:main Jul 7, 2023
