Feature/add scope filter #14

daniellrgn · 2024-04-03T07:00:29Z

Limit user access to requests for their patient resource and resources that reference their patient resource, based on the id found in their token.

…stem

pbugni · 2024-04-03T23:53:18Z

jwt_proxy/api.py


 from jwt_proxy.audit import audit_HAPI_change

 blueprint = Blueprint('auth', __name__)
 SUPPORTED_METHODS = ('GET', 'POST', 'PUT', 'DELETE', 'OPTIONS')

+# TODO: to be pulled into its own module and loaded per config


i love the TODO. makes sense to me to move this to a separate module - by that i mean a separate .py file for easier migration later.

pbugni · 2024-04-04T00:07:10Z

jwt_proxy/api.py

+        return False
+
+    user_id = token.get("sub")
+    identifier_pattern = rf"(https(:|%3[Aa])(\/|%2[Ff]){2}keycloak\.ltt\.cirg\.uw\.edu(%7[Cc]|\|))?{user_id}"


this is rather hardcoded to one keycloak install. it would be nice to look to a config value, similar to UPSTREAM_SERVER.

any reason to use re for URL parsing, vs. say urlparse from urllib.parse ?

Thanks Paul, good points -

This pattern just matches identifier-like query param values like https://keycloak.ltt.cirg.uw.edu|keycloak-user-id
The code system https://keycloak.ltt.cirg.uw.edu is a dummy system url that cPRO adds to the LTT patient resources to identify keycloak user ids, and would certainly better live in config to support other codings/identifiers/systems.

I wasn't aware of urlparse, so thanks for the recommendation! I can't tell if it's quite equivalent to the use of this pattern though: it seems closer to a replacement for Flask's req.args call in the next line that parses the query params from the url, but if there's a better way to validate the content of a specific query param value there I'm all ears.

All that being said, I'll clean up this pattern - I'd hedged it a bit since I didn't know exactly what was url-encoded and what wasn't, but I've found that the query param values are always url-decoded by the req.args call here, and the later check against the identifier params in a POSTed resource's references will always come through url-encoded.

pbugni · 2024-04-04T00:13:18Z

jwt_proxy/api.py

+            user_info=decoded_token.get("email") or decoded_token.get("preferred_username"),
+        )
+        return response_content
+    return jsonify(message="invalid request"), 400


as this implies a lack of authorization, a 401 is a more accurate code.

Agreed that 400 generally doesn't fit. I'd advocate to keep the 400 response on parsing errors like get_json failing though.
Would a 403 in response be an even better semantic fit, as by this point the user is authenticated but unauthorized to make the request, and reauthenticating will not help?

Disregard that first bit, I didn't realize werkzeug's BadRequest errors automatically convert to 400 when left uncaught. They sure do think these things through 😛

pbugni · 2024-04-04T00:15:09Z

jwt_proxy/api.py

+    if id_param_value is not None and re.search(identifier_pattern, id_param_value):
+        return True
+    # Search body for keycloak id
+    if req.is_json:


the request.json property is a nice shortcut. will be None if not included or wrong content-type.

Thanks! The Flask docs recommended preferring get_json(), though it looks like I don't need this check as get_json() includes it implicitly.

pbugni · 2024-04-04T00:19:18Z

jwt_proxy/api.py

-    return response_content
+
+    # TODO: call new function here to dynamically load a filter call dependent on config; hardwired for now
+    if scope_filter(request, decoded_token):


probably best to exit early (with the return jsonify... as you have). i fear we'll get a deep nest of it clauses when additional filters come online.

thanks Paul! was going to mention something similar; sometimes called a "guard clause"

Nice catch, thanks both!

pbugni · 2024-04-04T01:13:15Z

jwt_proxy/api.py

+# TODO: to be pulled into its own module and loaded per config
+def scope_filter(req, token):
+    # Check path
+    resource_pattern = rf"(Patient|DocumentReference)$"


I fear this approach leaves a backdoor to well tailored requests. A user could include a string like "Patient" in say an or-clause of a search, or the like.

Any reason to not just limit to the requested resource in the request path? i.e. {url}/Patient or {url}/DocumentReference for this use case?

Any reason to not just limit to the requested resource in the request path? i.e. {url}/Patient or {url}/DocumentReference for this use case?

That's the intention of this pattern, to enforce the request path (sans params) ends in either Patient or DocumentReference (it's checked against req.path, which will typically be fhir/DocumentReference etc.). I preferred containing this check within the function to defining a url rule or something, but let me know if there's a better/more pythonic way!

ivan-c

Looks good, thanks! There's more work remaining, so your TODOs/caveats are much appreciated when things settle down a bit

ivan-c · 2024-04-04T01:27:04Z

jwt_proxy/api.py

-    return response_content
+
+    # TODO: call new function here to dynamically load a filter call dependent on config; hardwired for now
+    if scope_filter(request, decoded_token):


thanks Paul! was going to mention something similar; sometimes called a "guard clause"

Guard clause prior to proxy Response code change to 403 on failure json handling improvement

daniellrgn added 3 commits April 2, 2024 18:41

Add scope filter to limit user access

df8fcae

Limit user access to requests for their patient resource and resources that reference their patient resource, based on the id found in their token.

Add scope filter function

e316db6

Remove temp returns

79e3933

mcjustin mentioned this pull request Apr 3, 2024

Limit HAPI query scope to resources associated with the logged in user (config-enabled) #12

Open

daniellrgn added 6 commits April 3, 2024 12:13

Fix scope_filter arg

1512e58

Add ltt-specific id system and resource types; fix body check

fc099f9

Update api.py

3677e87

Fix resource type match

752c89e

Update api.py resource type check

e6dd0bd

Tighten up checks to limit to resource types and ltt kc identifier sy…

9d5a155

…stem

pbugni requested changes Apr 4, 2024

View reviewed changes

pbugni reviewed Apr 4, 2024

View reviewed changes

ivan-c approved these changes Apr 4, 2024

View reviewed changes

Address feedback

3d201f1

Guard clause prior to proxy Response code change to 403 on failure json handling improvement

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/add scope filter #14

Feature/add scope filter #14

daniellrgn commented Apr 3, 2024 •

edited

Loading

pbugni Apr 3, 2024

pbugni Apr 4, 2024

daniellrgn Apr 4, 2024 •

edited

Loading

pbugni Apr 4, 2024

daniellrgn Apr 4, 2024

daniellrgn Apr 4, 2024 •

edited

Loading

pbugni Apr 4, 2024

daniellrgn Apr 4, 2024

pbugni Apr 4, 2024

ivan-c Apr 4, 2024

daniellrgn Apr 4, 2024

pbugni Apr 4, 2024

daniellrgn Apr 4, 2024 •

edited

Loading

ivan-c left a comment

ivan-c Apr 4, 2024

Feature/add scope filter #14

Are you sure you want to change the base?

Feature/add scope filter #14

Conversation

daniellrgn commented Apr 3, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daniellrgn Apr 4, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daniellrgn Apr 4, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daniellrgn Apr 4, 2024 • edited Loading

Choose a reason for hiding this comment

ivan-c left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daniellrgn commented Apr 3, 2024 •

edited

Loading

daniellrgn Apr 4, 2024 •

edited

Loading

daniellrgn Apr 4, 2024 •

edited

Loading

daniellrgn Apr 4, 2024 •

edited

Loading