Add initial draft of Auth GEP 1494 #3500

youngnick · 2024-12-13T03:35:10Z

/kind gep

What this PR does / why we need it:

This is to get the conversation started around Auth* in Gateway API. Hopefully this part should not be too contentious yet, but reviews gratefully accepted.

Which issue(s) this PR fixes:

Updates #1494

Signed-off-by: Nick Young <nick@isovalent.com> Co-authored-by: Jen Gao <jie.gao.1025@gmail.com>

k8s-ci-robot · 2024-12-13T03:35:18Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: youngnick

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~geps/OWNERS~~ [youngnick]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

LiorLieberman · 2024-12-13T17:59:00Z

/cc

LiorLieberman

Thanks for the very detailed content. Left a few comments.

LiorLieberman · 2024-12-13T18:04:43Z

geps/gep-1494/index.md

+
+* A way for Chihiro the Cluster Admin to configure a default Authentication and/or Authorization config for some set of HTTPRoutes.
+
+* Optionally, a way for Ana to have the ability to disable Authentication and/or Authorization for specific routes when needed, allowing certain routes to not be protected.


Does this imply that Ana has the ability to override the cluster-admin configuration? I assume some use cases for this functionality might involve testing, but I’m curious to hear about other potential use cases. My concern is that an app developer, who may not have a strong understanding of authentication, would have the ability to override cluster-admin (or security admin) defaults.

jgao1025 · 2024-12-15

I think some public pages, like a status page or help page, might not need authn?

It could be more flexible if the Gateway API allowed an HTTPRoute to either override or inherit settings from the Gateway under a hierarchical control model. This flexibility would be especially useful in cases where roles overlap, such as in smaller organizations where Ana and Chihiro might be the same person.

youngnick · 2024-12-17

Yes, the most likely use case here is for healthchecks, or being able to say "the whole website needs auth, except for /public" or something similar.

pubudu538 · 2024-12-18

From my experience, cluster admins often lack a clear understanding of the deployed applications and their paths unless developers explicitly provide this information. In this context, the role of the cluster admin would be to enable default authentication globally for all routes. Developers, like Ana, can then override this default authentication at the HTTPRoute level or for specific routes within the HTTPRoute. WDYT?

youngnick · 2024-12-18

Yes, this is roughly what I'm thinking here. A default, not an override, although I'm not yet sure whether we'll use Policy Attachment to do this.
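To make the "auth by default, except for /public" idea concrete, here is a minimal sketch of the matching logic only. It is not a proposed Gateway API type: the function name `requires_auth` and the exempt prefixes (`/public`, and `/healthz` standing in for a healthcheck path) are assumptions chosen for illustration.

```python
from typing import Iterable


def requires_auth(path: str,
                  exempt_prefixes: Iterable[str] = ("/public", "/healthz")) -> bool:
    """Return True when the admin-set default (auth required) applies to path.

    exempt_prefixes is a hypothetical per-route opt-out list owned by the
    application developer; everything not listed inherits the default.
    """
    for prefix in exempt_prefixes:
        base = prefix.rstrip("/")
        # Match the prefix itself or anything below it, but not a sibling
        # such as "/publicity" that merely shares leading characters.
        if path == base or path.startswith(base + "/"):
            return False
    return True
```

Matching on whole path segments rather than raw string prefixes avoids accidentally exempting unrelated paths, which matters when the exemption is something an application developer can set on their own.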

Ongy

A couple of comments around the auth mechanisms, plus suggestions for user stories.

I'm new to k8s upstream work. Do you care about style comments (e.g. AuthN/AuthZ being defined twice), or would you prefer to keep this to the technical bits?

Ongy · 2024-12-17T10:54:13Z

geps/gep-1494/index.md

+
+In this case, the server authenticates the client based on the client presenting a certificate that's signed by an authority that's also trusted by the server's trust chain. Some implementations also allow details about the certificate to be passed through to backend clients, to be used in authorization decisions.
+
+TLS v1.3 is defined in [RFC-8446](https://datatracker.ietf.org/doc/html/rfc8446), with v1.2 defined in [RFC-5246](https://datatracker.ietf.org/doc/html/rfc5246).


While not necessary, I think it's worth pointing to https://datatracker.ietf.org/doc/html/rfc8996 for why TLS 1.1 and earlier are not considered.

Ongy · 2024-12-17T11:00:09Z

geps/gep-1494/index.md

+
+TLS includes the possibility of having both the client and server present certificates for the other party to validate. (This is often called "mutual TLS", but is distinct from the use of that term in Service Mesh contexts, where it means something more like "mutual TLS with short-lifetime, automatically created and managed dynamic keypairs for both client and server").
+
+In this case, the server authenticates the client based on the client presenting a certificate that's signed by an authority that's also trusted by the server's trust chain. Some implementations also allow details about the certificate to be passed through to backend clients, to be used in authorization decisions.


I think the "also" is misplaced: the point is that the server also verifies the client, not that the client's cert is trusted by the same CA as ??.
The "trust chain" wording seems odd as well. Generally we have trust roots on a server, and the chain is presented by the client.
Since the client-side process of authenticating the server is implied prior knowledge, I think this can be condensed a bit.

I suggest:

In this case, the server also authenticates the client, based on the certificate chain presented by the client. Some implementations also allow details about the certificate to be passed through to backend clients, to be used in authorization decisions
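As a rough illustration of "trust roots on the server, chain presented by the client", here is a sketch using Python's standard `ssl` module. The helper name `make_mtls_server_context` and the `ca_file` parameter are made up for this example; a real server would also need `load_cert_chain()` for its own certificate and key.

```python
import ssl
from typing import Optional


def make_mtls_server_context(ca_file: Optional[str] = None) -> ssl.SSLContext:
    """Build a server-side TLS context that requires a client certificate."""
    ctx = ssl.SSLContext(ssl.PROTOCOL_TLS_SERVER)
    # Refuse TLS 1.1 and earlier, in line with RFC 8996.
    ctx.minimum_version = ssl.TLSVersion.TLSv1_2
    # The handshake fails unless the client presents a certificate chain
    # that validates against the configured trust roots.
    ctx.verify_mode = ssl.CERT_REQUIRED
    if ca_file is not None:
        # Trust roots used to validate the chain presented by the client.
        ctx.load_verify_locations(cafile=ca_file)
    return ctx
```

After a successful handshake, `SSLSocket.getpeercert()` exposes details of the client certificate, which is the kind of information some implementations pass through to backends for authorization decisions.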

Ongy · 2024-12-17T11:04:15Z

geps/gep-1494/index.md

+
+In Basic HTTP Auth, a server asks for authentication as part of returning a `401` status response, and the client includes an `Authorization` header that includes a Base64-encoded username and password.
+
+Because the password is only _encoded_ and not _encrypted_, Basic Auth is totally unsafe when used outside of an encrypted session (like a HTTPS connection).


While the "raw" passwords in Basic Auth have additional issues (they are long-lived and can be used to impersonate a human on login pages), JWT and, AFAIK, OAuth/OIDC are not generally secure against "replay" attacks either.

I.e. besides mTLS (which enforces confidentiality), all mechanisms are insecure over plaintext connections, because the authentication token can at least be reused on other connections to gain the same level of privileges.

I think it's best to move the notion of requiring encryption for safe usage out of Basic Auth and potentially add a note here that basic auth has an additional issue with long-lived (potentially higher power) tokens being exchanged.
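To show concretely why "encoded, not encrypted" matters, here is a small sketch of building and reversing a Basic credential as described in RFC 7617. The function names are illustrative only.

```python
import base64
from typing import Tuple


def basic_auth_header(username: str, password: str) -> str:
    """Build the value of an Authorization header for HTTP Basic auth."""
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def decode_basic_auth(header_value: str) -> Tuple[str, str]:
    """Recover (username, password) from a Basic Authorization header value.

    No key or secret is needed: anyone who can read the header can do this.
    """
    scheme, _, token = header_value.partition(" ")
    if scheme.lower() != "basic":
        raise ValueError("not a Basic credential")
    decoded = base64.b64decode(token).decode("utf-8")
    # Only the first colon separates the fields; passwords may contain colons.
    username, _, password = decoded.partition(":")
    return username, password
```

An observer on an unencrypted connection gets the long-lived password itself, whereas with a bearer token they get something that can be replayed until it expires. Both are compromised in plaintext, but the blast radius differs.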

Ongy · 2024-12-17T11:10:34Z

geps/gep-1494/index.md

+## Auth* User Stories
+
+
+* As Ana the Application Developer, I wish to be able to configure that some or all of my service exposed via Gateway API requires Authentication, and ideally to be able to make Authorization decisions about _which_ authenticated clients are allowed to access.


This might leak into the "API" phase, but I think there are two levels to this, both of which are worth an explicit mention:

* The API of the proposed implementation provides enough flexibility to integrate with an authorization mechanism and protect resources entirely in the gateway.

* The API allows injecting information about the authentication result into requests, so that backend applications can make authorization decisions based on it.

Ongy · 2024-12-17T11:12:50Z

geps/gep-1494/index.md

+
+* As Ana the Application Developer, I wish to be able to configure that some or all of my service exposed via Gateway API requires Authentication, and ideally to be able to make Authorization decisions about _which_ authenticated clients are allowed to access.
+* As Chihiro the Cluster Admin, I wish to be able to configure default Authentication settings (at least), with an option to enforce Authentication settings (preferable but not required) for some set of services exposed via Gateway API inside my cluster.
+* More User Stories welcomed here!


As Ana the Application Developer, I wish to be able to redirect users to a login page when they lack authentication, while unauthenticated API access gets the proper 40x response.

^ to make it clear that we need to handle "human" clients (browsers) slightly differently from API consumers due to 30X/40X conventions.
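A minimal sketch of that split, assuming the `Accept` header is used as the heuristic for telling browsers from API clients (real implementations may use other signals). The function name, the `/login` default, and the `Bearer` challenge are hypothetical.

```python
from typing import Dict, Tuple


def unauthenticated_response(accept: str,
                             login_url: str = "/login") -> Tuple[int, Dict[str, str]]:
    """Pick a response for a request that carries no valid credentials."""
    if "text/html" in accept.lower():
        # Browser-style client: send the human to a login page.
        return 302, {"Location": login_url}
    # API-style client: answer with a challenge instead of a redirect.
    return 401, {"WWW-Authenticate": 'Bearer realm="example"'}
```

Whatever signal is chosen, the user story implies the API needs some way to express two different unauthenticated behaviors for the same route.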

Ongy · 2024-12-17T15:36:37Z

geps/gep-1494/index.md

+* Handling all possible authentication and authorization schemes. Handling a (preferably large) subset of authentication and authorization is acceptable.
+
+
+## Deferred Goals


Is there any intent to also cover GRPCRoute?

I think it should be made clear whether it's a non-goal or deferred.

JackMyers001 · 2024-12-17T16:10:26Z

geps/gep-1494/index.md

+
+* A way for Chihiro the Cluster Admin to configure a default Authentication and/or Authorization config for some set of HTTPRoutes.
+
+* Optionally, a way for Ana to have the ability to disable Authentication and/or Authorization for specific routes when needed, allowing certain routes to not be protected.


As I understand it, the current proposal only allows for "add auth to this website, except for this route". Could this be expanded to have some kind of rule-based authentication selection/bypass mechanism? I'd love to have the ability to choose a specific authentication mechanism (or bypass auth entirely) on a per-route basis, based on multiple factors (e.g. client IP, user agent).

This would make the Gateway API an amazing Layer 7 firewall, but I'm not sure if the project wants to support these kinds of capabilities; I saw this proposal was closed partly because the feature "operated too much like a WAF/firewall".

Proposed user stories:

As Ana the Application Developer, I'm maintaining a legacy application that doesn't have any existing authentication mechanisms. I wish to enforce SSO when a user accesses the application through a browser. However, I wish to bypass authentication when the user accesses the application through the mobile app (based on the user agent), as it can't handle the SSO flow.

As Ana the Application Developer, I wish to be able to use different authentication mechanisms for the same route based on the context of the request. I wish to use SSO when a user accesses my API through a browser, and JWT when they use my mobile app.

As Chihiro the Cluster Admin, I wish to block undesirable User Agents or IP ranges (spiders, scrapers etc.) from accessing any services exposed by Gateway API inside my cluster.

As Chihiro the Cluster Admin, I wish to be able to bypass AuthN when requests come in from a trusted, controlled IP range, so external services can access APIs running on my cluster without issue.

robscott

Thanks @youngnick!

robscott · 2024-12-18T07:01:15Z

geps/gep-1494/index.md

+
+* A way to configure a Gateway Implementation to perform Authentication (at least), with optional Authorization on behalf of Ana the Application Developer.
+
+* A way for Chihiro the Cluster Admin to configure a default Authentication and/or Authorization config for some set of HTTPRoutes.


Since we're saying this is exclusively for north-south traffic, I think I'd rather attach this config at the Gateway level. Maybe it would be safer to say something like this:

Suggested change

-* A way for Chihiro the Cluster Admin to configure a default Authentication and/or Authorization config for some set of HTTPRoutes.
+* A way for Chihiro the Cluster Admin to configure a default Authentication and/or Authorization config for some set of HTTP or GRPC matching criteria.

robscott · 2024-12-18T07:03:42Z

geps/gep-1494/index.md

+
+Basic auth is defined in [RFC-7617](https://datatracker.ietf.org/doc/html/rfc7617).
+
+#### TLS Client Certificate Authentication


May be worth referring to https://gateway-api.sigs.k8s.io/geps/gep-91/ as this is already in progress.

Add initial draft of Auth GEP 1494

a9db0f1

Signed-off-by: Nick Young <nick@isovalent.com> Co-authored-by: Jen Gao <jie.gao.1025@gmail.com>

k8s-ci-robot requested review from arkodg and robscott December 13, 2024 03:35

k8s-ci-robot added the approved and size/L labels Dec 13, 2024

youngnick added the release-note-none label Dec 13, 2024

k8s-ci-robot removed the do-not-merge/release-note-label-needed label Dec 13, 2024

k8s-ci-robot requested a review from LiorLieberman December 13, 2024 17:59

LiorLieberman reviewed Dec 13, 2024

jgao1025 reviewed Dec 16, 2024

Fix first round of PR comments

6706fbd

Signed-off-by: Nick Young <nick@isovalent.com>

youngnick force-pushed the initial-auth-gep branch from 68f1a60 to 6706fbd Compare December 17, 2024 03:07

Ongy reviewed Dec 17, 2024

JackMyers001 reviewed Dec 17, 2024

robscott reviewed Dec 18, 2024
