fix: translator reports errors for existing clusters and secretes #4707

zhaohuabing · 2024-11-12T06:54:43Z

Fixes #4706 xDS translation failed when oidc tokenEndpoint and jwt remoteJWKS are specified within the same security policy and using the same hostname

Refactor: skips adding the cluster/secrets and returns nil to make the code cleaner and easier to maintain. It's safe to remove ErrXdsClusterExists and ErrXdsSecretsExists as they don't need to be handled in any places.

Release Notes: Yes

Test before the fix:

--- FAIL: TestTranslateXds (0.43s)
    --- FAIL: TestTranslateXds/securitypolicy-with-oidc-jwt-authz (0.00s)
        translator_test.go:142: securitypolicy-with-oidc-jwt-authz
        translator_test.go:143: 
                Error Trace:    /home/ubuntu/gateway/internal/xds/translator/translator_test.go:143
                Error:          Received unexpected error:
                                xds cluster exists
                Test:           TestTranslateXds/securitypolicy-with-oidc-jwt-authz
FAIL
FAIL    github.com/envoyproxy/gateway/internal/xds/translator   0.673s
FAIL

After:

ok      github.com/envoyproxy/gateway/internal/xds/translator   0.251s

codecov · 2024-11-12T07:02:34Z

Codecov Report

Attention: Patch coverage is 79.54545% with 9 lines in your changes missing coverage. Please review.

Project coverage is 65.53%. Comparing base (c2b0ee3) to head (bdfb1e7).

Files with missing lines	Patch %	Lines
internal/xds/translator/translator.go	57.14%	1 Missing and 2 partials ⚠️
internal/xds/translator/extauth.go	0.00%	0 Missing and 2 partials ⚠️
internal/gatewayapi/securitypolicy.go	94.44%	0 Missing and 1 partial ⚠️
internal/xds/translator/accesslog.go	50.00%	0 Missing and 1 partial ⚠️
internal/xds/translator/extproc.go	0.00%	0 Missing and 1 partial ⚠️
internal/xds/translator/oidc.go	50.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4707      +/-   ##
==========================================
- Coverage   65.55%   65.53%   -0.02%     
==========================================
  Files         211      211              
  Lines       31972    31961      -11     
==========================================
- Hits        20960    20947      -13     
- Misses       9768     9772       +4     
+ Partials     1244     1242       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

arkodg · 2024-11-13T02:39:49Z

hey @zhaohuabing is the issue that we are generating the same name for the jwks and oidc clusters if both are set in the policy ? shouldnt we be using an additional string value to differentiate them ?

zhaohuabing · 2024-11-14T14:29:51Z

hey @zhaohuabing is the issue that we are generating the same name for the jwks and oidc clusters if both are set in the policy ? shouldnt we be using an additional string value to differentiate them ?

For the cluster generated by a url, EG uses the host and port for the cluster name to avoid creating duplicated clusters for the same host+port combination.

jwt: 
 providers:
   - remoteJWKS:
       uri: https://oidc.example.com/auth/realms/example/protocol/openid-connect/cert

oidc:
 provider:
   tokenEndpoint: https://oidc.example.com/oauth/token

The name of the generated cluster: oidc_example_com_443.

We could change this logic to generate an unique name for each single oidc or jwt configuration. However, we should also ensure that the translator shouldn't throw error if the cluster already exists.

arkodg · 2024-11-14T15:12:39Z

hey @zhaohuabing is the issue that we are generating the same name for the jwks and oidc clusters if both are set in the policy ? shouldnt we be using an additional string value to differentiate them ?

For the cluster generated by a url, EG uses the host and port for the cluster name to avoid creating duplicated clusters for the same host+port combination.
jwt: 
 providers:
   - remoteJWKS:
       uri: https://oidc.example.com/auth/realms/example/protocol/openid-connect/cert

oidc:
 provider:
   tokenEndpoint: https://oidc.example.com/oauth/token
The name of the generated cluster: oidc_example_com_443.

We could change this logic to generate an unique name for each single oidc or jwt configuration. However, we should also ensure that the translator shouldn't throw error if the cluster already exists.

is it safe to reuse the same cluster configuration ? is the naming different when use the backendRefs field ?

zhaohuabing · 2024-11-14T15:32:57Z

is it safe to reuse the same cluster configuration ? is the naming different when use the backendRefs field ?

It's safe to reuse the asme cluster for ulr generated cluster as the cluster confiugration is identical for the same host+port combination.

For OIDC provider with backendRefs, EG generate an unique cluster name like securitypolicy/envoy-gateway/policy-for-gateway/0

arkodg · 2024-11-14T15:47:27Z

internal/xds/translator/translator.go

 func addXdsCluster(tCtx *types.ResourceVersionTable, args *xdsClusterArgs) error {
 	// Return early if cluster with the same name exists
 	if c := findXdsCluster(tCtx, args.name); c != nil {
-		return ErrXdsClusterExists
+		return nil


this is not great from an API perspective for addXdsCluster, because the the old cluster with the same name may not have the same xdsClusterArgs as the current request, so the caller should decide how to handle this case, not this method imo

Not a strong opinion, but should the caller call findXdsCluster first and decide whether they should handle this situation?

The current approach has an implict assumption: every callers should check whether the return error isErrXdsClusterExists or not, and ignore ErrXdsClusterExists - this cause code duplications for every caller and can cause issues if the caller forget to handle this special case, like this one.

You're absolutely right, the method is doing two many things, fine to split it up, but developers will need to make sure they call both functions when writing new logic

For all the current callers, they just need to call addXdsCluster as they just simply ignore ErrXdsClusterExists and does nothing. For them, they can all safely assume the xdsClusterArgs is the same for the clusters with the same name.

I guess this pattern comes from the Kub Client API - it totally makes sense for the Kub Client API as there is a race condition between the client and the API server. However, EG doesn't need to use the same pattern here as it's a local cache and has no concurrent writes.

zhaohuabing · 2024-11-14T15:48:11Z

For OIDC provider with backendRefs, EG generate an unique cluster name like securitypolicy/envoy-gateway/policy-for-gateway/0.

Ha, there is also a bug here, the index is always 0, which generates duplicated name for different clusters if there're both ext auth and oidc whithin a SecurityPolicy. Will fix it in this PR as well.

arkodg · 2024-11-14T15:51:13Z

For OIDC provider with backendRefs, EG generate an unique cluster name like securitypolicy/envoy-gateway/policy-for-gateway/0.

Ha, there is also a bug here, the index is always 0, which generates duplicated name for different clusters if there're both ext auth and oidc whithin a SecurityPolicy. Will fix it in this PR as well.

nice catch, prob needs another prefix like jwt, oidc after policy name

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

zhaohuabing · 2024-11-14T17:17:06Z

internal/gatewayapi/securitypolicy.go

 		}

 		var oidc *ir.OIDC
 		if policy.Spec.OIDC != nil {
 			if oidc, err = t.buildOIDC(
 				policy,
 				resources,
-				gtwCtx.envoyProxy); err != nil {
+				gtwCtx.envoyProxy,  // TODO zhaohuabing: Only the last EnvoyProxy will be used as the OIDC name doesn't include the cluster index


This is a minor issue, will address it in a follow-up PR.

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

zhaohuabing added the cherrypick/release-v1.2 label Nov 12, 2024

zhaohuabing requested a review from a team as a code owner November 12, 2024 06:54

zhaohuabing marked this pull request as draft November 12, 2024 06:55

zhaohuabing added cherrypick/release-v1.2.2 and removed cherrypick/release-v1.2 labels Nov 12, 2024

zhaohuabing force-pushed the fix-existing-cluster branch from 87b5867 to 15264f6 Compare November 12, 2024 07:06

zhaohuabing marked this pull request as ready for review November 12, 2024 07:10

zhaohuabing force-pushed the fix-existing-cluster branch from 15264f6 to bda736c Compare November 12, 2024 07:15

zhaohuabing marked this pull request as draft November 12, 2024 07:15

zhaohuabing changed the title ~~fix: existing clusters and secretes~~ fix: translator reports errors for existing clusters and secretes Nov 12, 2024

zhaohuabing force-pushed the fix-existing-cluster branch 6 times, most recently from 21c1901 to a7d7e6b Compare November 13, 2024 00:01

zhaohuabing marked this pull request as ready for review November 13, 2024 00:07

fix: existing clusters and secretes

bd01257

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

zhaohuabing force-pushed the fix-existing-cluster branch from a7d7e6b to bd01257 Compare November 13, 2024 00:12

Merge branch 'main' into fix-existing-cluster

ab2a47c

arkodg reviewed Nov 14, 2024

View reviewed changes

fix cluster index for SP

4fe8508

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

zhaohuabing marked this pull request as draft November 14, 2024 16:57

zhaohuabing added 3 commits November 14, 2024 17:09

minor change

52b181d

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

minor change

6cca148

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

minor change

d4ec015

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

zhaohuabing force-pushed the fix-existing-cluster branch from b240792 to d4ec015 Compare November 14, 2024 17:15

minor change

16ba3fc

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

zhaohuabing commented Nov 14, 2024

View reviewed changes

Merge branch 'main' into fix-existing-cluster

2a25c00

zhaohuabing mentioned this pull request Nov 14, 2024

Only the last EnvoyProxy will be used as the OIDC name doesn't include the cluster index #4720

Open

zhaohuabing added 2 commits November 14, 2024 17:26

fix lint

f03dce4

Signed-off-by: Huabing Zhao <zhaohuabing@gmail.com>

Merge branch 'main' into fix-existing-cluster

bdfb1e7

zhaohuabing marked this pull request as ready for review November 14, 2024 17:26

zhaohuabing requested a review from arkodg November 14, 2024 17:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: translator reports errors for existing clusters and secretes #4707

fix: translator reports errors for existing clusters and secretes #4707

zhaohuabing commented Nov 12, 2024 •

edited

Loading

codecov bot commented Nov 12, 2024 •

edited

Loading

arkodg commented Nov 13, 2024

zhaohuabing commented Nov 14, 2024

arkodg commented Nov 14, 2024

zhaohuabing commented Nov 14, 2024

arkodg Nov 14, 2024

zhaohuabing Nov 14, 2024 •

edited

Loading

arkodg Nov 14, 2024

zhaohuabing Nov 14, 2024 •

edited

Loading

zhaohuabing commented Nov 14, 2024 •

edited

Loading

arkodg commented Nov 14, 2024

zhaohuabing Nov 14, 2024

fix: translator reports errors for existing clusters and secretes #4707

Are you sure you want to change the base?

fix: translator reports errors for existing clusters and secretes #4707

Conversation

zhaohuabing commented Nov 12, 2024 • edited Loading

codecov bot commented Nov 12, 2024 • edited Loading

Codecov Report

arkodg commented Nov 13, 2024

zhaohuabing commented Nov 14, 2024

arkodg commented Nov 14, 2024

zhaohuabing commented Nov 14, 2024

arkodg Nov 14, 2024

Choose a reason for hiding this comment

zhaohuabing Nov 14, 2024 • edited Loading

Choose a reason for hiding this comment

arkodg Nov 14, 2024

Choose a reason for hiding this comment

zhaohuabing Nov 14, 2024 • edited Loading

Choose a reason for hiding this comment

zhaohuabing commented Nov 14, 2024 • edited Loading

arkodg commented Nov 14, 2024

zhaohuabing Nov 14, 2024

Choose a reason for hiding this comment

zhaohuabing commented Nov 12, 2024 •

edited

Loading

codecov bot commented Nov 12, 2024 •

edited

Loading

zhaohuabing Nov 14, 2024 •

edited

Loading

zhaohuabing Nov 14, 2024 •

edited

Loading

zhaohuabing commented Nov 14, 2024 •

edited

Loading