
What is the proper way to compare resource object? #592

Closed
chenwng opened this issue Feb 6, 2019 · 26 comments
Labels
kind/support Categorizes issue or PR as a support question. lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@chenwng

chenwng commented Feb 6, 2019

In the generated code, reflect.DeepEqual is used to check whether the resource has changed. I created a deployment and then used reflect.DeepEqual to compare it with the one fetched from k8s. The issue is that the fetched deployment was changed (I assume by the deployment controller) to fill in default values, so it differs from the deployment I constructed. This triggers an update to the deployment, and it seems to happen continuously. Is this expected, or am I missing something?

Thanks.

@Adirio
Contributor

Adirio commented Feb 20, 2019

/triage support
Only pass the Deployment.Spec to reflect.DeepEqual. Deployment.Status is expected to change.
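
A minimal sketch of that suggestion (the desired/found names are hypothetical; later comments explain why even a spec-only comparison can misfire once the API server fills in defaults):

```go
import (
	"reflect"

	appsv1 "k8s.io/api/apps/v1"
)

// specChanged compares only the spec; the status is written by the
// deployment controller and is expected to keep changing.
func specChanged(desired, found *appsv1.Deployment) bool {
	return !reflect.DeepEqual(desired.Spec, found.Spec)
}
```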

@k8s-ci-robot k8s-ci-robot added the kind/support Categorizes issue or PR as a support question. label Feb 20, 2019
@chenwng
Author

chenwng commented Feb 22, 2019

Thanks for your reply. I did some further tests. As you said, Deployment.Status kept being updated.
I also dumped the deployment spec fetched from the API server. It looks like it was updated by the deployment controller to fill in default values, but that was not the reason for the update events; I think I misunderstood where those events came from.
If Deployment.Status is something I don't care about, how can I ignore those update events?

@Adirio
Contributor

Adirio commented Feb 22, 2019

Just don't take any action.
You could probably use a predicate to filter those events too, but that will make little difference; see the sketch below.
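
For the predicate route, recent controller-runtime versions ship predicate.GenerationChangedPredicate, which drops update events where metadata.generation did not change; status-only updates do not bump the generation. A sketch, with a hypothetical custom resource and reconciler:

```go
import (
	appsv1 "k8s.io/api/apps/v1"
	ctrl "sigs.k8s.io/controller-runtime"
	"sigs.k8s.io/controller-runtime/pkg/builder"
	"sigs.k8s.io/controller-runtime/pkg/predicate"

	myv1 "example.com/project/api/v1" // hypothetical CRD package
)

func (r *MyAppReconciler) SetupWithManager(mgr ctrl.Manager) error {
	return ctrl.NewControllerManagedBy(mgr).
		For(&myv1.MyApp{}).
		// status-only Deployment updates are filtered out here
		Owns(&appsv1.Deployment{}, builder.WithPredicates(predicate.GenerationChangedPredicate{})).
		Complete(r)
}
```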

@nrfox

nrfox commented Feb 22, 2019

I have run into this issue as well. I think doing reflect.DeepEqual(generatedSpec, existingSpec) on the whole spec can be problematic for a couple of reasons.

  1. If the apiserver does any defaulting of the resource object, then the generated spec from the controller needs to be filled out entirely to ensure that it matches the existing spec.
  2. The check reflect.DeepEqual(generatedSpec, existingSpec) will always fail for certain resource types. Services of type ClusterIP that do not have their ClusterIP field set will get a ClusterIP assigned by the apiserver (or something internal). The DeepEqual check will fail, and the controller will try to Update the service spec with the empty ClusterIP field and get an error back, since this field cannot be updated once it has been set.

Maybe it’d be better if the scaffolding only compared fields that the controller should act on or that are part of the CRD abstraction, e.g. the Replicas field on the deployment should be updated if it doesn’t match the Replicas field on the CRD, instead of doing a blanket DeepEqual on the spec; a sketch of that follows.
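
A minimal sketch of that narrower, field-by-field comparison (the replicas parameter standing in for the CRD field is hypothetical):

```go
import appsv1 "k8s.io/api/apps/v1"

// reconcileReplicas updates only the one field this controller owns and
// reports whether the caller should issue an Update.
func reconcileReplicas(replicas int32, found *appsv1.Deployment) bool {
	if found.Spec.Replicas == nil || *found.Spec.Replicas != replicas {
		found.Spec.Replicas = &replicas
		return true
	}
	return false
}
```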

@schweikert

It also bothered me that too many unnecessary updates were done because of the DeepEqual, so I implemented this approach:

  • Use hashstructure to calculate a hash of the deployment that I want to have
  • Store that hash under an annotation ("mydomain/last-applied-hash")
  • In the reconcile function, compare the stored hash with the newly computed hash, and only update if they differ

This way, I don't need to do a deep compare each time the reconcile function is triggered, and I also push fewer changes to kubernetes.
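
A sketch of that pattern, assuming the hashstructure v1 API (hashstructure.Hash(v, nil)) and the annotation key from the list above:

```go
import (
	"fmt"

	"github.com/mitchellh/hashstructure"
	appsv1 "k8s.io/api/apps/v1"
)

const lastAppliedHash = "mydomain/last-applied-hash"

// specHash hashes the spec of the deployment we want to have.
func specHash(d *appsv1.Deployment) (string, error) {
	h, err := hashstructure.Hash(d.Spec, nil)
	if err != nil {
		return "", err
	}
	return fmt.Sprintf("%d", h), nil
}

// needsUpdate compares the hash stored on the live object with the hash of
// the desired spec; after a successful Update, store the new hash back into
// the annotation.
func needsUpdate(desired, found *appsv1.Deployment) (bool, error) {
	h, err := specHash(desired)
	if err != nil {
		return false, err
	}
	return found.Annotations[lastAppliedHash] != h, nil
}
```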

@daxmc99

daxmc99 commented May 29, 2019

https://github.com/banzaicloud/k8s-objectmatcher might be a possible solution to this?

@DirectXMan12
Contributor

Server-side apply is the ultimate solution, when it eventually lands as beta.

@pepov

pepov commented Aug 2, 2019

@DirectXMan12 will the server-side apply solution work without contacting the API server? Isn't a dry-run call against the API necessary in that case as well?

@pepov

pepov commented Aug 2, 2019

Ideally (for example with https://github.com/banzaicloud/k8s-objectmatcher and controller-runtime of course) we get some (or most) of our managed objects from the cache and we can decide whether there are any changes or not locally.

I understand server-side apply is more elegant, but if it involves a call to the API server then it's not exactly a solution to the above problem.

@DirectXMan12
Contributor

@pepov with server-side apply, you do need to contact the api server, but you don't need to compare objects. Instead, you explicitly set all the fields you care about on an empty object (not one you've gotten from the cache) and then submit that to the API server. The API server takes care of figuring out the difference. That means that you always submit against the API server, but that the submission doesn't always change things, without having to care about local comparison.
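
A sketch of that flow with controller-runtime's client (the field-owner string is a hypothetical name, and the cluster must have server-side apply enabled):

```go
import (
	"context"

	appsv1 "k8s.io/api/apps/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

func applyDeployment(ctx context.Context, c client.Client, desired *appsv1.Deployment) error {
	// desired is built from scratch with only the fields this controller
	// cares about; an apply patch needs TypeMeta set explicitly
	desired.TypeMeta = metav1.TypeMeta{APIVersion: "apps/v1", Kind: "Deployment"}
	return c.Patch(ctx, desired, client.Apply,
		client.FieldOwner("my-operator"), // hypothetical field manager name
		client.ForceOwnership)
}
```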

@DirectXMan12
Contributor

(whoops, didn't mean to close)

@DirectXMan12 DirectXMan12 reopened this Aug 5, 2019
@pepov

pepov commented Aug 6, 2019

@DirectXMan12 the issue in our case was that we hit the API server too hard with too many requests, and the overall time for an operator cycle also increased because of this.

@chenwng
Author

chenwng commented Aug 6, 2019

@pepov with server-side apply, you do need to contact the api server, but you don't need to compare objects. Instead, you explicitly set all the fields you care about on an empty object (not one you've gotten from the cache) and then submit that to the API server. The API server takes care of figuring out the difference. That means that you always submit against the API server, but that the submission doesn't always change things, without having to care about local comparison.

In this case, when should we submit? If we submit on every reconciliation, will that cause too much load on the API server in the worst case?
Btw, is this server-side apply available now?

@DirectXMan12
Contributor

@ChenDoRo ideally, no, it should not cause too much load, but it depends on your use case. It's in alpha now, and will most likely be beta in the next Kubernetes release.

@pepov was it actually a server-side issue? You can also hit client-side rate limiting pretty easily with the default values (see the sketch below). At any rate, I'd be curious to see your numbers on that.
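
For reference on the client-side limits: client-go defaults to QPS 5 / Burst 10, which a busy operator can exhaust quickly. A sketch of raising them (the values are illustrative):

```go
import (
	"k8s.io/client-go/rest"
	ctrl "sigs.k8s.io/controller-runtime"
)

func restConfig() *rest.Config {
	cfg := ctrl.GetConfigOrDie()
	// lift the client-side throttle before handing cfg to the manager
	cfg.QPS = 50
	cfg.Burst = 100
	return cfg
}
```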

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 4, 2019
@JeremyMarshall

Maybe late to the party, but I raised this in kubernetes and they pointed me at kubernetes/apimachinery#75:

apiequality.Semantic.DeepEqual (see https://godoc.org/k8s.io/apimachinery/pkg/api/equality)
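
The difference from plain reflect.DeepEqual is that Semantic registers custom equality functions, e.g. for resource.Quantity, so semantically equal values compare equal. A runnable sketch:

```go
package main

import (
	"fmt"
	"reflect"

	apiequality "k8s.io/apimachinery/pkg/api/equality"
	"k8s.io/apimachinery/pkg/api/resource"
)

func main() {
	a := resource.MustParse("1")
	b := resource.MustParse("1000m")
	fmt.Println(apiequality.Semantic.DeepEqual(a, b)) // true: compared numerically
	fmt.Println(reflect.DeepEqual(a, b))              // false: the string representations differ
}
```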

@caarlos0
Contributor

apiequality.Semantic.DeepEqual (see https://godoc.org/k8s.io/apimachinery/pkg/api/equality)

this still won't work in some cases, like when the default for a field is not its zero value.

I ended up doing the same thing @schweikert recommended.

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Dec 27, 2019
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@feloy

feloy commented May 8, 2020

I also have this problem using reflect.DeepEqual or equality.Semantic.DeepEqual, because some fields are set to non-zero default values by some controller (like ImagePullPolicy, RestartPolicy, and so on) when they are not set by the operator.

I found this utility function that does the job for me: it compares only the fields that are non-zero in the expected struct:

```go
import "k8s.io/apimachinery/pkg/api/equality"

if !equality.Semantic.DeepDerivative(expected.Spec, found.Spec) {
	// some field set by the operator has changed
}
```
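
For context, DeepDerivative treats zero-valued fields in the expected (first) argument as wildcards, which is why server-side defaulting no longer trips the comparison. A runnable sketch:

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	apiequality "k8s.io/apimachinery/pkg/api/equality"
)

func main() {
	expected := corev1.Container{Name: "app"}                                  // ImagePullPolicy left unset by the operator
	found := corev1.Container{Name: "app", ImagePullPolicy: corev1.PullAlways} // defaulted server-side
	fmt.Println(apiequality.Semantic.DeepDerivative(expected, found))          // true: unset fields are ignored
}
```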

@mjaow

mjaow commented Jan 27, 2021

I also have this problem using reflect.DeepEqual or equality.Semantic.DeepEqual, because some fields are set to non-zero default values by some controller (like ImagePullPolicy, RestartPolicy, and so on) when they are not set by the operator.

I found this utility function that does the job for me: it compares only the fields that are non-zero in the expected struct:

```go
import "k8s.io/apimachinery/pkg/api/equality"

if !equality.Semantic.DeepDerivative(expected.Spec, found.Spec) {
	// some field set by the operator has changed
}
```

It will ignore some updates, though: maybe I'm trying to add a field to the deployment (or delete some of its fields), but DeepDerivative will ignore that.

@devlifealways

Same here, Semantic.DeepDerivative ignores a couple of fields, which causes the test to fail each time.

@rda3mon

rda3mon commented Apr 11, 2021

It also bothered me that too many unnecessary updates were done because of the DeepEqual, so I implemented this approach:

  • Use hashstructure to calculate a hash of the deployment that I want to have
  • Store that hash under an annotation ("mydomain/last-applied-hash")
  • In the reconcile function, compare the stored hash with the newly computed hash, and only update if they differ

This way, I don't need to do a deep compare each time the reconcile function is triggered, and I also push fewer changes to kubernetes.

This works very well, but you can avoid one more dependency by using crypto/sha256 on the marshalled spec instead of hashstructure to get a hash. Here is an example:

```go
// note: the hash store is in memory only, so it is lost on operator restart
var hashStore = make(map[string]string)

// asSha256 hashes the marshalled object.
func asSha256(b []byte) string {
	h := sha256.New()
	h.Write(b)
	return fmt.Sprintf("%x", h.Sum(nil))
}

// inside Reconcile:
newSS := r.buildStatefulSet(m, d)
newSSMarshal, err := json.Marshal(newSS)
if err != nil {
	return ctrl.Result{}, err
}
if hash := asSha256(newSSMarshal); hash != hashStore[newSS.Name] {
	log.Info("Updating StatefulSet", "StatefulSet.Namespace", newSS.Namespace, "StatefulSet.Name", newSS.Name)
	if err := r.Update(ctx, newSS); err != nil {
		log.Error(err, "Failed to update StatefulSet", "StatefulSet.Namespace", newSS.Namespace, "StatefulSet.Name", newSS.Name)
		return ctrl.Result{}, err
	}
	hashStore[newSS.Name] = hash
	return ctrl.Result{Requeue: true}, nil
}
```

Chrisbattarbee added a commit to palantir/k8s-spark-scheduler that referenced this issue Oct 21, 2021
k8s sets fields when we submit our CRD to the api, this causes issues when we come to compare local versions to it later. kubernetes-sigs/kubebuilder#592
Chrisbattarbee added a commit to palantir/k8s-spark-scheduler that referenced this issue Oct 21, 2021
* Loosen equality on crds.

k8s sets fields when we submit our CRD to the api, this causes issues when we come to compare local versions to it later. kubernetes-sigs/kubebuilder#592

* Remove redundant check

* Remove redundant word

* Address comments
@aweis89

aweis89 commented Jun 27, 2022

It also bothered me that too many unnecessary updates were done because of the DeepEqual, so I implemented this approach:

  • Use hashstructure to calculate a hash of the deployment that I want to have
  • Store that hash under an annotation ("mydomain/last-applied-hash")
  • In the reconcile function, compare the stored hash with the newly computed hash, and only update if they differ

This way, I don't need to do a deep compare each time the reconcile function is triggered, and I also push fewer changes to kubernetes.

One thing to be mindful of with this approach is that it doesn't eliminate configuration drift originating outside the controller in question. If someone updates the controlled object's state, say using kubectl, so that the actual state goes out of sync with the desired state, the generated hash still won't differ and the controller won't revert that change (until the reconciliation happens to produce a different result from its last update). This is often not how controllers are meant to behave.

The controllerutil lib uses the equality.Semantic.DeepEqual(existing, obj) check, which I think is generally what folks should use, for the reason above; a sketch follows.
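
A sketch of that controllerutil pattern (the reconciler type and the set of copied fields are hypothetical); CreateOrUpdate fetches the object, runs the mutate function, and only issues an Update when equality.Semantic.DeepEqual reports a change:

```go
import (
	"context"

	appsv1 "k8s.io/api/apps/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/controller-runtime/pkg/controller/controllerutil"
)

func (r *MyReconciler) reconcileDeployment(ctx context.Context, desired *appsv1.Deployment) error {
	dep := &appsv1.Deployment{ObjectMeta: metav1.ObjectMeta{
		Name:      desired.Name,
		Namespace: desired.Namespace,
	}}
	_, err := controllerutil.CreateOrUpdate(ctx, r.Client, dep, func() error {
		// copy only the fields this controller owns onto the fetched object,
		// so drift introduced via kubectl is overwritten on the next reconcile
		dep.Spec.Replicas = desired.Spec.Replicas
		dep.Spec.Template = desired.Spec.Template
		return nil
	})
	return err
}
```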

@Madongming

In the generated code, reflect.DeepEqual is used to check whether the resource has changed. I created a deployment and then used reflect.DeepEqual to compare it with the one fetched from k8s. The issue is that the fetched deployment was changed (I assume by the deployment controller) to fill in default values, so it differs from the deployment I constructed. This triggers an update to the deployment, and it seems to happen continuously. Is this expected, or am I missing something?

Thanks.

I also encountered a similar problem. The solution is to do the update first with the client.DryRunAll option, and then use reflect.DeepEqual to compare the Specs to achieve the goal. Code reference:

```go
// client package is "sigs.k8s.io/controller-runtime/pkg/client"
// newObject is the object we built; curObject is the one currently in the cluster.
// The dry-run update fills newObject in with server-side defaults without persisting anything.
if err := r.Client.Update(ctx, newObject, client.DryRunAll); err != nil {
	return err
}

if reflect.DeepEqual(curObject.Spec, newObject.Spec) {
	logger.Info("Object is not changed, skip update")
	return nil
}
if err := r.Client.Update(ctx, newObject); err != nil {
	return err
}
```

It works now.
