Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error log when running kuma multizone with 2 instances #5881

Closed
lukidzi opened this issue Jan 30, 2023 · 4 comments
Closed

Error log when running kuma multizone with 2 instances #5881

lukidzi opened this issue Jan 30, 2023 · 4 comments
Labels
kind/improvement Improvement on an existing feature triage/accepted The issue was reviewed and is complete enough to start working on it

Comments

@lukidzi
Copy link
Contributor

lukidzi commented Jan 30, 2023

What happened?

When running kuma multizone deployment with 2 instances on global you can notice an error log every 5min and 10 seconds. It is probably because of a timeout on the connection. We should check it and try to figure out if we can fix it.

ERROR	intercp.catalog.heartbeat	could not heartbeat the leader	{"error": "could not send a heartbeat to a leader: rpc error: code = Canceled desc = grpc: the client connection is closing", "errorVerbose": "rpc error: code = Canceled desc = grpc: the client connection is closing\ncould not send a heartbeat to a leader\ngh.neting.cc/kumahq/kuma/pkg/intercp/catalog.(*heartbeatComponent).heartbeat\n\tgh.neting.cc/kumahq/kuma@v0.0.0-20230127105205-446a49676bf8/pkg/intercp/catalog/heartbeat_component.go:90\ngh.neting.cc/kumahq/kuma/pkg/intercp/catalog.(*heartbeatComponent).Start\n\tgh.neting.cc/kumahq/kuma@v0.0.0-20230127105205-446a49676bf8/pkg/intercp/catalog/heartbeat_component.go:57\ngh.neting.cc/kumahq/kuma/pkg/plugins/bootstrap/k8s.componentRunnableAdaptor.Start\n\tgh.neting.cc/kumahq/kuma@v0.0.0-20230127105205-446a49676bf8/pkg/plugins/bootstrap/k8s/plugin.go:214\nsigs.k8s.io/controller-runtime/pkg/manager.(*runnableGroup).reconcile.func1\n\tsigs.k8s.io/controller-runtime@v0.14.1/pkg/manager/runnable_group.go:219\nruntime.goexit\n\truntime/asm_amd64.s:1571"}

2023-01-30T14:02:08.919Z	INFO	intercp.client.pool	closing connection due to lack of activity	{"url": "grpcs://ADDRESS_IP:5683"}
@lukidzi lukidzi added triage/pending This issue will be looked at on the next triage meeting kind/improvement Improvement on an existing feature labels Jan 30, 2023
@lukidzi lukidzi changed the title Error log when running kuma with Global CP with 2 instances Error log when running kuma multizone with 2 instances Jan 30, 2023
@jakubdyszkiewicz jakubdyszkiewicz added triage/accepted The issue was reviewed and is complete enough to start working on it and removed triage/pending This issue will be looked at on the next triage meeting labels Feb 6, 2023
@jakubdyszkiewicz
Copy link
Contributor

Triage: visible on Global CP on GKE

@github-actions github-actions bot added the triage/stale Inactive for some time. It will be triaged again label May 8, 2023
@github-actions
Copy link
Contributor

github-actions bot commented May 8, 2023

This issue was inactive for 90 days. It will be reviewed in the next triage meeting and might be closed.
If you think this issue is still relevant, please comment on it or attend the next triage meeting.

@lukidzi lukidzi removed the triage/stale Inactive for some time. It will be triaged again label May 8, 2023
@lahabana
Copy link
Contributor

lahabana commented Jun 2, 2023

Highly related a user has reported seeing:

ERROR	intercp.catalog.heartbeat	could not heartbeat the leader	{"error": "could not send a heartbeat to a leader: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial tcp 172.18.14.4:5683: i/o timeout\

It may be related to timeout yes. Here are some timestamps => every 20s
2023-06-01T15:30:06.424Z
2023-06-01T15:30:26.426Z
2023-06-01T15:30:46.428Z
2023-06-01T15:31:06.429Z
2023-06-01T15:31:26.430Z

@lahabana
Copy link
Contributor

This was fixed recently with #7323

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/improvement Improvement on an existing feature triage/accepted The issue was reviewed and is complete enough to start working on it
Projects
None yet
Development

No branches or pull requests

3 participants