Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLI command for getting memberlist state #4601

Closed
tnqn opened this issue Feb 3, 2023 · 0 comments · Fixed by #4611
Closed

CLI command for getting memberlist state #4601

tnqn opened this issue Feb 3, 2023 · 0 comments · Fixed by #4611
Assignees
Labels
kind/feature Categorizes issue or PR as related to a new feature.

Comments

@tnqn
Copy link
Member

tnqn commented Feb 3, 2023

Describe the problem/challenge you have

The Egress feature and ServiceExternalIP feature rely on memberlist cluster to know the state of Nodes, with which fail-over could be triggered when a Node is gone. Members in the memberlist cluster use a gossip protocol to communicate. Normally all members should have consistent information about the cluster eventually. But there could be implementation bugs or networking problems leading to inconsistency. For example, #4491 fixed a bug that dead Nodes didn't rejoin the cluster if network downtime lasts more than 15 seconds. It also happened several times that the ports required by gossip protocol, TCP and UDP 10351, were not allowed by firewall.

To troubleshoot such issues effeciently, we need to know the state of memberlist cluster of each antrea-agent. A CLI command below would be helpful:

$ antctl get memberlist
NODE      IP            STATUS
worker1   172.16.0.11   Alive
worker2   172.16.0.12   Alive
worker3   172.16.0.13   Dead

With the tool, we could know every antrea-agent's memberlist cluster state and find inconsitency when a brain-split happens.

To simplify troubleshooting, we could add the information to supportbundle as well, meaning antctl supportbundle and the supportbundleCollection API would collect it autotically.

@tnqn tnqn added the kind/feature Categorizes issue or PR as related to a new feature. label Feb 3, 2023
@tnqn tnqn added this to the Antrea v1.11 release milestone Feb 3, 2023
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 7, 2023
Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 7, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 8, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 9, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 13, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 14, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 20, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Atish-iaf added a commit to Atish-iaf/antrea that referenced this issue Feb 21, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes antrea-io#4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
tnqn pushed a commit that referenced this issue Feb 21, 2023
Add antrea agent command `antctl get memberlist`
to get state of memberlist cluster of antrea agent.

Fixes #4601

Signed-off-by: Kumar Atish <atish.iaf@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants