-
Notifications
You must be signed in to change notification settings - Fork 266
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rke-canal pod is not running due to incompatible ipset protocol version #4145
rke-canal pod is not running due to incompatible ipset protocol version #4145
Comments
In my experience the ipset kernel/userspace mismatch is only critical if something else adds ipsets with a newer version than the userspace tool supports. This matches your observation that the broken host has |
cc @rbrtbnfgl @manuelbuil we do need to bump the ipset version bundled with canal though, |
@brandond This is a fresh Ubuntu installation with no other tools installed in it. Is there a way to figure out the source of the ipsets with Revision:7? |
I'm not sure. The ipset names do suggest that they also came from Calio but I'm not sure why there would be some that used the host ipset version and others that used the one bundled in the image. I will defer to the network team tagged above. |
@brandond Also, |
yes, there is an effort in progress to update the base across all of our hardened images. |
Reopening for QA ( reminder not to use fix/fixes/so on alongside an issue number in PR descriptions @rbrtbnfgl ) |
/backport v1.26.5+rke2r1 |
/backport v1.25.10+rke2r1 |
/backport v1.24.14+rke2r1 |
Validated on with /
Environment DetailsInfrastructure
Node(s) CPU architecture, OS, and Version:
Cluster Configuration:
$ sudo mkdir -p /etc/rancher/rke2 && sudo cp config.yaml /etc/rancher/rke2
Validation Results:
|
Environmental Info:
RKE2 Version:
v1.26.4+rke2r1
Node(s) CPU architecture, OS, and Version:
Linux k8s-agent16 5.15.0-70-generic #77-Ubuntu SMP Tue Mar 21 14:02:37 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
Cluster Configuration:
2 servers, 16 agents all running Ubuntu 22.04
Describe the bug:
rke2-canal pods on some agents are not starting. The pod logs contain the following.
There was a similar issue reported at projectcalico/calico#5011. But, it's mentioned that it only happens if kube-proxy mode is ipvs and it shouldn't impact if the proxy-mode is iptables. I have confirmed that the proxy-mode is iptables. Here are the logs from the kube-proxy pod.
Steps To Reproduce:
Installed RKE2 using the following steps
Expected behavior:
Running
kubectl get pod -n kube-system
should result in allrke2-canal
pods running successfully.Actual behavior:
Some of the
rke2-canal
are stuck atReady 1/2
Additional context / logs:
On host:
On
rke2-canal
pod andcalico-node
container running in the same host:Note that this behavior is observed only on one server node and one agent node. All other nodes are working fine. One common thing on both these nodes is that the output of
ipset list
contained sets withRevision: 7
in them.Output of
ipset list
from the problematic agent node:Output of
ipset list
from the node that is working fine:The text was updated successfully, but these errors were encountered: