i/o timeout in coredns pod #1427
Comments
@davigar15 has reported the same issue. This is sporadic, right? Some things we could try include using the host's resolver. Could anyone try any of the above and report back with results? [1] https://kubernetes.io/docs/tasks/administer-cluster/dns-custom-nameservers/ |
I do not think this is a sporadic/transient issue. I have tried for 3 days continuously. As mentioned in my comment here, I did not face this issue on an AWS EC2 instance with the Ubuntu 20.04 Server image. Until yesterday I was trying on Ubuntu Desktop 20.04 (running in VMware Player), and today I tested with Ubuntu 20.04 Server locally (also running in VMware Player); not very surprisingly, Kubeflow was deployed successfully. Similar behavior is mentioned here |
I just tried
|
@bipinm: can you expand a little more on this?
Which pod(s) did you go into, and which other public IPs did you try? |
After debugging with @davigar15, I think this issue is not actually Kubeflow-specific, and is a general networking issue that starts happening after a computer with microk8s is rebooted. @bipinm, @atamahjoubfar, can you verify that this is related to rebooting the host machine for microk8s? @davigar15 says that when he runs into this issue, reinstalling the microk8s snap fixes things for him. |
I reinstalled |
Without a restart after
This time instead of a re-start, ran
In my previous tests on Ubuntu 20.04 Server I followed a similar restart step but did not encounter this problem. It only seems to occur on the 20.04 Desktop version. @knkski: The pod I used for the ping test was nginx-ingress-microk8s-controller Now Test after install of microk8s +
Test after install of microk8s restart (+ failed
|
@bipinm immediately after a reboot the k8s networking is not correctly setup. During that period I was getting:
Within 2 to 3 minutes the pods were reporting state |
@ktsakalozos, I was running
I will try this once again to confirm; I am not absolutely sure whether I rebooted after step 6 |
@ktsakalozos , same here. After manually stopping and starting the cluster via |
@ktsakalozos not rebooting the machine or
|
I am having this issue as well; I noticed it after a reboot. Restarting (disable/enable) dns did not fix for me either. I also tried switching off of 8.8.x.x but that did not help either. Stopping all of microk8s (
The symptom for me is that other services running in the cluster get errors like |
Hi @dkolbly could you attach the tarball produced by There is also this page that may help in debugging DNS resolution issues: https://kubernetes.io/docs/tasks/administer-cluster/dns-debugging-resolution/ |
Thanks @ktsakalozos I was not aware of that debugging page, and I didn't think to grab an inspect while it was broken, but here is the current state of the system in case it helps. FWIW, I'm going to need to power cycle the system this weekend to put it on a UPS so I'll keep an eye for a recurrence of the problem. |
Any updates on this? I have run into this issue multiple times as well. In my case coredns cannot even talk to the master on the same machine: |
I believe this issue is fixed in #1635, which introduces handling around the calico networking. If anybody wants to try it out, it'll be available via |
I'm currently getting in coredns pod with the latest/edge, could my issue be related to this one?
|
I'm also experiencing this issue on I run:
Wait for it to do its thing, then:
And it's full of:
I've tried both v1.21 and v1.22-alpha.1 and the issue is present in both. |
@brendanmckenzie I feel this might be the dns pod not being able to reach 8.8.8.8. Did you try setting a different forward dns as described in [1]? |
The issue is present no matter what forwarding DNS server I use. Additionally - other pods are able to ping 8.8.8.8 (so is the host machine).
And the subsequent logs from coredns -
|
🤦♂️ for some reason, port 53 outbound requests from my server were being blocked. I switched to using my hosting provider's DNS and now coredns is working as expected. |
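The comment above is a reminder that a successful ping to 8.8.8.8 (ICMP) says nothing about whether DNS traffic on port 53 gets through. A minimal sketch of a direct check follows, assuming plain UDP DNS and Python's standard library only; the function names `build_query` and `dns_port_reachable` are hypothetical and are not part of microk8s or CoreDNS.

```python
import socket
import struct


def build_query(hostname, query_id=0x1234):
    """Build a minimal DNS query packet for an A record (RFC 1035 layout)."""
    # Header: ID, flags (recursion desired), 1 question, 0 answer/authority/additional.
    header = struct.pack("!HHHHHH", query_id, 0x0100, 1, 0, 0, 0)
    # Question name: each label prefixed by its length, terminated by a zero byte.
    qname = b"".join(
        bytes([len(label)]) + label.encode("ascii")
        for label in hostname.rstrip(".").split(".")
    ) + b"\x00"
    # QTYPE=1 (A record), QCLASS=1 (IN).
    return header + qname + struct.pack("!HH", 1, 1)


def dns_port_reachable(server, hostname="api.jujucharms.com", timeout=3.0):
    """Return True if `server` answers a UDP query on port 53 within `timeout`."""
    query = build_query(hostname)
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        try:
            sock.sendto(query, (server, 53))
            reply, _ = sock.recvfrom(512)
        except OSError:
            # Covers timeouts as well as "network unreachable" style failures.
            return False
    # A genuine reply echoes the query ID in its first two bytes.
    return reply[:2] == query[:2]
```

Calling `dns_port_reachable("8.8.8.8")` on the host and again from inside a pod should help narrow down whether outbound port 53 is blocked at the host or network level or only from within the cluster. A `False` result only means no reply arrived in time; it does not identify where the packet was dropped.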
I'm facing the exact same issue. Has anyone come up with a solution to this? |
I fixed this in my setup by changing the CIDR as detailed in the doco below. It appears there are network and DNS resolution issues when your host's DNS / local network is within the default microk8s pod subnet of 10.1.0.0/16. |
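To check whether a setup falls into the overlap described above, the host's DNS server address or LAN subnet can be compared against the pod subnet. This is a rough sketch using Python's standard `ipaddress` module; the helper name `conflicts_with_pod_cidr` is hypothetical, and the 10.1.0.0/16 default is taken from the comment above, so it may differ on a cluster whose CIDR has already been changed.

```python
import ipaddress

# Default microk8s pod subnet, as quoted in the comment above.
POD_CIDR = ipaddress.ip_network("10.1.0.0/16")


def conflicts_with_pod_cidr(address_or_network, pod_cidr=POD_CIDR):
    """Return True if a host address or LAN subnet overlaps the pod subnet."""
    # strict=False lets callers pass a single address or an address with a
    # prefix length (e.g. "10.1.5.7/24") instead of a clean network address.
    net = ipaddress.ip_network(address_or_network, strict=False)
    return net.overlaps(pod_cidr)
```

For example, passing the nameserver address listed in the host's resolver configuration, or the subnet of the host's main interface, shows whether either one sits inside the pod range.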
I faced this issue in the following environment:
and upgrading to Ubuntu 22.04 solved it. I hope this is useful for fixing this issue. |
I faced this issue after rebooting my machine in the following environment:
Fixed by what was already suggested here: microk8s stop/start |
Didn't help |
I also face this issue in the following environment: Ubuntu 20.04 |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Copying from #958 (comment), as this looks to be the crux of the issue. For some reason, some queries to 8.8.8.8 and 8.8.4.4 for api.jujucharms.com are failing: