This repository has been archived by the owner on Sep 30, 2020. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 295
metrics server install-kube-system fix causing intermittent hang on new clusters #1424
Comments
omar-nahhas
pushed a commit
to HotelsDotCom/kube-aws
that referenced
this issue
Jul 26, 2018
Moving the installation of kube-proxy up, just after kukbe-system-ns
omar-nahhas
added a commit
to HotelsDotCom/kube-aws
that referenced
this issue
Jul 26, 2018
The fix is to move the installation of kube-proxy up in the install-kube-system script, so it gets a chance to start before the installation checks for it and goes into a loop.
omar-nahhas
added a commit
to HotelsDotCom/kube-aws
that referenced
this issue
Jul 26, 2018
Fixing indentation.
mateiidavid
pushed a commit
to HotelsDotCom/kube-aws
that referenced
this issue
Aug 14, 2018
…ubernetes-retired#1426) The fix is to move the installation of kube-proxy up in the install-kube-system script, so it gets a chance to start before the installation checks for it and goes into a loop. Fixes kubernetes-retired#1424
tyrannasaurusbanks
pushed a commit
to tyrannasaurusbanks/kube-aws
that referenced
this issue
Sep 14, 2018
The fix is to move the installation of kube-proxy up in the install-kube-system script, so it gets a chance to start before the installation checks for it and goes into a loop.
tyrannasaurusbanks
pushed a commit
to tyrannasaurusbanks/kube-aws
that referenced
this issue
Sep 14, 2018
Fixing indentation.
tyrannasaurusbanks
pushed a commit
to tyrannasaurusbanks/kube-aws
that referenced
this issue
Sep 14, 2018
…e-proxy-race-condition to hcom-flavour * commit '1d3373d1c2d7a6db17df8dfcbc14606b8fa3c9ad': Fix for issue: kubernetes-retired#1424 Fix for issue: kubernetes-retired#1424
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
It looks like there has been an issue within 'install-kube-system' when the metrics service is enabled which was documented and fixed in issue: #1039 (comment). We are now finding that when we spin up some new clusters that they get stuck within 'install-kube-system' which is not completing and allowing 'cfn-signal' to fire.
We see it get stuck in the logs...
Looking at 'install-kube-system' it looks to be stuck in this code: -
It seems that this code executes before the code that installs the kube-proxy and so gets stuck in the inner loop waiting for the proxy to come up.
The text was updated successfully, but these errors were encountered: