Retry failed master startup once #2572

Merged
merged 1 commit into from
Oct 10, 2016

ironcladlou
Contributor

@ironcladlou ironcladlou commented Oct 7, 2016

Master startup can fail when EC2 transparently reallocates the block
storage, causing etcd writes to temporarily fail. Retry failures blindly
just once to allow time for this transient condition to resolve and for
systemd to restart the master (which will eventually succeed).

etcd-io/etcd#3864
openshift/origin#6065
openshift/origin#6447
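
A minimal sketch of the retry-once idea described above, written in Python for illustration rather than as the playbook change itself. The names `start_with_one_retry`, `start`, `delay_seconds`, and `sleep` are hypothetical and not taken from this PR's diff, and the 5-second default delay is an assumption.

```python
import time
from typing import Callable


def start_with_one_retry(
    start: Callable[[], bool],
    delay_seconds: float = 5.0,  # assumed value, not from the PR
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Attempt a start; on failure, wait once and try exactly one more time.

    `start` is a hypothetical callable that returns True when the service
    came up. The first failure is ignored blindly (its cause is not
    inspected), which mirrors the description above.
    """
    if start():
        return True
    # Give the transient condition (e.g. storage reallocation) time to clear.
    sleep(delay_seconds)
    # Second and final attempt: its result is the overall result.
    return start()
```

Limiting this to a single retry keeps a genuinely broken master from being masked: a failure that persists past the second attempt is still reported.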

@ironcladlou
Contributor Author

Still testing this, but wanted to get it ready for discussion.

@ironcladlou
Contributor Author

@sdodson @abutcher

@sdodson
Member

sdodson commented Oct 7, 2016

aos-ci-test

@abutcher
Member

abutcher commented Oct 7, 2016

Will we also want to do this with the controllers service for HA installs?

@openshift-bot

@sdodson
Member

sdodson commented Oct 8, 2016

Yeah, I think we should probably do this for all the cases where we're
starting services for the first time.


@sdodson
Member

sdodson commented Oct 10, 2016

I'll get those in my PR.

@sdodson sdodson merged commit a37dc40 into openshift:master Oct 10, 2016