Create clusters with HA masters by default #90

jimmycuadra · 2015-10-02T22:21:05Z

Ideally Kubernetes clusters should have highly available masters. Currently k8s nodes are auto scaled, but the master is not. This can be achieved with the combination of 1) an ELB and 2) either the podmaster (whose spec is already included in the public artifacts), or the use of fleet to guarantee that only one copy each of the controller manager and scheduler are running at once.

errm · 2015-10-05T20:07:52Z

👍 This is the one thing that is stopping is from switching to this setup already. it looks like podmaster is already configured so its just a case of dropping in an ELB...

ghost · 2015-10-06T01:33:53Z

+1

https://github.com/Samsung-AG/kraken

eliaslevy · 2016-01-11T17:09:14Z

See #147.

tomdee · 2016-04-19T21:48:49Z

I believe multimaster is now supported in this repo for k8s 1.2

mumoshu · 2016-04-29T08:13:12Z

@tomdee I have recently started looking into this, too. It can be supported if you modify cfn templates kube-aws wrote. Not out of the box though.

Let me share my incomplete thoughts just not to stop this discussion.

IFAIK, we have to think of HA for apiserver, scheduler/proxy/controller-manager, and etcd respectively.

apiservers seem to be state-less. So you just may want to have 2 or more of them(to not make an apiserver your SPOF). Then, at least, you need to tell workers where the live apiservers are. @eliaslevy seems to have done it in his PR #147 through an internal load balancer having a well-known dns name (Btw, thanks for sharing the great PR @eliaslevy !)
This can't be done out of the box with coreos-kubernetes yet.

scheduler/proxy/controller-manager should have --leader-elect=true on their startup. This seems to have already done.

For etcd, I guess you need to form a H/A etcd cluster consists of at-least 3 members. Each member should be located in different availability-zone(Btw, how everyone do this? Is there an AWS region which has 3 AZ open to its users?) to make single member's failure not to result in breaking quorum.

Well, so how everyone is doing it? :)

brandonweeks · 2016-04-29T17:29:06Z

@mumoshu here is a list. Most regions have at least three but there are a few with only two.

Personally we design around a two AZ per region model, so I would prefer the option to have five etcd servers across two AZs.

eliaslevy · 2016-04-29T19:26:45Z

@brandonweeks operating across only two AZs leaves you at risk of failure if a single AZ fails (the ones with the majority of etcd nodes), as you won't have a quorum.

vyshane mentioned this issue Oct 6, 2015

Support multi availability zone deployments on AWS #100

Closed

chancez added the platform/AWS label Oct 7, 2015

aaronlevy added kind/enhancement priority/P2 labels Dec 4, 2015

colhom closed this as completed Jan 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create clusters with HA masters by default #90

Create clusters with HA masters by default #90

jimmycuadra commented Oct 2, 2015

errm commented Oct 5, 2015

ghost commented Oct 6, 2015

eliaslevy commented Jan 11, 2016

tomdee commented Apr 19, 2016

mumoshu commented Apr 29, 2016 •

edited

Loading

brandonweeks commented Apr 29, 2016

eliaslevy commented Apr 29, 2016

Create clusters with HA masters by default #90

Create clusters with HA masters by default #90

Comments

jimmycuadra commented Oct 2, 2015

errm commented Oct 5, 2015

ghost commented Oct 6, 2015

eliaslevy commented Jan 11, 2016

tomdee commented Apr 19, 2016

mumoshu commented Apr 29, 2016 • edited Loading

brandonweeks commented Apr 29, 2016

eliaslevy commented Apr 29, 2016

mumoshu commented Apr 29, 2016 •

edited

Loading