
Design doc for multiple worker node groups support #757

Merged · 9 commits · Dec 23, 2021

Conversation

@bnrjee (Contributor) commented Dec 3, 2021

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@bnrjee bnrjee requested a review from vivek-koppuru December 3, 2021 19:35
@eks-distro-bot eks-distro-bot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Dec 3, 2021
@bnrjee bnrjee requested a review from abhay-krishna December 3, 2021 20:11
@bnrjee bnrjee force-pushed the wng-doc branch 3 times, most recently from 8ba7400 to 4b5c5b7 on December 4, 2021 04:18
@eks-distro-bot eks-distro-bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Dec 4, 2021
@g-gaston (Member) left a comment

This looks very good

Inline review threads on designs/multiple-worker-node-groups.md (resolved)

For each group, we will append these three fields corresponding to that group in the CAPI spec.

Right now, the CLI assumes that there will be only one group, treating the worker node group configuration array as a collection with a single element; as a result, the controller refers only to the first element of this array in different places in the code. We therefore need to perform the same operations in loops, including CAPI spec creation, cluster spec validation, etc. Once a CAPI spec is created with this approach, the workload cluster will be created with multiple worker node groups.
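To make the looping concrete, here is a minimal Go sketch of iterating over the worker node group configurations instead of indexing element 0. The type and function names are hypothetical stand-ins, not the actual eks-anywhere API; the real types live in the project's API packages.

```go
package main

import "fmt"

// WorkerNodeGroupConfiguration is a simplified stand-in for the EKS-A
// cluster spec field of the same name.
type WorkerNodeGroupConfiguration struct {
	Name  string
	Count int
}

// capiObjectNames sketches the per-group loop: for every group we emit
// one set of CAPI object names (KubeadmConfigTemplate, MachineDeployment,
// machine template) rather than assuming a single group at index 0.
// The naming scheme here is illustrative only.
func capiObjectNames(clusterName string, groups []WorkerNodeGroupConfiguration) []string {
	var names []string
	for _, g := range groups { // one set of CAPI objects per group
		names = append(names,
			fmt.Sprintf("%s-%s-kubeadm-config-template", clusterName, g.Name),
			fmt.Sprintf("%s-%s-machine-deployment", clusterName, g.Name),
			fmt.Sprintf("%s-%s-worker-node-template", clusterName, g.Name),
		)
	}
	return names
}

func main() {
	groups := []WorkerNodeGroupConfiguration{{Name: "md-0", Count: 3}, {Name: "md-1", Count: 2}}
	for _, n := range capiObjectNames("eksa-test", groups) {
		fmt.Println(n)
	}
}
```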
Member

It doesn't need to be part of this doc at all, but it might be a good idea to document all the places where this assumption is being made and how deeply they go (at least before starting the execution). Maybe something we can do in parallel to this review.

I fear this refactor might be a bit more complex/lengthy than it seems

Contributor Author

We can discuss this separately.

@bnrjee bnrjee requested a review from g-gaston December 7, 2021 18:52
apiVersion: infrastructure.cluster.x-k8s.io/v1alpha3
kind: VSphereMachineTemplate
metadata:
  name: eksa-test-worker-node-template-1638469395669
Member

Should we be mapping the VSphereMachineTemplate to the worker node group, which means we would have a eksa-test-1-worker-node-template-* and eksa-test-2-worker-node-template-*? I think it gets a little complicated when we think about maintaining a mapping from worker node groups and machine config objects to the capi template, especially when introducing/removing a machine config. Unless we just say that every worker node group configuration warrants a new capi template spec, regardless of whether it references the same machine config or not.

Based on what it is, would like to see a note about that as a sentence or two in the design doc here.

Member

Curious about this as well
In particular: especially when introducing/removing a machine config, super interesting problem I didn't think about


For each group, we will append these three fields corresponding to that group in the CAPI spec.

Right now, the CLI assumes that there will be only one group, treating the worker node group configuration array as a collection with a single element; as a result, the controller refers only to the first element of this array in different places in the code. We therefore need to perform the same operations in loops, including CAPI spec creation, cluster spec validation, etc. Once a CAPI spec is created with this approach, the workload cluster will be created with multiple worker node groups.
Member

When we introduce multiple machine configs, are we going to use Go array templating to loop or maintain a default capi spec containing worker node configuration, and append to the resultant capi spec depending on how many worker node groups that we have? I would prefer to do the latter so that we control generating new capi worker node specs based on the number of machine config objects we have configured.

Member

A third option: use the CAPI API (Go) structs.
This one has my vote

Contributor Author

We will use Go structs.


Also, it must be ensured that at least one of the worker node groups does not have a `NoExecute` or `NoSchedule` taint. This validation will be done at the preflight validation stage.
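A minimal Go sketch of that preflight check follows. The types are simplified stand-ins (the real validation operates on the eks-anywhere cluster spec types and `corev1.Taint`), and the function name is hypothetical.

```go
package main

import "fmt"

// Taint is a simplified stand-in for corev1.Taint.
type Taint struct {
	Key    string
	Effect string // e.g. "NoSchedule", "NoExecute"
}

// WorkerNodeGroup is a simplified stand-in for a worker node group
// configuration from the cluster spec.
type WorkerNodeGroup struct {
	Name   string
	Taints []Taint
}

// validateSchedulableGroup sketches the preflight rule: at least one
// worker node group must carry no NoExecute or NoSchedule taint, so
// ordinary workloads have somewhere to run.
func validateSchedulableGroup(groups []WorkerNodeGroup) error {
	for _, g := range groups {
		schedulable := true
		for _, t := range g.Taints {
			if t.Effect == "NoExecute" || t.Effect == "NoSchedule" {
				schedulable = false
				break
			}
		}
		if schedulable {
			return nil // found an untainted (schedulable) group
		}
	}
	return fmt.Errorf("at least one worker node group must have no NoExecute or NoSchedule taint")
}

func main() {
	groups := []WorkerNodeGroup{
		{Name: "md-0", Taints: []Taint{{Key: "dedicated", Effect: "NoSchedule"}}},
		{Name: "md-1"}, // untainted, so validation passes
	}
	fmt.Println(validateSchedulableGroup(groups))
}
```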

The examples in this design are for the vSphere provider, but the same strategy applies to other providers as well.
Member

Does this mean that for Docker, only changing the taints for a worker node group would warrant a new CAPI template spec, or does each worker node group correspond to a separate CAPI template spec even if the values are exactly the same?

Contributor Author

Yes, for Docker also, we will be adding a KubeadmConfigTemplate, MachineDeployment, and VSphereMachineTemplate to the CAPI spec file for each worker node group.


Right now, the CLI assumes that there will be only one group, treating the worker node group configuration array as a collection with a single element; as a result, the controller refers only to the first element of this array in different places in the code. We therefore need to perform the same operations in loops, including CAPI spec creation, cluster spec validation, etc. Once a CAPI spec is created with this approach, the workload cluster will be created with multiple worker node groups. We will use an array of CAPI objects to store the worker node group configurations and then generate the CAPI spec file using that array.
Member

Can you mention that one element of the array of worker node group configurations corresponds to a set of CAPI objects consisting of KubeadmConfig, MachineDeployment, and whatever else is there? Just gives us an understanding as to what to expect, even if there are repeated VSphereMachineConfigs for each of worker node groups.

@bnrjee (Contributor, Author) commented Dec 15, 2021

I am not sure if there is a single data structure encompassing all three, but these three types are well defined in the CAPI and CAPV code bases. What I plan to do is create a struct of these three elements and then create an array of that struct. I will update the design doc.
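The struct-of-three idea described above could be sketched as follows. The types here are simplified stand-ins for the real CAPI/CAPV API structs (which live in cluster-api and cluster-api-provider-vsphere), and all names are hypothetical.

```go
package main

import "fmt"

// Simplified stand-ins for the CAPI/CAPV API types.
type KubeadmConfigTemplate struct{ Name string }
type MachineDeployment struct {
	Name     string
	Replicas int
}
type VSphereMachineTemplate struct{ Name string }

// workerGroupObjects bundles the three per-group CAPI objects into one
// struct, as the comment above proposes; the struct name is hypothetical.
type workerGroupObjects struct {
	KubeadmConfigTemplate  KubeadmConfigTemplate
	MachineDeployment      MachineDeployment
	VSphereMachineTemplate VSphereMachineTemplate
}

// buildWorkerGroups creates one bundle per worker node group; the CAPI
// spec file would then be generated from this array.
func buildWorkerGroups(clusterName string, groupNames []string, replicas []int) []workerGroupObjects {
	out := make([]workerGroupObjects, 0, len(groupNames))
	for i, g := range groupNames {
		out = append(out, workerGroupObjects{
			KubeadmConfigTemplate:  KubeadmConfigTemplate{Name: clusterName + "-" + g},
			MachineDeployment:      MachineDeployment{Name: clusterName + "-" + g, Replicas: replicas[i]},
			VSphereMachineTemplate: VSphereMachineTemplate{Name: clusterName + "-" + g + "-worker-node-template"},
		})
	}
	return out
}

func main() {
	groups := buildWorkerGroups("eksa-test", []string{"md-0", "md-1"}, []int{3, 2})
	fmt.Println(len(groups), groups[1].MachineDeployment.Name)
}
```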

@bnrjee bnrjee mentioned this pull request Dec 15, 2021
@eks-distro-bot (Collaborator)

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: bnrjee, g-gaston, vivek-koppuru

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [bnrjee,g-gaston,vivek-koppuru]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@g-gaston (Member)

/lgtm

Labels
approved lgtm size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
6 participants