
aws-cni on vanilla K8s #508

Closed

tvalasek opened this issue Jun 18, 2019 · 7 comments

Comments

@tvalasek

tvalasek commented Jun 18, 2019

We build K8s clusters in AWS using CF/kubeadm from upstream vanilla K8s. For networking we use the aws-cni plugin. Our out-of-the-box setup is 3 masters (with etcd running on them) and 3 worker nodes.

The aws-cni plugin runs as a daemonset, and thus on all 6 members of the cluster (masters + workers).

Now, the behaviour I'm seeing is that the aws-cni plugin does not differentiate masters from workers.

The result (looking at the cni-metrics-helper stats) is that aws-cni creates new ENIs and a secondary IP address pool (warm pool) on demand on the master nodes as well, even though those have pod scheduling disabled by default (for obvious reasons). That leaves us with a huge number of unused warm-pool IPs on the master nodes that can never be allocated.
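
(Side note for anyone reproducing this: the warm pool held on a single node can also be inspected via the ipamd introspection endpoint of the aws-node pod on that node, assuming the default introspection port hasn't been changed:)

  # on a master node: dump the ENIs and secondary IPs that ipamd is currently holding
  curl -s http://localhost:61679/v1/enis | python -m json.tool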

I believe aws-cni was primarily built for EKS (where the control plane / masters and etcd are hidden from the EKS admin), but I wonder whether aws-cni has a feature to distinguish masters from workers (and thus apply different warm-pool behaviour) for those of us who decided not to use EKS.

E.g. labeling master nodes and annotating the aws-cni daemonset to act differently on labeled nodes (such as not creating new ENIs).
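
For illustration only, something along these lines (the master label is the standard one kubeadm applies; the daemonset annotation key below is made up, purely to sketch the idea):

  # masters already carry this label when bootstrapped with kubeadm
  kubectl get nodes -l node-role.kubernetes.io/master

  # hypothetical knob: tell aws-node not to grow the warm pool on labeled nodes
  kubectl -n kube-system annotate daemonset aws-node \
      example.aws-cni/skip-warm-pool-on-label=node-role.kubernetes.io/master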

Thanks

@mogren
Contributor

mogren commented Jun 18, 2019

Hi @tvalasek,

You are right that we have not yet optimized the plugin much for use outside of EKS, so there is more work to be done here. I think it sounds like a good idea to make the CNI more configurable in order to work better on the masters. Do you have any more concrete suggestions for what configuration options you would need?

@tvalasek
Author

How I see it, it would be something similar to what has already been done: #68

I'm not sure whether it is part of the aforementioned PR, but for our use case I would like a config option for the maximum number of ENIs that can be created for a given node (that way we could control how many warm-pool IPs can be created for that specific node).

Secondly, these config options could either work globally on all members of a cluster (as they do now) or only on nodes with specific labels at the K8s level (e.g. node-role.kubernetes.io) or specific tags at the AWS EC2 level. I reckon the latter sounds more like a generic AWS approach; I kinda like it.

Does it make sense?

@mogren
Contributor

mogren commented Jun 19, 2019

We do have the MAX_ENI setting already, but that would get applied to both master and worker nodes.

In this case, I guess it would be better to have a way to tag master nodes, or to make the CNI aware of common taints like node-role.kubernetes.io/master so it can behave differently in that case.
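
For reference, MAX_ENI is read from an environment variable on the aws-node container, so as things stand a single value applies to every node the daemonset runs on, roughly:

  # kube-system/aws-node daemonset, container spec (applies to all nodes today)
  env:
  - name: MAX_ENI
    value: "1"  # example cap; this would also limit the workers, which is the problem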

@danbeaulieu

@mogren is creating separate daemonsets an option? One for control plane nodes with the right tolerations and selectors, and one for non-control-plane nodes?
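
A minimal sketch of that idea, assuming the stock aws-node manifest is simply duplicated and the two copies are told apart by the kubeadm master label (the WARM_IP_TARGET value is just a placeholder):

  # aws-node-masters: tolerates the master taint, keeps the warm pool small
  spec:
    nodeSelector:
      node-role.kubernetes.io/master: ""
    tolerations:
    - key: node-role.kubernetes.io/master
      effect: NoSchedule
    containers:
    - name: aws-node
      env:
      - name: WARM_IP_TARGET
        value: "1"

  # aws-node-workers: default settings, kept off the masters
  spec:
    affinity:
      nodeAffinity:
        requiredDuringSchedulingIgnoredDuringExecution:
          nodeSelectorTerms:
          - matchExpressions:
            - key: node-role.kubernetes.io/master
              operator: DoesNotExist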

@jaypipes added the priority/P2 (Low priority, nice to have) and needs investigation labels and removed the priority/P2 (Low priority, nice to have) label on Oct 30, 2019
@jaypipes
Contributor

@tvalasek Hi Tomas, we're actually wondering what the specific feature request is for this. We're hoping you can elaborate. Are you asking for the CNI plugin to behave in a different way if it knows it's running on a master node (via inspection of, say, node-role annotation)? Or are you asking for a way to prevent the CNI plugin (via a daemonset taint/toleration) from running on master nodes?

@tvalasek
Author

@jaypipes Hi Jay.

Are you asking for the CNI plugin to behave in a different way if it knows it's running on a master node (via inspection of, say, node-role annotation)?

Yes, that's the correct one.

P.S.: If we prevented it from running on the master nodes, we would not be able to schedule any pods on them at all, because aws-cni is responsible for assigning IP addresses to pods.

@jaypipes
Contributor

Apologies for the long delay in getting back to you @tvalasek! This unfortunately dropped out of my email radar :(

The solution you are looking for is to modify the YAML manifest for the aws-k8s-cni Daemonset you are using to deploy the CNI plugin to include a nodeAffinity specification that prevents the Daemonset from being scheduled to specific nodes:

  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: kubernetes.io/role
            operator: NotIn
            values:
            - master

Depending on how you are installing Kubernetes, the "key" above may be different (it's the label that is applied to the node). The key shown above is the one that kops applies, for example.
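
If you are not sure which role label your installer applies, you can check directly on the nodes:

  # list node labels and look for the role label your installer sets
  kubectl get nodes --show-labels

  # kubeadm-based clusters typically use node-role.kubernetes.io/master
  kubectl get nodes -l node-role.kubernetes.io/master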

haouc added a commit to haouc/amazon-vpc-cni-k8s that referenced this issue Apr 23, 2021
jayanthvn pushed a commit that referenced this issue Apr 23, 2021
* Cherry-pick the PR #458 from eks charts

* bumping chart version to sync with #508 from eks-charts