Maintain the right number of ENIs and its IP addresses in WARM-IP pool #169

liwenwu-amazon · 2018-09-05T21:57:25Z

Description of changes:
Today, curMaxAddrsPerENI is set to 1 initially and it is only set to the correct value after allocating the 1st ENI. This can cause nodeIPPoolLow() and nodeIPPoolTooHIgh() returns incorrect value.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

nckturner · 2018-09-06T02:19:28Z

ipamd/ipamd.go

+	c.currentMaxAddrsPerENI, err = c.awsClient.GetENIipLimit()
+
+	if err != nil {
+		c.currentMaxAddrsPerENI = int64(len(ec2Addrs))


Why is it better to use the result from c.awsClient.GetENIipLimit() then int64(len(ec2Addrs))? Can you add a comment why we only do one when the other fails? Does anything not work in that case?

nckturner

LGTM!

edmorley · 2019-07-01T14:47:56Z

Hi!

The description of this PR at first glance makes it seem like a small bug fix, however it in fact appears to be the cause of CNI plugin v1.2.0+ having double the IP address usage requirements of the prior v1.1.0 (under our workflow at least), which caused us to run out of free IP addresses until we adjusted our subnets accordingly (which in itself required an EKS cluster rebuild/migration).

Specifically after this change, our worker nodes went from being allocated one ENI (which was only partly used, since we use smaller instances with fewer pods per instance) to always having two ENIs allocated (with the second being entirely unused), with both ENIs having the full 10 IP addresses allocated, doubling usage. (Whilst we plan on trying out WARM_IP_TARGET in the future, it sounds like it will need some testing/tuning, so we had stuck with the out of the box WARM_ENI_TARGET for now.)

As such, please could significant changes like these (that have the chance of being breaking) be called out more clearly in the PR description and changelog in the future? :-)

mogren · 2019-07-01T20:36:55Z

@edmorley Thanks, good call-out. And like you said, WARM_IP_TARGET is another quite significant change, and it will only work well if the pod-churn is not too high. If pods are scheduled and terminated at a high and variable rate, the calls to EC2 will get throttled, and attaching new IPs and ENIs will take a long time.

liwenwu-amazon requested a review from nckturner September 5, 2018 21:57

liwenwu-amazon changed the title ~~Make sure maxAddrsPerENI is set correctly.~~ Maintain the right number of ENIs and its IP addresses in WARM-IP pool Sep 5, 2018

liwenwu-amazon force-pushed the cronjob branch from 657f953 to a766201 Compare September 5, 2018 22:59

Deepak-Vohra mentioned this pull request Sep 6, 2018

Inconsistent ENI count attached to instances running 1.1.0 #154

Closed

nckturner reviewed Sep 6, 2018

View reviewed changes

liwenwu-amazon force-pushed the cronjob branch from a766201 to 8071f72 Compare September 6, 2018 03:08

Make sure maxAddrsPerENI is set correctly.

860017a

liwenwu-amazon force-pushed the cronjob branch from 8071f72 to 860017a Compare September 6, 2018 16:28

nckturner approved these changes Sep 6, 2018

View reviewed changes

liwenwu-amazon merged commit 7332580 into aws:master Sep 6, 2018

liwenwu-amazon mentioned this pull request Sep 6, 2018

Minutely cronjob fails after swapping out workers with new ASG #155

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Maintain the right number of ENIs and its IP addresses in WARM-IP pool #169

Maintain the right number of ENIs and its IP addresses in WARM-IP pool #169

liwenwu-amazon commented Sep 5, 2018 •

edited

Loading

nckturner Sep 6, 2018

nckturner left a comment

edmorley commented Jul 1, 2019

mogren commented Jul 1, 2019

Maintain the right number of ENIs and its IP addresses in WARM-IP pool #169

Maintain the right number of ENIs and its IP addresses in WARM-IP pool #169

Conversation

liwenwu-amazon commented Sep 5, 2018 • edited Loading

nckturner Sep 6, 2018

Choose a reason for hiding this comment

nckturner left a comment

Choose a reason for hiding this comment

edmorley commented Jul 1, 2019

mogren commented Jul 1, 2019

liwenwu-amazon commented Sep 5, 2018 •

edited

Loading