Add scale-in cooldown support #6

etaoins · 2019-03-03T00:05:02Z

This reimplements ASG scale-in cooldown inside the lambda. It takes two parameters which correspond to the existing ASG parameters in the elastic CI stack.

SCALE_IN_COOLDOWN_PERIOD is the cooldown time between scale in events. This defaults to the existing 5 minutes.
SCALE_IN_ADJUSTMENT is the maximum adjustment during scale-in events. Unlike the ASG we may scale in less if we calculate that the desired is closer than the adjustment. This defaults to the existing -1.

This cheats a bit by storing lastScaleInTime in a global variable. This means we'll forget about our cooldown during a cold start. This should happen fairly infrequently and just make us a bit aggressive about scaling it; it shouldn't affect correctness.

This reimplements ASG scale-in cooldown inside the lambda. It takes two parameters which correspond to the existing ASG parameters in the elastic CI stack. 1. `SCALE_IN_COOLDOWN_PERIOD` is the cooldown time between scale in events. This defaults to the existing 5 minutes. 2. `SCALE_IN_ADJUSTMENT` is the maximum adjustment during scale-in events. Unlike the ASG we may scale in less if we calculate that the desired is closer than the adjustment. This defaults to the existing -1. This cheats a bit by storing `lastScaleInTime` in a global variable. This means we'll forget about our cooldown during a cold start. This should happen fairly infrequently and just make us a bit aggressive about scaling it; it shouldn't affect correctness.

lox · 2019-03-25T01:49:09Z

This is awesome @etaoins! Somehow I totally missed this PR.

lox · 2019-03-25T01:49:32Z

I wonder how often there will be a cold start on a timed lambda 🤔

etaoins · 2019-03-25T01:57:27Z

I had a timed Lambda running for a few weeks that would cold start about once a day at an unpredictable time. It looks like even if the Lambda is warm its instance will sometimes disappear anyway. That's only a single datapoint that's about a year out of date; I wouldn't be surprised if there are other situations or configurations that could trigger it more often.

I don't think that's too bad here as it will only make a difference if a scale-in is occurring and we will still be limited by the maximum adjustment. My gut instinct would be to try this with local state and if there are some pathological cases it could then be persisted somewhere. I'm just worried about the complexity/reliability of setting up something like a DynamoDB to track this.

etaoins mentioned this pull request Mar 25, 2019

Scaler seems to bypass Lifecycle Hooks #8

Closed

lox merged commit e3e571a into buildkite:master Mar 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add scale-in cooldown support #6

Add scale-in cooldown support #6

etaoins commented Mar 3, 2019

lox commented Mar 25, 2019

lox commented Mar 25, 2019

etaoins commented Mar 25, 2019

Add scale-in cooldown support #6

Add scale-in cooldown support #6

Conversation

etaoins commented Mar 3, 2019

lox commented Mar 25, 2019

lox commented Mar 25, 2019

etaoins commented Mar 25, 2019