
Storage related auto-scaling issues #4459

Closed
barkbay opened this issue Apr 30, 2021 · 3 comments
Labels
autoscaling >bug Something isn't working

Comments

barkbay commented Apr 30, 2021

This issue lists a few considerations that should be taken into account to improve the autoscaling controller's handling of storage resources:

  1. ECK doesn't know, or can't predict, the capacity actually available on a volume, because some filesystems reserve space (mostly an ext4 concern, 5% by default; only a few MB for XFS); see the sketch after this list.
  2. For data tiers (except possibly the frozen tier), the total storage capacity required by the autoscaling API is always at least the total observed storage capacity of all the Pods in the tier: required_capacity.total = Σ(current_capacity.node.storage) + unassigned_data.
  3. K8S may bind a volume with a larger capacity than the one claimed (Volume Capacity > Volume Claim).
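As a rough illustration of the first point, here is a minimal sketch (not ECK code, Linux-only) that compares the capacity claimed in the PVC with what a non-root process such as Elasticsearch can actually use once the filesystem reserved blocks are subtracted. The data path and the 1Gi claim are illustrative assumptions.

```go
package main

import (
	"fmt"
	"syscall"
)

func main() {
	const dataPath = "/usr/share/elasticsearch/data" // default data mount in the Elasticsearch image
	claimed := uint64(1 << 30)                       // 1Gi claimed in the PVC (assumption for the example)

	var st syscall.Statfs_t
	if err := syscall.Statfs(dataPath, &st); err != nil {
		panic(err)
	}
	bsize := uint64(st.Bsize)
	total := st.Blocks * bsize                 // raw filesystem size
	reserved := (st.Bfree - st.Bavail) * bsize // blocks reserved for root (ext4: 5% by default)
	usable := total - reserved                 // what Elasticsearch can actually fill

	fmt.Printf("claimed=%d fs_total=%d usable=%d reserved=%d\n", claimed, total, usable, reserved)
}
```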

If not handled properly, these considerations may lead to three issues:

  1. Because of the filesystem reserved capacity, the capacity available to Elasticsearch might be smaller than the one in the K8S claim, which may delay a scale-up event. We should compare the required capacity to the "observed" capacity reported by the autoscaling API to decide when a scale-up must be triggered.
  2. If the actual capacity is higher than the claimed one, Elasticsearch reports that value as the required one (even though it is technically not required), which can lead to cascading scale-up events, up to the limit specified by the user. The reported value can also exceed the user-specified limit, in which case spurious HorizontalScalingLimitReached events are generated.
  3. If the actual capacity of a volume is greater than the claim, then the nodes may hold more data than the maximum specified in the autoscaling specification, which may lead to overloaded nodes. For example, assuming the following autoscaling policy:
{
    "name": "data",
    "roles": ["data", "ingest", "transform"],
    "resources": {
        "nodeCount": { "min": 2, "max": 5 },
        "memory": { "min": "2Gi", "max": "6Gi" },
        "storage": { "min": "1Gi",  "max": "3Gi" }
    }
}

Say the 1Gi claims have each been bound to a 1Ti volume; chances are that 2Gi of memory is not enough to handle that amount of data. We should maybe notify the user that the total storage capacity is "unexpected", and perhaps immediately scale the memory up to 6Gi?
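A hedged sketch of that idea, not the ECK implementation: if the bound volume is larger than the storage.max declared in the autoscaling policy, any memory value derived from storage is unreliable, so pin memory to memory.max and flag the situation. The type and function names below are assumptions for the example.

```go
package main

import (
	"fmt"

	"k8s.io/apimachinery/pkg/api/resource"
)

type policyLimits struct {
	MaxStorage resource.Quantity // storage.max from the autoscaling policy
	MaxMemory  resource.Quantity // memory.max from the autoscaling policy
}

// adjustMemory returns the memory to request and whether the observed storage
// exceeds the declared maximum (in which case a warning could be surfaced).
func adjustMemory(limits policyLimits, observedStorage, scaledMemory resource.Quantity) (resource.Quantity, bool) {
	if observedStorage.Cmp(limits.MaxStorage) > 0 {
		// The volume is bigger than anything the policy allows for:
		// fall back to the memory maximum rather than a storage-derived value.
		return limits.MaxMemory, true
	}
	return scaledMemory, false
}

func main() {
	limits := policyLimits{
		MaxStorage: resource.MustParse("3Gi"),
		MaxMemory:  resource.MustParse("6Gi"),
	}
	// A 1Gi claim bound to a 1Ti volume, as in the example above.
	mem, exceeded := adjustMemory(limits, resource.MustParse("1Ti"), resource.MustParse("2Gi"))
	fmt.Printf("memory=%s storageLimitExceeded=%t\n", mem.String(), exceeded)
}
```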

barkbay commented May 3, 2021

  3. If the actual capacity of a volume is greater than the claim, then the nodes may hold more data than the maximum specified in the autoscaling specification, which may lead to overloaded nodes.

I'm working on a PR to at least warn the user about that situation (sketched below). But I'm wondering if we should reconsider the choice we made of scaling memory according to storage.
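Roughly, the warning could be surfaced as a Kubernetes event on the Elasticsearch resource. This is only a hedged sketch; the reason string, message, and function are assumptions, not the contents of the actual PR.

```go
package sketch

import (
	corev1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/api/resource"
	"k8s.io/apimachinery/pkg/runtime"
	"k8s.io/client-go/tools/record"
)

// warnUnexpectedCapacity emits a Warning event on the Elasticsearch resource
// when a bound volume is larger than the autoscaling storage maximum.
func warnUnexpectedCapacity(recorder record.EventRecorder, es runtime.Object, claimName string, bound, maxStorage resource.Quantity) {
	if bound.Cmp(maxStorage) <= 0 {
		return // bound capacity is within the declared storage limit
	}
	recorder.Eventf(es, corev1.EventTypeWarning, "UnexpectedVolumeCapacity",
		"volume bound to claim %s has capacity %s, greater than the autoscaling storage max %s",
		claimName, bound.String(), maxStorage.String())
}
```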

pebrc commented Jun 2, 2021

Good to close @barkbay ?

barkbay commented Jun 2, 2021

Yes, sorry, missed this one.

barkbay closed this as completed Jun 2, 2021