Elasticsearch should auto-determine appropriate machine memory settings #65025

mark-vieira · 2020-11-12T23:22:34Z

Background

Currently we expect our on-prem users to set the heap size and the allocation of memory to ML appropriately. We also have Cloud explicitly set the heap based on a number of factors. Both cases create a bad experience. Our users should not need to learn the internal memory needs of a node, including which features consume heap vs. native memory. This places a heavy burden on users getting started in production or upgrading, and it is an unrealistic ask of the users we are hoping will adopt Elasticsearch.

Since Elasticsearch itself knows the typical demands of the various node roles on heap vs. native memory, it should set the heap size and other relevant memory settings itself, depending on the node role and the memory available.
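To make the idea concrete, here is a minimal sketch of heap sizing as a function of node roles and available memory. The role profiles, percentages, and the 31 GiB cap are illustrative assumptions, not values taken from this issue or from the eventual implementation; `choose_heap_bytes` and `jvm_heap_options` are hypothetical names.

```python
GIB = 1024 ** 3
MIB = 1024 ** 2

# Hypothetical heap percentages per role profile (illustrative only).
HEAP_PERCENT = {"master_only": 60, "ml_only": 30, "default": 50}

# Assumed upper bound on heap, chosen here only to show how a cap would apply.
MAX_HEAP_BYTES = 31 * GIB


def choose_heap_bytes(roles, available_bytes):
    """Pick a heap size from the node's roles and the memory available."""
    roles = set(roles)
    if roles == {"master"}:
        profile = "master_only"
    elif roles == {"ml"}:
        profile = "ml_only"
    else:
        profile = "default"
    # Integer arithmetic avoids floating-point rounding on byte counts.
    heap = available_bytes * HEAP_PERCENT[profile] // 100
    return min(heap, MAX_HEAP_BYTES)


def jvm_heap_options(heap_bytes):
    """Render the heap size as matching -Xms/-Xmx JVM options, in MiB."""
    heap_mib = heap_bytes // MIB
    return [f"-Xms{heap_mib}m", f"-Xmx{heap_mib}m"]


if __name__ == "__main__":
    heap = choose_heap_bytes({"master"}, 10 * GIB)
    print(jvm_heap_options(heap))
```

Keeping the percentages in one table means a new role profile only needs a new entry, and the user never has to supply a heap size by hand.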

Additionally, to make autoscaling successful, the cluster needs to be able to ask Cloud for nodes with a specified total memory. To calculate the total memory required for ML nodes, we need to start from the memory the native processes require to satisfy the existing jobs and work back to the total memory required for the container.
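A rough sketch of that "work back" calculation follows. It assumes a simple model in which container memory is split into JVM heap (a fixed percentage of the total), a fixed overhead reserve, and native memory for ML jobs. The model, the default values, and the name `required_total_bytes` are assumptions for illustration only.

```python
MIB = 1024 ** 2


def required_total_bytes(native_bytes, heap_percent=30, overhead_bytes=200 * MIB):
    """Smallest container size whose non-heap, non-overhead share covers native_bytes.

    Assumed model: total = heap + overhead + native, with heap = heap_percent% of total.
    Solving for total gives (native + overhead) * 100 / (100 - heap_percent).
    """
    if not 0 <= heap_percent < 100:
        raise ValueError("heap_percent must be in [0, 100)")
    numerator = (native_bytes + overhead_bytes) * 100
    denominator = 100 - heap_percent
    return -(-numerator // denominator)  # ceiling division on integers


def native_bytes_available(total_bytes, heap_percent=30, overhead_bytes=200 * MIB):
    """Forward direction of the same model, useful as a sanity check."""
    heap = total_bytes * heap_percent // 100
    return total_bytes - heap - overhead_bytes


if __name__ == "__main__":
    total = required_total_bytes(500 * MIB)
    print(total // MIB, "MiB total for 500 MiB of native memory")
```

Rounding the total up guarantees that running the forward calculation on the result yields at least the native memory that was requested.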

Proposal

Quite simply, the function that determines appropriate memory settings for heap and ML takes two inputs: the role(s) of the node and the available system/container memory. The latter we already have (or can easily get), but our JVM ergonomics code currently runs before settings parsing, so we do not know which roles are applied to the given node. The existing settings parsing logic is complex and rather heavyweight, as it requires loading all installed plugins (since they might register their own settings). This is overkill for our purposes, so the way forward should be to implement only the minimum logic required to parse elasticsearch.yml and pull out the node roles.
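As an illustration of how small that minimum logic could be, the sketch below pulls `node.roles` out of the text of an elasticsearch.yml without loading any plugins or a full settings framework. It is a deliberately naive line-based reader, not a YAML parser: it assumes the flat `node.roles:` key form, treats a bare scalar as a single role, and ignores nested keys and other settings. The function name `parse_node_roles` is hypothetical.

```python
def _clean(token):
    """Strip whitespace and surrounding quotes from a role name."""
    return token.strip().strip("'\"")


def parse_node_roles(yaml_text):
    """Return the list of roles from a flat `node.roles` key, or None if unset."""
    lines = yaml_text.splitlines()
    for index, raw in enumerate(lines):
        line = raw.split("#", 1)[0].rstrip()  # naive comment stripping
        if not line.startswith("node.roles:"):
            continue
        value = line[len("node.roles:"):].strip()
        if value.startswith("["):
            # Flow style: node.roles: [ master, data ]
            inner = value.strip("[]")
            return [_clean(r) for r in inner.split(",") if r.strip()]
        if value:
            # Bare scalar, treated here as a single role (assumption).
            return [_clean(value)]
        # Block style: one "- role" item per following line.
        roles = []
        for follow in lines[index + 1:]:
            item = follow.split("#", 1)[0].strip()
            if not item:
                continue
            if not item.startswith("- "):
                break
            roles.append(_clean(item[2:]))
        return roles
    return None


if __name__ == "__main__":
    sample = "cluster.name: demo\nnode.roles: [ master, data ]\n"
    print(parse_node_roles(sample))
```

Returning `None` when the key is absent lets the caller distinguish "no roles configured" from an explicitly empty list such as `node.roles: []`.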

elasticmachine · 2020-11-12T23:22:35Z

Pinging @elastic/es-core-infra (:Core/Infra/Core)

elasticmachine · 2020-11-12T23:22:42Z

Pinging @elastic/es-delivery (:Delivery/Cloud)

rjernst · 2020-12-17T00:39:06Z

closed by #65905

mark-vieira added the :Core/Infra/Core Core issues without another label label Nov 12, 2020

elasticmachine added the Team:Core/Infra Meta label for core/infra team label Nov 12, 2020

mark-vieira added the :Delivery/Cloud Cloud-specific packaging and deployment label Nov 12, 2020

elasticmachine added the Team:Delivery Meta label for Delivery team label Nov 12, 2020

mark-vieira self-assigned this Nov 12, 2020

barkbay mentioned this issue Dec 2, 2020

[Meta] Autoscaling elastic/cloud-on-k8s#3999

Closed

13 tasks

rjernst added the needs:triage Requires assignment of a team area label label Dec 3, 2020

rjernst closed this as completed Dec 17, 2020

rjernst removed the needs:triage Requires assignment of a team area label label Dec 17, 2020

mark-vieira mentioned this issue Dec 17, 2020

Update heap setting documentation in light of machine dependent heap #66567

Merged

barkbay mentioned this issue Jan 6, 2021

Elasticsearch >= 7.11 sets appropriate memory settings automatically elastic/cloud-on-k8s#4089

Closed
