kubelet: change image-gc-high-threshold below docker dm.min_free_space #40432

sjenning · 2017-01-25T17:40:09Z

docker dm.min_free_space defaults to 10%, which "specifies the min free space percent in a thin pool require for new device creation to succeed....Whenever a new a thin pool device is created (during docker pull or during container creation), the Engine checks if the minimum free space is available. If sufficient space is unavailable, then device creation fails and any relevant docker operation fails." [1]

This setting is preventing the storage usage to cross the 90% limit. However, image GC is expected to kick in only beyond image-gc-high-threshold. The image-gc-high-threshold has a default value of 90%, and hence GC never triggers. If image-gc-high-threshold is set to a value lower than (100 - dm.min_free_space)%, GC triggers.

xref https://bugzilla.redhat.com/show_bug.cgi?id=1408309

changed kubelet default image-gc-high-threshold to 85% to resolve a conflict with default settings in docker that prevented image garbage collection from resolving low disk space situations when using devicemapper storage.

@derekwaynecarr @sdodson @rhvgoyal

k8s-reviewable · 2017-01-25T17:40:19Z

This change is

dchen1107 · 2017-01-25T22:28:13Z

The change makes sense to me, but @dashpole, could you please take another look at this?

dashpole · 2017-01-25T22:53:39Z

@sjenning do you think this is the cause of this issue?. I noticed many people were using devicemapper, and some were wondering why image garbage collection hadn't kicked in.

The change makes sense to me as well.

Would it also make sense to decrease the ImageGCLowThresholdPercent?

I am not actually sure what kubemark is, but I found a default for ImageGCHighThresholdPercent here as well. Should we change that value to match?

dchen1107 · 2017-01-25T23:16:44Z

@dashpole Yes, that is what I suspected and looped you here since you looked at #32542 lately.

sjenning · 2017-01-26T15:25:53Z

@dashpole could be. The issues seems to have a number of different cases in it. The case this PR handles is when Docker refuses to pull/start a container due to low dm thin pool space, yet the node doesn't report disk pressure and doesn't do image GC, effectively wedging the node.

dchen1107 · 2017-02-01T20:58:11Z

/approve

I approve this change, but expect @dashpole review the code more. Thanks!

derekwaynecarr · 2017-02-01T20:59:01Z

This is /lgtm

dashpole · 2017-02-01T21:02:44Z

@dchen1107 this lgtm as well

derekwaynecarr · 2017-02-01T21:03:50Z

if and when we look to define a default for imagefs based eviction thresholds, we need to keep this in mind as well.

dashpole · 2017-02-01T21:06:17Z

@derekwaynecarr, why is that? The image-gc-high-threshold modified in this PR only affects periodic garbage collection. Eviction passes this check and directly triggers deletion of all unused images IIRC.

vishh · 2017-02-01T22:28:49Z

I wonder if it is not possible to change the docker setting to 8% for example? Given that disk reclamation is slow, I suspect that even with this PR, we will see image pull failures.

vishh · 2017-02-01T22:29:19Z

I don't see why we have to change kube defaults based on the behavior of a single docker storage driver.

vishh · 2017-02-01T22:31:17Z

/approve cancel

k8s-github-robot · 2017-02-01T22:31:23Z

[APPROVALNOTIFIER] This PR is APPROVED

The following people have approved this PR: dchen1107, sjenning

Needs approval from an approver in each of these OWNERS Files:

~~cmd/kubelet/app/OWNERS~~ [dchen1107]
~~pkg/apis/componentconfig/OWNERS~~ [dchen1107]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

derekwaynecarr · 2017-02-01T22:52:21Z

@vishh -- should not the default value kube selects work reasonably well across the popular set of container runtime storage drivers out of the box? if operators wanted to be more aggressive for a particular storage driver, then they can modify the default higher? it seems anti-user to have a default that doesnt work well on popular options by default?

derekwaynecarr · 2017-02-01T22:58:51Z

to phrase it another way, we should select a default for OSS version of k8s that works for the broadest number of users, and causes the least amount of noise in the form of related issue creation when things just do not work.

vishh · 2017-02-01T23:01:10Z

I'd like to understand the rationale behind docker's 10% limit before having kube *just* embrace that default.

…

On Wed, Feb 1, 2017 at 2:59 PM, Derek Carr ***@***.***> wrote: to phrase it another way, we should select a default for OSS version of k8s that works for the broadest number of users, and causes the least amount of noise in the form of related issue creation when things just do not work. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#40432 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AGvIKAhpT5t5gqy3NU5cM2mjHTa_IxEkks5rYQ5GgaJpZM4LtyEm> .

derekwaynecarr · 2017-02-02T00:17:02Z

@vishh -- prior to sending this PR, @sjenning and I spoke with @mrunalp and @rhvgoyal to understand why the 10% value for dm.min_free_space value was chosen, and it was chosen because they felt it was a sensible default, and reducing it below 10% felt too low in their opinion. They can weigh in w/ more details if they feel I am misrepresenting our discussion, but it seemed fair.

My prior point remains which is that I think OSS k8s defaults should work with the broadest set of container run-time storage driver options out of the box just to reduce the amount of issues/noise/support in the broader community for when something does not work. Are we more likely to get issues that we are not utlizing an extra 5% of disk, or that something just doesn't work on Centos/Fedora/etc when using POSIX compliant storage drivers? We are not going to get the existing versions of docker out in the wild to all change.

vishh · 2017-02-02T17:32:04Z

@derekwaynecarr k8s has a lot of workarounds to integrate with docker. I would like to understand if this is one such workaround, in which case it would be better to have a plan to try and fix docker in the future, or if it is a necessary permanent change. Just curious, can docker's default be 5% for example, in which case k8s would prevent the node from ever reach that low threshold? I agree that we want kube to work by default across most distros.

…

On Wed, Feb 1, 2017 at 4:17 PM, Derek Carr ***@***.***> wrote: @vishh <https://github.com/vishh> -- prior to sending this PR, @sjenning <https://github.com/sjenning> and I spoke with @mrunalp <https://github.com/mrunalp> and @rhvgoyal <https://github.com/rhvgoyal> to understand why the 10% value for dm.min_free_space value was chosen, and it was chosen because they felt it was a sensible default, and reducing it below 10% felt too low in their opinion. They can weigh in w/ more details if they feel I am misrepresenting our discussion, but it seemed fair. My prior point remains which is that I think OSS k8s defaults should work with the broadest set of container run-time storage driver options out of the box just to reduce the amount of issues/noise/support in the broader community for when something does not work. Are we more likely to get issues that we are not utlizing an extra 5% of disk, or that something just doesn't work on Centos/Fedora/etc when using POSIX compliant storage drivers? We are not going to get the existing versions of docker out in the wild to all change. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#40432 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AGvIKBd2P3FQVn0cDrrov8XMtC3e_dPOks5rYSCfgaJpZM4LtyEm> .

vishh · 2017-02-18T03:30:20Z

Ping. This PR is still not merged.

derekwaynecarr · 2017-02-24T03:45:04Z

@vishh - for the moment, we are carrying a reduced default in openshift. for k8s, i am still not sure how to proceed. as a default, i think 85% in OSS is not the worst. ideally, we would have a way to ask a container runtime if it was doing any behind the scenes image management configuration behavior like this and dynamically reduce or warn in the kubelet our settings in response if the one-size fits all default is not portable.

vishh · 2017-02-24T03:55:46Z

Instead of attempting to understand each storage driver, can we instead treat the capacity of a devicemapper partition to be 90% of its actual capacity?

…

On Thu, Feb 23, 2017 at 7:45 PM, Derek Carr ***@***.***> wrote: @vishh <https://github.com/vishh> - for the moment, we are carrying a reduced default in openshift. for k8s, i am still not sure how to proceed. as a default, i think 85% in OSS is not the worst. ideally, we would have a way to ask a container runtime if it was doing any behind the scenes image management configuration behavior like this and dynamically reduce or warn in the kubelet our settings in response if the one-size fits all default is not portable. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#40432 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AGvIKJvCjAOR19FDSdfQ6YPwgFnawuoQks5rflJbgaJpZM4LtyEm> .

sjenning · 2017-04-03T15:34:40Z

@k8s-bot cri e2e test this
@k8s-bot kops aws e2e test this

k8s-ci-robot · 2017-04-03T17:01:38Z

@sjenning: The following test(s) failed:

Test name	Commit	Details	Rerun command
Jenkins CRI GCE e2e	`0247a9a`	link	`@k8s-bot cri e2e test this`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

k8s-github-robot · 2017-04-03T17:51:32Z

Automatic merge from submit-queue

kubelet: change image-gc-threshold below docker dm.min_free_space

0247a9a

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Jan 25, 2017

k8s-github-robot assigned dchen1107 Jan 25, 2017

k8s-github-robot added size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. release-note Denotes a PR that will be considered when it comes time to generate release notes. labels Jan 25, 2017

dchen1107 requested a review from dashpole January 25, 2017 22:27

dashpole mentioned this pull request Jan 27, 2017

Running out of disk space but no warnings from kubectl describe nodes #32542

Closed

k8s-github-robot assigned thockin and Random-Liu and unassigned dchen1107 and thockin Jan 30, 2017

apelisse assigned dchen1107 and thockin Jan 31, 2017

k8s-github-robot assigned erictune Jan 31, 2017

dchen1107 assigned vishh and unassigned thockin, Random-Liu and erictune Feb 1, 2017

derekwaynecarr self-assigned this Feb 1, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 1, 2017

derekwaynecarr added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 1, 2017

dashpole approved these changes Feb 1, 2017

View reviewed changes

sjenning mentioned this pull request Feb 1, 2017

UPSTREAM: <carry>: kubelet: change image-gc-high-threshold below docker dm.min_free_space openshift/origin#12762

Merged

vishh removed the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 1, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 1, 2017

vishh added the do-not-merge DEPRECATED. Indicates that a PR should not merge. Label can only be manually applied/removed. label Feb 1, 2017

vishh removed the do-not-merge DEPRECATED. Indicates that a PR should not merge. Label can only be manually applied/removed. label Feb 27, 2017

k8s-github-robot merged commit 6f3e5ba into kubernetes:master Apr 3, 2017

sjenning deleted the imagegc-default branch August 16, 2017 02:18

SaaldjorMike mentioned this pull request Sep 6, 2017

Install image-gc configurable will sane defaults Azure/acs-engine#1410

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kubelet: change image-gc-high-threshold below docker dm.min_free_space #40432

kubelet: change image-gc-high-threshold below docker dm.min_free_space #40432

sjenning commented Jan 25, 2017

k8s-reviewable commented Jan 25, 2017

dchen1107 commented Jan 25, 2017

dashpole commented Jan 25, 2017

dchen1107 commented Jan 25, 2017

sjenning commented Jan 26, 2017

dchen1107 commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

dashpole commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

dashpole commented Feb 1, 2017

vishh commented Feb 1, 2017

vishh commented Feb 1, 2017

vishh commented Feb 1, 2017

k8s-github-robot commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

vishh commented Feb 1, 2017 via email

derekwaynecarr commented Feb 2, 2017

vishh commented Feb 2, 2017 via email

vishh commented Feb 18, 2017

derekwaynecarr commented Feb 24, 2017

vishh commented Feb 24, 2017 via email

sjenning commented Apr 3, 2017

k8s-ci-robot commented Apr 3, 2017 •

edited

Loading

k8s-github-robot commented Apr 3, 2017

kubelet: change image-gc-high-threshold below docker dm.min_free_space #40432

kubelet: change image-gc-high-threshold below docker dm.min_free_space #40432

Conversation

sjenning commented Jan 25, 2017

k8s-reviewable commented Jan 25, 2017

dchen1107 commented Jan 25, 2017

dashpole commented Jan 25, 2017

dchen1107 commented Jan 25, 2017

sjenning commented Jan 26, 2017

dchen1107 commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

dashpole commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

dashpole commented Feb 1, 2017

vishh commented Feb 1, 2017

vishh commented Feb 1, 2017

vishh commented Feb 1, 2017

k8s-github-robot commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

derekwaynecarr commented Feb 1, 2017

vishh commented Feb 1, 2017 via email

derekwaynecarr commented Feb 2, 2017

vishh commented Feb 2, 2017 via email

vishh commented Feb 18, 2017

derekwaynecarr commented Feb 24, 2017

vishh commented Feb 24, 2017 via email

sjenning commented Apr 3, 2017

k8s-ci-robot commented Apr 3, 2017 • edited Loading

k8s-github-robot commented Apr 3, 2017

k8s-ci-robot commented Apr 3, 2017 •

edited

Loading