kubelet evicts resources when observing disk pressure #39

derekwaynecarr · 2016-07-21T14:30:56Z

Description

As a cluster operator, I want the kubelet to monitor local disk usage and respond accordingly to maintain node stability. If the kubelet observes that available disk space and/or inodes (on rootfs or imagefs) are under pressure, it should proactively reclaim the affected resource by deleting images and logs and by evicting pods.
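To make the behavior described above concrete, here is a minimal sketch of the check-then-reclaim idea. It is not the kubelet's implementation: the names `FsStats`, `Threshold`, `pressure_signals`, and `reclaim_plan` are hypothetical, the threshold values are illustrative, and the reclaim steps are simply listed in the order the description mentions them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FsStats:
    """Snapshot of one filesystem, for example rootfs or imagefs (hypothetical type)."""
    name: str
    capacity_bytes: int
    available_bytes: int
    inodes: int
    inodes_free: int


@dataclass
class Threshold:
    """Minimum free fractions below which we treat the filesystem as under pressure.

    These are assumptions for illustration, not kubelet defaults.
    """
    min_available_fraction: float
    min_inodes_free_fraction: float


def pressure_signals(stats: FsStats, threshold: Threshold) -> List[str]:
    """Return the names of the signals that fall below their threshold."""
    signals = []
    if stats.capacity_bytes > 0:
        if stats.available_bytes / stats.capacity_bytes < threshold.min_available_fraction:
            signals.append(f"{stats.name}.available")
    if stats.inodes > 0:
        if stats.inodes_free / stats.inodes < threshold.min_inodes_free_fraction:
            signals.append(f"{stats.name}.inodesFree")
    return signals


def reclaim_plan(signals: List[str]) -> List[str]:
    """List reclaim actions when any signal fires; no action when there is no pressure."""
    if not signals:
        return []
    # Order follows the wording of the description; the real ordering may differ.
    return ["delete unused images", "delete logs", "evict pods"]


if __name__ == "__main__":
    rootfs = FsStats("rootfs", capacity_bytes=100, available_bytes=5,
                     inodes=1000, inodes_free=500)
    limits = Threshold(min_available_fraction=0.10, min_inodes_free_fraction=0.05)
    print(reclaim_plan(pressure_signals(rootfs, limits)))
```

Keeping the detection step (`pressure_signals`) separate from the response step (`reclaim_plan`) mirrors the split in the description between observing pressure and reclaiming resources.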

Progress Tracker

FEATURE_STATUS is used for feature tracking and to be updated by @kubernetes/feature-reviewers.
FEATURE_STATUS: IN_DEVELOPMENT

More advice:

Design

Once you get LGTM from a @kubernetes/feature-reviewers member, you can check this checkbox, and the reviewer will apply the "design-complete" label.

Coding

Use as many PRs as you need. Write tests in the same or different PRs, as is convenient for you.
As each PR is merged, add a comment to this issue referencing the PRs. Code goes in the http://github.com/kubernetes/kubernetes repository,
and sometimes http://github.com/kubernetes/contrib, or other repos.
When you are done with the code, apply the "code-complete" label.
When the feature has user docs, please add a comment mentioning @kubernetes/feature-reviewers and they will
check that the code matches the proposed feature and design, and that everything is done, and that there is adequate
testing. They won't do detailed code review: that already happened when your PRs were reviewed.
When that is done, you can check this box and the reviewer will apply the "code-complete" label.

Docs

Write user docs and get them merged in.
User docs go into http://github.com/kubernetes/kubernetes.github.io.
When the feature has user docs, please add a comment mentioning @kubernetes/docs.
When you get LGTM, you can check this checkbox, and the reviewer will apply the "docs-complete" label.

derekwaynecarr · 2016-07-21T14:33:15Z

/cc @kubernetes/sig-node

vishh · 2016-07-23T00:07:19Z

cc @ronnielai

timothysc · 2016-07-25T20:12:05Z

Given today's conversation, perhaps we could have a default policy of "should pro-actively reclaim related resource to maintain node stability by deleting images, logs, and evicting pods," with the potential of firing an administrator-controlled script, which could also apply to other resource dimensions.

/cc @nqn

derekwaynecarr · 2016-07-26T00:44:21Z

I think that is feature creep. The goal is disk. Other resource dimensions can be monitored outside of the Kubelet. We can discuss in sig-node.

derekwaynecarr · 2016-08-18T17:16:54Z

All PRs planned for 1.4 have merged in time for feature freeze. I will update the checklist next week.

janetkuo · 2016-09-02T18:08:56Z

@derekwaynecarr Are the docs ready? Please update the docs in https://github.com/kubernetes/kubernetes.github.io, and then add PR numbers and check the docs box in the issue description.

derekwaynecarr · 2016-09-03T00:59:30Z

Docs are coming next week unless @ronnielai has anything yet? I think we can update the existing eviction doc pretty quickly from the design doc. They are pretty close for a reason :-)

jaredbhatti · 2016-09-07T21:02:33Z

@derekwaynecarr Can you add your docs PR here when you have it ready?

vishh · 2016-09-07T21:07:35Z

@derekwaynecarr My assumption is that this feature is alpha or beta in v1.4. I hope the docs will reflect that!

derekwaynecarr · 2016-09-09T16:48:02Z

@vishh @jaredbhatti docs PR: kubernetes/website#1196

derekwaynecarr · 2016-09-09T16:48:39Z

@kubernetes/docs -- added feature doc pr kubernetes/website#1196

jaredbhatti · 2016-09-14T19:41:56Z

@derekwaynecarr Is this feature Stable or Beta?

idvoretskyi · 2016-09-21T21:57:49Z

@derekwaynecarr can you provide us with the actual feature status?
Thanks.

derekwaynecarr · 2016-09-23T04:21:25Z

It's not alpha. I would say beta when dealing with inodes, but stable for disk capacity.

fejta-bot · 2018-01-02T15:01:38Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

fejta-bot · 2018-02-07T17:33:41Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
/remove-lifecycle stale

ezware · 2018-02-12T11:55:19Z

How do I disable disk pressure monitoring?

fejta-bot · 2018-03-14T12:09:56Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

derekwaynecarr changed the title ~~kubelet evicts resources when observing disk pressure to maintain node stability~~ kubelet evicts resources when observing disk pressure Jul 21, 2016

idvoretskyi assigned derekwaynecarr Jul 21, 2016

idvoretskyi added this to the v1.4 milestone Jul 21, 2016

vishh mentioned this issue Jul 23, 2016

Handle out-of-disk and refactor intelligent disk management #49

Closed

22 tasks

idvoretskyi added the sig/node Categorizes an issue or PR as relevant to SIG Node. label Aug 4, 2016

goltermann added beta-in-1.4 and removed beta-in-1.4 labels Aug 22, 2016

idvoretskyi removed the (deprecated label - do not use) stable-in-1.4 label Aug 8, 2017

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 2, 2018

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Feb 7, 2018

k8s-ci-robot closed this as completed Mar 14, 2018

ingvagabund pushed a commit to ingvagabund/enhancements that referenced this issue Apr 2, 2020

Merge pull request kubernetes#39 from deads2k/generic-related-resource

f7fd975

make accept generic .status.relatedResources
