
Rescheduler #109

Closed · 23 tasks
davidopp opened this issue Oct 1, 2016 · 34 comments
Labels: help wanted · lifecycle/rotten · sig/scheduling

Comments

@davidopp (Member) commented Oct 1, 2016

Feature Description

  • One-line feature description (can be used as a release note):
  • Primary contact (assignee): @davidopp @aveshagarwal
  • Responsible SIGs: @kubernetes/sig-scheduling-feature-requests
  • Design proposal link (community repo):
  • Reviewer(s) - (for LGTM) recommend having 2+ reviewers (at least one from code-area OWNERS file) agreed to review. Reviewers from multiple companies preferred:
  • Approver (likely from SIG/area to which feature belongs):
  • Feature target (which target equals to which milestone):
    • Alpha release target (x.y)
    • Beta release target (x.y)
    • Stable release target (x.y)
Description

A component that evicts pods (that are managed by a controller) to achieve some set of objectives.

This feature needs a detailed design doc; an initial design proposal is here.
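
For illustration only, the pod-eligibility side of such a component might look roughly like the Go sketch below; the types, field names, and rules are simplified assumptions for this issue, not actual rescheduler/descheduler code:

```go
// Simplified sketch of an eviction-eligibility check a rescheduler might
// apply. Pod is a stand-in type, not the real Kubernetes API object.
package main

import "fmt"

type Pod struct {
	Name            string
	Namespace       string
	OwnerKind       string // e.g. "ReplicaSet", "DaemonSet", or "" for a bare pod
	IsMirrorPod     bool   // static pods appear as mirror pods
	HasLocalStorage bool
}

// evictable reports whether a pod is a reasonable eviction candidate:
// it must be managed by a controller (so it gets recreated elsewhere)
// and must not be a DaemonSet pod, a mirror/static pod, or a pod using
// local storage.
func evictable(p Pod) bool {
	if p.OwnerKind == "" || p.OwnerKind == "DaemonSet" {
		return false
	}
	if p.IsMirrorPod || p.HasLocalStorage {
		return false
	}
	return true
}

func main() {
	pods := []Pod{
		{Name: "web-1", Namespace: "default", OwnerKind: "ReplicaSet"},
		{Name: "node-exporter-x", Namespace: "kube-system", OwnerKind: "DaemonSet"},
		{Name: "etcd-node1", Namespace: "kube-system", IsMirrorPod: true},
	}
	for _, p := range pods {
		fmt.Printf("%s/%s evictable=%v\n", p.Namespace, p.Name, evictable(p))
	}
}
```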

Progress Tracker

  • Before Alpha
    • Write and maintain draft quality doc
      • During development keep a doc up-to-date about the desired experience of the feature and how someone can try the feature in its current state. Think of it as the README of your new feature and a skeleton for the docs to be written before the Kubernetes release. Paste link to Google Doc: DOC-LINK
    • Design Approval
      • Design Proposal. This goes under docs/proposals. Doing a proposal as a PR allows line-by-line commenting from community, and creates the basis for later design documentation. Paste link to merged design proposal here: PROPOSAL-NUMBER
      • Decide which repo this feature's code will be checked into. Not everything needs to land in the core kubernetes repo. REPO-NAME
      • Initial API review (if API). Maybe same PR as design doc. PR-NUMBER
        • Any code that changes an API (/pkg/apis/...)
        • cc @kubernetes/api
      • Identify shepherd (your SIG lead and/or kubernetes-pm@googlegroups.com will be able to help you). My Shepherd is: replace.me@replaceme.com (and/or GH Handle)
        • A shepherd is an individual who will help acquaint you with the process of getting your feature into the repo, identify reviewers and provide feedback on the feature. They are not (necessarily) the code reviewer of the feature, or tech lead for the area.
        • The shepherd is not responsible for showing up to Kubernetes-PM meetings and/or communicating if the feature is on-track to make the release goals. That is still your responsibility.
      • Identify secondary/backup contact point. My Secondary Contact Point is: replace.me@replaceme.com (and/or GH Handle)
    • Write (code + tests + docs) then get them merged. ALL-PR-NUMBERS
      • Code needs to be disabled by default. Verified by code OWNERS
      • Minimal testing
      • Minimal docs
        • cc @kubernetes/docs on docs PR
        • cc @kubernetes/feature-reviewers on this issue to get approval before checking this off
        • New apis: Glossary Section Item in the docs repo: kubernetes/kubernetes.github.io
      • Update release notes
  • Before Beta
    • Testing is sufficient for beta
    • User docs with tutorials
      • Updated walkthrough / tutorial in the docs repo: kubernetes/kubernetes.github.io
      • cc @kubernetes/docs on docs PR
      • cc @kubernetes/feature-reviewers on this issue to get approval before checking this off
    • Thorough API review
      • cc @kubernetes/api
  • Before Stable
    • docs/proposals/foo.md moved to docs/design/foo.md
      • cc @kubernetes/feature-reviewers on this issue to get approval before checking this off
    • Soak, load testing
    • detailed user docs and examples
      • cc @kubernetes/docs
      • cc @kubernetes/feature-reviewers on this issue to get approval before checking this off

FEATURE_STATUS is used for feature tracking and to be updated by @kubernetes/feature-reviewers.
FEATURE_STATUS: IN_DEVELOPMENT

More advice:

Design

  • Once you get LGTM from a @kubernetes/feature-reviewers member, you can check this checkbox, and the reviewer will apply the "design-complete" label.

Coding

  • Use as many PRs as you need. Write tests in the same or different PRs, as is convenient for you.
  • As each PR is merged, add a comment to this issue referencing the PRs. Code goes in the http://github.com/kubernetes/kubernetes repository,
    and sometimes http://github.com/kubernetes/contrib, or other repos.
  • When you are done with the code, apply the "code-complete" label.
  • When the feature has user docs, please add a comment mentioning @kubernetes/feature-reviewers and they will
    check that the code matches the proposed feature and design, and that everything is done, and that there is adequate
    testing. They won't do detailed code review: that already happened when your PRs were reviewed.
    When that is done, you can check this box and the reviewer will apply the "code-complete" label.

Docs

  • Write user docs and get them merged in.
  • User docs go into http://github.com/kubernetes/kubernetes.github.io.
  • When the feature has user docs, please add a comment mentioning @kubernetes/docs.
  • When you get LGTM, you can check this checkbox, and the reviewer will apply the "docs-complete" label.
@davidopp davidopp added the sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. label Oct 1, 2016
@idvoretskyi idvoretskyi modified the milestone: v1.5 Oct 11, 2016
@davidopp (Member Author)

Removing from 1.5 milestone.

@davidopp davidopp modified the milestones: next-milestone, v1.5 Oct 18, 2016
@aveshagarwal (Member)

milestone 1.7?

@davidopp (Member Author)

@aveshagarwal Will you be working on it for 1.7? If so, then yes we should set 1.7 milestone.

@aveshagarwal (Member)

@davidopp yes.

@davidopp davidopp modified the milestones: v1.7, next-milestone Apr 25, 2017
@davidopp (Member Author)

done

@idvoretskyi idvoretskyi added the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label May 3, 2017
@idvoretskyi (Member)

@davidopp @aveshagarwal I've updated the feature description to fit the new template. Please fill in the empty fields in the new template (their actual state was unclear).

@davidopp (Member Author)

@aveshagarwal I assume we won't have any code for this in 1.7, probably just a design at most, so we should move it to next-milestone?

@aveshagarwal (Member)

@davidopp I am prototyping the utilization-based use case from the existing design doc in the current rescheduler code in contrib, and I am planning to have that ready by 1.7. But since it will live in the contrib repo outside the kube repo, I am not sure it would impact kube 1.7.

@aveshagarwal (Member) commented May 11, 2017

@davidopp The one thing I am looking into is a new priority function based on node utilization that might be needed in the existing scheduler, so that when the rescheduler moves a pod off an over-utilized node, the existing scheduler can place that pod on a less/under-utilized node, in alignment with the rescheduler's decision. As far as I currently understand, that is the one change that might be needed in kube 1.7 for the first version of the rescheduler.
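
For illustration, a minimal sketch of such a utilization-based priority function, assuming node utilization is already available as a fraction of capacity; the 0-10 score range and names are illustrative, not an agreed design:

```go
// Illustrative sketch: score nodes so that less-utilized nodes rank higher,
// keeping the scheduler's placement aligned with the rescheduler's evictions.
package main

import "fmt"

// scoreNode maps node utilization (0.0-1.0) onto a 0-10 priority score:
// the lower the utilization, the higher the score.
func scoreNode(utilization float64) int {
	if utilization < 0 {
		utilization = 0
	}
	if utilization > 1 {
		utilization = 1
	}
	return int((1 - utilization) * 10)
}

func main() {
	for _, u := range []float64{0.15, 0.55, 0.92} {
		fmt.Printf("utilization %.2f -> score %d\n", u, scoreNode(u))
	}
}
```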

@gyliu513

@aveshagarwal After your work, will the rescheduler still only work for pods in the kube-system namespace?

@davidopp (Member Author)

This issue is referring to a different rescheduler than the one we currently have. The naming is unfortunate. The current rescheduler will go away once #268 is implemented.

@gyliu513

Good to know, thanks @davidopp

@fgrzadkowski commented May 12, 2017 via email

@aveshagarwal (Member)

Yes, I am focusing on the spreading use case based on node resource utilization.

@vishh (Contributor) commented May 12, 2017 via email

@aveshagarwal (Member)

I think a benefit of the latter is having a balanced cluster after events such as:

  1. a node coming back from maintenance
  2. autoscaling
  3. a pod's initial scheduling decision turning out to be sub-optimal over time.

Rescheduling a pod that is experiencing performance issues or poor service is also a use case we would like to handle eventually, but not as a first step. Moreover, if we act proactively, a pod may never experience poor service in the first place.

So there are various triggers that can cause a rescheduler to act: poor service, as you mentioned, node utilization, and many others. But as per the discussion, spreading based on node utilization seems to be the first step most users would be interested in.

@vishh (Contributor) commented May 12, 2017 via email

@aveshagarwal (Member)

I'd say the goal is to optimize (specifically, minimize) the number of over-utilized nodes (x) in a cluster, such that 0 <= x <= N, where N <= the number of nodes in the cluster. The utilization threshold and N are configurable. The rescheduler makes a best effort to optimize this but cannot provide a guarantee. Also, if it results in improved bin packing (as you mentioned) as a side effect, that's good, but it is not a direct optimization goal, at least for the first step.
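
For illustration, the quantity being minimized could be sketched like this, with a hypothetical threshold and utilization figures:

```go
// Illustrative sketch: count the nodes whose utilization exceeds a
// configurable threshold; a rescheduler would try, best effort, to drive
// this count down by evicting pods from those nodes.
package main

import "fmt"

func overUtilized(utilizations []float64, threshold float64) int {
	count := 0
	for _, u := range utilizations {
		if u > threshold {
			count++
		}
	}
	return count
}

func main() {
	nodes := []float64{0.35, 0.82, 0.91, 0.40}
	fmt.Println("over-utilized nodes:", overUtilized(nodes, 0.80)) // prints 2
}
```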

@davidopp (Member Author)

The features repo should not be used for technical discussions. Please move the discussion to kubernetes/kubernetes#12140.

BTW @aveshagarwal it would probably be good if you were to write a short design doc for what you're doing.

@aveshagarwal (Member)

@davidopp Yeah sure, planning to have something by next week.

@idvoretskyi (Member)

@davidopp @aveshagarwal have you agreed to target this feature for 1.7? If yes, please update the features template to reflect its actual status.

@davidopp (Member Author)

@aveshagarwal mentioned just one change that he might want in 1.7: #109 (comment)

But Avesh, what you described sounds like the current default scheduling policy (try to spread based on resources). So maybe you don't need a new priority function in 1.7?

@aveshagarwal (Member)

@davidopp Yeah, that sounds good. In that case there don't seem to be any kube changes needed for the initial version, so it should not impact kube 1.7.

Though I was thinking of a priority function based on actual resource utilization (e.g. by obtaining metrics from something like Heapster), which is different from how the existing spreading function works.

@davidopp (Member Author) commented May 16, 2017

Though I was thinking of a priority function based on actual resource utilization (e.g. by obtaining metrics from something like Heapster), which is different from how the existing spreading function works.

We've talked about doing usage-based scheduling for best-effort pods (kubernetes/kubernetes#18438), but don't have it yet.

@idvoretskyi (Member)

@davidopp @aveshagarwal so, any update on the status, gentlemen?

@idvoretskyi (Member)

@davidopp @aveshagarwal is this feature going to land in 1.7? If not, I'll remove the 1.7 association.

@aveshagarwal (Member)

@idvoretskyi No.

@luxas luxas removed this from the v1.7 milestone Jun 15, 2017
@idvoretskyi (Member)

@davidopp @aveshagarwal @kubernetes/sig-scheduling-feature-requests any plans to continue the feature development for 1.9?

@idvoretskyi idvoretskyi added this to the next-milestone milestone Oct 2, 2017
@davidopp (Member Author) commented Oct 3, 2017

There will be development in the future, but I'm not sure about 1.9. @aveshagarwal are you planning to do more work on this for 1.9?

@aveshagarwal (Member)

@davidopp @idvoretskyi Yes, there will be ongoing development adding new features and functionality, with regular releases. Here is the repo: https://github.com/kubernetes-incubator/descheduler. After every Kubernetes release, it will be rebased onto the latest kube release.

@idvoretskyi (Member)

@aveshagarwal @davidopp cool, I'll add this item to the 1.9 features track.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 7, 2018
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Feb 10, 2018
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

howardjohn pushed a commit to howardjohn/enhancements that referenced this issue Oct 21, 2022