
Add benchmark test to compare EvenPodsSpreadPriority and SelectorSpreadingPriority #84606

Merged: 1 commit into kubernetes:master, Nov 6, 2019

Conversation

alculquicondor (Member) commented Oct 31, 2019

/kind feature

What this PR does / why we need it:
The added tests exercise EvenPodsSpreadPriority using constraints that have a similar effect to the hardcoded parameters of the SelectorSpreadingPriority algorithm.

Current results:

BenchmarkTestDefaultEvenPodsSpreadPriority/100nodes-56     2000     918838 ns/op
BenchmarkTestDefaultEvenPodsSpreadPriority/1000nodes-56     300    5137881 ns/op
BenchmarkTestSelectorSpreadingPriority/100nodes-56        10000     181157 ns/op
BenchmarkTestSelectorSpreadingPriority/1000nodes-56        1000    1654661 ns/op

Note that EvenPodsSpreadPriority is roughly 5x slower than SelectorSpreadingPriority at 100 nodes and roughly 3x slower at 1000 nodes.
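A minimal sketch of the Go benchmark shape that produces output like the above; every name here is a hypothetical stand-in, not the PR's actual fixtures. Each b.Run sub-benchmark yields one line of per-iteration timing, and the metadata pre-calculation sits inside the timed loop:

```go
package priorities_test

import "testing"

// Hypothetical stand-ins for the real fixtures and the priority under test;
// the actual benchmark wires up real pods, nodes, and scheduler metadata.
type node struct{ name string }

func makeNodes(count int) []node       { return make([]node, count) }
func computeMetadata(nodes []node) int { return len(nodes) } // pre-calculation pass
func scoreNode(meta int, n node) int64 { return int64(meta) + int64(len(n.name)) }

func BenchmarkSpreadPriority(b *testing.B) {
	for _, tt := range []struct {
		name  string
		count int
	}{
		{"100nodes", 100},
		{"1000nodes", 1000},
	} {
		b.Run(tt.name, func(b *testing.B) {
			nodes := makeNodes(tt.count)
			b.ResetTimer()
			for i := 0; i < b.N; i++ {
				meta := computeMetadata(nodes) // metadata calculation is timed too
				for _, n := range nodes {
					_ = scoreNode(meta, n) // per-node map step
				}
			}
		})
	}
}
```

Running something like `go test -bench=BenchmarkSpreadPriority` then prints one ns/op figure per sub-benchmark, in the format quoted above.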

Which issue(s) this PR fixes:
Part of #80639

Does this PR introduce a user-facing change?:

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. release-note-none Denotes a PR that doesn't merit a release note. kind/feature Categorizes issue or PR as related to a new feature. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Oct 31, 2019
@k8s-ci-robot k8s-ci-robot requested review from ahg-g and k82cn October 31, 2019 15:12
@k8s-ci-robot k8s-ci-robot added sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Oct 31, 2019
alculquicondor (Member, Author):

/assign @Huang-Wei

Huang-Wei (Member):

@alculquicondor There are some things I don't quite follow:

  1. In the BenchmarkTestSelectorSpreadingPriority test, there is one selector in the Service, while in BenchmarkTestDefaultEvenPodsSpreadPriority there are two constraints, hence two selectors. I don't think their results are comparable.
  2. It seems we can only define one default global "topologyKey" and then apply it to all selectors? If so, that's not the same semantics as EvenPodsSpread, where users can define a different topologyKey on each topologySpreadConstraint.

alculquicondor (Member, Author):

  1. In the BenchmarkTestSelectorSpreadingPriority test, there is one selector in the Service, while in BenchmarkTestDefaultEvenPodsSpreadPriority there are two constraints, hence two selectors. I don't think their results are comparable.

Yes, EvenPodsSpreadPriority has two constraints, but the test is using the same selector for both.

  1. It seems we can only define one default global "topologyKey" and then apply it to all selectors? If so, that's not the same semantics as EvenPodsSpread, where users can define a different topologyKey on each topologySpreadConstraint.

SelectorSpreadingPriority spreads over two topology values: the node name and the zone. That's what we are trying to replicate with the two constraints for EvenPodsSpread. We are not concerned about flexibility in this case; we just want to ensure that, using those two exact constraints, we get similar performance for the same set of nodes and pods.
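A minimal sketch of what such a pair of soft constraints looks like, assuming a placeholder selector and the beta zone label in use around v1.17; this is illustrative, not the PR's fixture code:

```go
package main

import (
	"fmt"

	v1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

func main() {
	// Both constraints reuse the same selector, mirroring how the benchmark
	// uses one selector for both topology keys.
	selector := &metav1.LabelSelector{
		MatchLabels: map[string]string{"app": "demo"}, // placeholder selector
	}
	constraints := []v1.TopologySpreadConstraint{
		{
			MaxSkew:           1,
			TopologyKey:       "kubernetes.io/hostname", // spread across node names
			WhenUnsatisfiable: v1.ScheduleAnyway,        // soft: scored, not filtered
			LabelSelector:     selector,
		},
		{
			MaxSkew:           1,
			TopologyKey:       "failure-domain.beta.kubernetes.io/zone", // spread across zones
			WhenUnsatisfiable: v1.ScheduleAnyway,
			LabelSelector:     selector,
		},
	}
	fmt.Printf("%+v\n", constraints)
}
```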

Huang-Wei (Member) commented Oct 31, 2019:

One reason why SelectorSpreadingPriority is quicker is that its map function only counts matching pods on the filtered nodes. However, that's inaccurate: we need to take into account the matching pods on unqualified nodes (i.e., nodes that failed the predicates/filter phase).

And in the test, only 10% of all the nodes are qualified. Edit: I misread; it's 100%.

alculquicondor (Member, Author):

I noticed that in the algorithm. However, we are using 100% in this test to make the comparison fairer.

Huang-Wei (Member):

However, we are using 100% in this test to make the comparison fairer.

@alculquicondor I misread the percentage; updated in #84606 (comment)


Essentially, it's not about the percentage of nodes that are "filtered"; it's about whether or not a particular Priority has a pre-calculation phase.

If we can skip the pre-calculation phase, it's much faster; in other words, it's a "pure" map-reduce-implementable Priority. Otherwise, we have to run the pre-calculation over all the nodes, which is slower.
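A toy illustration of that distinction, using made-up types rather than the scheduler's real signatures: the "pure" map step needs only the node it is handed, while the pre-calculation must walk every node before any scoring starts:

```go
package main

import "fmt"

// toy stand-in for the scheduler's per-node state
type nodeInfo struct {
	name         string
	matchingPods int // pods matching the spreading selector on this node
}

// pureMapScore is the "pure" map-reduce case: it needs nothing beyond the one
// node it is given, so it can run over just the filtered nodes.
func pureMapScore(n nodeInfo) int64 {
	return -int64(n.matchingPods) // fewer matching pods scores relatively higher
}

// precomputeCounts models the metadata phase: matching pods on nodes that
// failed filtering still affect skew, so this pass must cover every node.
func precomputeCounts(all []nodeInfo) map[string]int64 {
	counts := make(map[string]int64, len(all))
	for _, n := range all {
		counts[n.name] = int64(n.matchingPods)
	}
	return counts
}

func main() {
	all := []nodeInfo{{"n1", 2}, {"n2", 0}, {"n3", 5}}
	filtered := all[:2] // suppose n3 failed the filter phase

	meta := precomputeCounts(all) // O(all nodes), even though n3 is never scored
	for _, n := range filtered {
		fmt.Println(n.name, pureMapScore(n), meta[n.name])
	}
}
```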

alculquicondor (Member, Author):

They are different algorithms, and SelectorSpreadingPriority skips calculations by only going through the filtered nodes. However, when 100% of the nodes pass filtering, they are essentially doing the same calculations.

SelectorSpreadingPriority does the "pre-calculation" in Map, whereas EvenPodsSpread does it in metadata. But overall (at 100% of nodes), they are doing the same work.

ahg-g (Member) commented Nov 4, 2019:

I think we can merge the benchmark for now and iterate over it as we try to bridge the gap.

…adPriority

Signed-off-by: Aldo Culquicondor <acondor@google.com>
alculquicondor (Member, Author):

I just rebased onto the Map/Reduce implementation. Note that I'm including metadata calculation in the benchmarks.

alculquicondor (Member, Author):

/test pull-kubernetes-e2e-gce-device-plugin-gpu

alculquicondor (Member, Author):

/test pull-kubernetes-e2e-gce

Huang-Wei (Member) left a review comment:

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 5, 2019
k8s-ci-robot (Contributor):

[APPROVALNOTIFIER] This PR is APPROVED

This pull request has been approved by: alculquicondor, Huang-Wei

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 5, 2019
fejta-bot:

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to fejta).

Review the full test history for this PR.

Silence the bot with an /lgtm cancel or /hold comment for consistent failures.

@k8s-ci-robot k8s-ci-robot merged commit 3c4ae1c into kubernetes:master Nov 6, 2019
@k8s-ci-robot k8s-ci-robot added this to the v1.17 milestone Nov 6, 2019