-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
enhance: Optimize workload based replica selection policy #36181
enhance: Optimize workload based replica selection policy #36181
Conversation
d210076
to
d8db3e1
Compare
@weiliu1031 go-sdk check failed, comment |
@weiliu1031 E2e jenkins job failed, comment |
d8db3e1
to
bb96b15
Compare
@weiliu1031 go-sdk check failed, comment |
@weiliu1031 E2e jenkins job failed, comment |
bb96b15
to
d21a606
Compare
@weiliu1031 go-sdk check failed, comment |
@weiliu1031 E2e jenkins job failed, comment |
f4f31df
to
4206540
Compare
before this pr:
after this pr:
|
4206540
to
36cebca
Compare
@weiliu1031 E2e jenkins job failed, comment |
rerun ut |
/run-cpu-e2e |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #36181 +/- ##
===========================================
+ Coverage 71.72% 82.37% +10.64%
===========================================
Files 1276 1276
Lines 150629 150653 +24
===========================================
+ Hits 108039 124100 +16061
+ Misses 37614 21583 -16031
+ Partials 4976 4970 -6
|
rerun go-sdk |
1 similar comment
rerun go-sdk |
Key: "proxy.workloadToleranceFactor", | ||
Version: "2.4.12", | ||
DefaultValue: "0.1", | ||
Doc: "tolerance factor for query node workload difference", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please explain more about what's ToleranceFactor
@weiliu1031 go-sdk check failed, comment |
@weiliu1031 E2e jenkins job failed, comment |
This PR introduce two new param: toleranceFactor and checkRequestNum, after every checkRequestNum request has been assigned, try to compute querynode's workload score. if the diff is less than the toleranceFactor, replica selection policy will fallback to round_robin, which reduce the average cost to about 200ns. if the diff is larger than the toleranceFactor, replica selection policy will compute querynode's score to select the target node with smallest score in every assigment. Signed-off-by: Wei Liu <wei.liu@zilliz.com>
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
cba3950
to
2694d09
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: congqixia, weiliu1031, XuanYang-cn The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
…36181) issue: milvus-io#35859 This PR introduce two new param: toleranceFactor and checkRequestNum, after every checkRequestNum request has been assigned, try to compute querynode's workload score. if the diff is less than the toleranceFactor, replica selection policy will fallback to round_robin, which reduce the average cost to about 500ns. if the diff is larger than the toleranceFactor, replica selection policy will compute querynode's score to select the target node with smallest score in every assigment. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>
…36384) issue: #35859 pr: #36181 This PR introduce two new param: toleranceFactor and checkRequestNum, after every checkRequestNum request has been assigned, try to compute querynode's workload score. if the diff is less than the toleranceFactor, replica selection policy will fallback to round_robin, which reduce the average cost to about 500ns. if the diff is larger than the toleranceFactor, replica selection policy will compute querynode's score to select the target node with smallest score in every assigment. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>
issue: #35859
This PR introduce two new param: toleranceFactor and checkRequestNum, after every checkRequestNum request has been assigned, try to compute querynode's workload score.
if the diff is less than the toleranceFactor, replica selection policy will fallback to round_robin, which reduce the average cost to about 500ns.
if the diff is larger than the toleranceFactor, replica selection policy will compute querynode's score to select the target node with smallest score in every assigment.