-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Updating some search backpressure settings crash the cluster #15495
Comments
@kaushalmahi12 - Can you look into this? While @gaobinlong already has PR for |
[Triage - attendees 1 2 3] - @jainankitk / @gaobinlong - Can we add more details/stacktraces around as to why the cluster-manager fails to come back after restart ? |
@gaobinlong mind please updating the documentation for these settings [1], thank you |
Thanks @reta, I've created a documentation PR for it: opensearch-project/documentation-website#8555. |
Here're the stacktraces:
, the cluster_manger is not able to apply the invalid settings because the cluster state is corrupt, after execute |
Describe the bug
This issue comes from the forum: https://forum.opensearch.org/t/unable-to-start-opensearch-loop-failed-to-apply-settings-and-rate-must-be-greater-than-zero/20908.
When update the setting
search_backpressure.cancellation_burst
(deprecated),search_backpressure.search_task.cancellation_burst
orsearch_backpressure.search_shard_task.cancellation_burst
to an non-default value, the cluster fails to apply the settings and throwsorg.opensearch.OpenSearchException: java.lang.IllegalArgumentException: rate must be greater than zero
, the cluster gets stuck in it and all operations on the master node fail, even restarting the cluster doesn't work.Related component
Cluster Manager
To Reproduce
, to avoid making your cluster never come back even after restarting it, you can change
persistent
totransient
.Expected behavior
Fix the bug.
Additional Details
Plugins
Please list all plugins currently enabled.
Screenshots
If applicable, add screenshots to help explain your problem.
Host/Environment (please complete the following information):
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: