Describe the bug
The _cluster/allocation/explain API returns an incorrect response on clusters with batch mode enabled, because shard allocation explain requests are served by GatewayAllocator instead of ShardsBatchGatewayAllocator (AllocatorFetchLogic, ExistingShardAllocatorSetting). A change in AllocationService is required so that it switches to ShardsBatchGatewayAllocator when batch mode is enabled.
Issue was identified by:
After enabling index.unassigned.node_left.delayed_timeout and taking down the nodes holding 2 replicas of the shard, the expected response from _cluster/allocation/explain was allocation_delayed, but the API returned awaiting_info instead.
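For reference, the setup above could be scripted roughly as follows. This is only a sketch: the index name `test-ind` is taken from the response in this report, while the `5m` timeout value and the helper names are assumptions. The functions only build the JSON bodies (for `PUT /test-ind/_settings` and `GET /_cluster/allocation/explain`) and do not send anything.

```python
import json


def delayed_timeout_settings(timeout="5m"):
    """Body for PUT /<index>/_settings enabling delayed allocation.

    The "5m" default is an assumed value, not one given in this report.
    """
    return {"settings": {"index.unassigned.node_left.delayed_timeout": timeout}}


def explain_request(index, shard, primary):
    """Body for GET /_cluster/allocation/explain targeting a single shard copy."""
    return {"index": index, "shard": shard, "primary": primary}


# Replica of shard 0 of test-ind, matching the response shown in this report.
print(json.dumps(delayed_timeout_settings()))
print(json.dumps(explain_request("test-ind", 0, False)))
```

Sending these bodies with any HTTP client after the nodes have been taken down should reproduce the response described above.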
Related component
Cluster Manager
To Reproduce
Create a cluster with a dedicated master and 10 data nodes.
Validate that the value of the can_allocate field in the response is awaiting_info. The response would look like this:
{"index":"test-ind","shard":0,"primary":false,"current_state":"unassigned","unassigned_info":{"reason":"NODE_LEFT","at":"2024-06-05T05:33:16.753Z","details":"node_left [Bvu-mf5XSPu3DEmv9ndBgw]","last_allocation_status":"no_attempt"},"can_allocate":"awaiting_info","allocate_explanation":"cannot allocate because information about existing shard data is still being retrieved from some of the nodes","node_allocation_decisions":[{"node_id":"3YYYQYZLQaGck1tIOJ57xg","node_name":"517c7e06d65968c38f1a4140b265ccc4","
Expected behavior
The value of the can_allocate field in the response is allocation_delayed.
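A check along these lines could be used to verify a fix. It is a minimal sketch, assuming the explain response is available as a JSON string and comparing against the `allocation_delayed` value named in the description. The helper names are hypothetical, and the sample is a trimmed copy of the response from this report.

```python
import json


def allocation_decision(explain_response):
    """Return the can_allocate value from an allocation explain response."""
    return json.loads(explain_response)["can_allocate"]


def is_expected(explain_response, expected="allocation_delayed"):
    """True when the explain response reports the expected decision."""
    return allocation_decision(explain_response) == expected


# Trimmed copy of the response shown in this report (buggy behaviour).
sample = json.dumps({
    "index": "test-ind",
    "shard": 0,
    "primary": False,
    "current_state": "unassigned",
    "can_allocate": "awaiting_info",
})

print(is_expected(sample))  # False: the API reported awaiting_info
```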
Additional Details
OpenSearch Version: 2.14