-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add level=datastreams to Indices Stats #83049
Comments
Pinging @elastic/es-data-management (Team:Data Management) |
Any rough plan on what the data structure would look like? I'm curious how well this shape up with things like spotting a shard getting heavy search activity causing node trouble. Seems like do to that we'd still need to publish information about which shards are on what nodes, which would be basically index-level. At least for indexing operations, those would always been on the latest index for the data stream. I don't know all the index metrics offhand, but it might be good to start by listing which ones we could avoid publishing for non-write indices taking part in a data stream. Then we could look at potential savings for expected workloads. |
The thinking was to just have DS display like indices do today if
Before we go on I think it's important to understand the answer to this question: |
There's a couple of places in the stack monitoring UI we have shards called out (which nodes have which shards, recent shard migrations). I'm not sure offhand just how detailed the data is but I've definitely used the shard/node association to help pinpoint hot ingest situations in the past. |
We want to add another level "datastreams" in addition to the existing "shards" and "indices" to the indices stats API.
This level would group+aggregate index-stats by data-stream instead of returning stats by individual index. It would not otherwise change the response format and a datastream would simply appear as a single index to callers.
Indices covered by the request that are not part of a datastream would be included in the response the same way as they are at level "indices".
This is motivated by scalability concerns around the response sizes generated by this API for very larger clusters. Also bandwidth used by monitoring when calling this API every 10s or so for all indices becomes problematic for large index counts.
relates #77466
The text was updated successfully, but these errors were encountered: