Different aggregation functions for the latency metrics #1741
Comments
In the detailed dashboard, we show the calculated latency. So, for example, when we are showing a cluster view, we don't calculate p99 for the shards and then average over it; instead, we calculate the actual p99 of the cluster. So there's no aggregation. Specifically for avg latencies, because it's mathematically possible to avg over avg, we can add aggregation function, |
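One caveat on "avg over avg", as a minimal sketch: averaging per-shard averages only matches the true cluster average when each shard's average is weighted by its request count. The shard counts and latencies below are made-up numbers for illustration, not data from this issue, and this is not how the dashboard computes anything.

```python
# Hypothetical per-shard stats: (request_count, mean_latency_ms).
shards = [(1000, 1.0), (10, 50.0)]


def unweighted_mean(stats):
    """Plain average of the per-shard averages (ignores request counts)."""
    return sum(mean for _, mean in stats) / len(stats)


def weighted_mean(stats):
    """Average of per-shard averages weighted by request count.

    This equals total latency divided by total requests, i.e. the
    actual mean latency over all requests.
    """
    total_requests = sum(count for count, _ in stats)
    return sum(count * mean for count, mean in stats) / total_requests


print(unweighted_mean(shards))  # 25.5
print(weighted_mean(shards))    # about 1.49
```

With these numbers the nearly idle shard pulls the unweighted value far away from what most requests actually experienced.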
Not sure I understand the point above. Could you, please, clarify? |
I mean that min and max are useful, but I currently have no good way of applying them to latency without making the latencies graph unusable by default. |
@vladzcloudius I've made an enhancement out of it to keep it open for future functionality. The question is: what is the use case you are trying to solve? |
@tzach We had this discussion in the past about latency max. I think the overall understanding is that max is not relevant for latency, just quantile (for example, P99). |
Indeed, showing max lets an outlier dominate: one random point can affect the entire day or year of data. |
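To illustrate the outlier point, here is a small sketch with made-up samples (not measurements from any cluster): a single spike sets the max for the whole window, while P99 stays where almost all requests are. It uses `statistics.quantiles` from the Python standard library.

```python
import statistics

# Hypothetical window: 10,000 requests at 2 ms plus one 5,000 ms spike.
samples = [2.0] * 10_000 + [5000.0]

# max is decided entirely by the single spike.
worst = max(samples)

# quantiles(n=100) returns the 99 percentile cut points; index 98 is P99.
p99 = statistics.quantiles(samples, n=100)[98]

print(worst)  # 5000.0
print(p99)    # 2.0
```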
@amnonh you are confusing things.
|
The solution will be: when the aggregation function is set to sum, we will have the current behavior (i.e. the actual latency of the node, dc or cluster).
|
Aren't we currently showing the |
We are calculating the P50, P95 and P99, not simply averaging over the per-shard values. Especially with P95 and P99, the result can be very different from a simple average. |
@vladzcloudius I'm working on an updated version. When looking at the latencies shard view (per scheduling group), do we want to remove the ones that are zero? |
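A rough sketch of why the two differ, using two hypothetical shards and a simple nearest-rank percentile (an assumption made for illustration; the dashboard derives its quantiles differently):

```python
import math


def percentile(values, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical latencies in ms: shard_a is uniformly fast,
# shard_b has 10% slow requests.
shard_a = [1.0] * 100
shard_b = [1.0] * 90 + [100.0] * 10

# Averaging the per-shard P99 values.
avg_of_p99 = (percentile(shard_a, 99) + percentile(shard_b, 99)) / 2

# P99 computed over all requests of both shards together.
cluster_p99 = percentile(shard_a + shard_b, 99)

print(avg_of_p99)   # 50.5
print(cluster_p99)  # 100.0
```

In this made-up case the averaged value (50.5 ms) is a latency that no request actually had, while the pooled P99 reports the 100 ms that the slowest 5% of requests really saw.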
This will only be the case when corresponding groups are not used at all. |
Installation details
Panel Name: average XXX latency
Dashboard Name: Detailed
Scylla-Monitoring Version: 3.10.0
Scylla-Version: 2021.1.10-0.20220410.e8e681dee
Description
Changing the aggregation function to, for instance, "max" doesn't change the output in a consistent way: "sum" is the default, but it definitely doesn't show a sum; it looks more like an average:
And with "max" it looks much the same as with "sum":