Adding first draft of [OTEL Kubernetes] Cluster Overview Dashboard #10443
"68bd6f9e-6894-4991-b2ff-fac3a4461b2b": {
"dataType": "number",
"isBucketed": false,
"label": "Average of k8s.node.cpu.utilization",
The way the Collector currently reports this metric is not correct. I'd suggest leaving it out for now or using k8s.node.cpu.usage instead.
You can find more info at:
- [receiver/kubeletstat] Review cpu.utilization naming open-telemetry/opentelemetry-collector-contrib#27885 (comment)
- https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/6a5b9e673a433ad2c6ed7ef2526a8920fd18cf0d/receiver/kubeletstatsreceiver/metadata.yaml#L96
There is a fix for that at open-telemetry/opentelemetry-collector-contrib@e248353, but I'm not sure yet whether we will land it first or whether we should first deprecate and remove the metrics and then add the correct metric back.
You can follow open-telemetry/opentelemetry-collector-contrib#27885 for updates.
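To see where the dashboard export still references the metric in question, a small script along these lines could help. This is only a sketch: the helper name `find_metric_references` and the `$`-style path notation are made up for illustration, and the sample input is the fragment quoted above rather than the full dashboard file.

```python
import json
from typing import Any, Iterator


def find_metric_references(node: Any, metric: str, path: str = "$") -> Iterator[str]:
    """Yield the path of every string value that mentions `metric`.

    Hypothetical helper for illustration; it walks any parsed JSON document.
    """
    if isinstance(node, dict):
        for key, value in node.items():
            yield from find_metric_references(value, metric, f"{path}.{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_metric_references(value, metric, f"{path}[{index}]")
    elif isinstance(node, str) and metric in node:
        yield path


# Sample input: the dashboard fragment quoted in this review thread.
sample = json.loads("""
{
  "68bd6f9e-6894-4991-b2ff-fac3a4461b2b": {
    "dataType": "number",
    "isBucketed": false,
    "label": "Average of k8s.node.cpu.utilization"
  }
}
""")

hits = list(find_metric_references(sample, "k8s.node.cpu.utilization"))
for hit in hits:
    print(hit)
```

Each reported path is a panel field that would need to be dropped or pointed at k8s.node.cpu.usage.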
Adding another iteration
💔 Build Failed
I might be missing some context here, but it is unclear which type of data this dashboard aims to consume.
Is it going to use translated metrics coming from the Infra Metrics Processor along with OTel native metrics?
Since this is focused on OTel users and the translation lib emits both the translated and the OTel native metrics, why not use the native ones here directly?
So instead of kubernetes.pod.cpu.usage.limit.pct, use k8s.pod.cpu_limit_utilization directly?
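As a rough illustration of that swap, the sketch below rewrites translated metric names to their OTel native counterparts across a parsed dashboard document. The mapping holds only the single pair named in this comment; the helper `to_native` and the sample panel are hypothetical. Whether the two metrics share the same scale is not confirmed here, so any panel formatting would still need a manual check.

```python
from typing import Any, Dict

# Only this pair is taken from the review comment; any further entries
# would need to be verified against the translation library.
TRANSLATED_TO_NATIVE: Dict[str, str] = {
    "kubernetes.pod.cpu.usage.limit.pct": "k8s.pod.cpu_limit_utilization",
}


def to_native(node: Any, mapping: Dict[str, str] = TRANSLATED_TO_NATIVE) -> Any:
    """Return a copy of `node` with translated metric names replaced.

    Hypothetical helper for illustration; dictionary keys are left untouched.
    """
    if isinstance(node, dict):
        return {key: to_native(value, mapping) for key, value in node.items()}
    if isinstance(node, list):
        return [to_native(value, mapping) for value in node]
    if isinstance(node, str):
        for old, new in mapping.items():
            node = node.replace(old, new)
    return node


# Made-up panel fragment, shaped loosely like a dashboard column definition.
panel = {
    "sourceField": "kubernetes.pod.cpu.usage.limit.pct",
    "label": "Average of kubernetes.pod.cpu.usage.limit.pct",
    "isBucketed": False,
}

native_panel = to_native(panel)
print(native_panel["sourceField"])
```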
{
"embeddableConfig": {
"attributes": {
"description": "Average of 100 Top Pod CPU Usage based on kubernetes.pod.usage.limit.pct",
If these dashboards are aimed at OTel users, the naming should follow the semantic conventions, using Utilization or Usage according to the underlying metric's name and meaning.
Hi! We just realized that we haven't looked into this PR in a while. We're sorry! We're labeling this issue as |
changes:
- description: Create General Kubernetes Dashboard [Otel] to support Opentelemetry observability
type: enhancement
link: https://github.com/elastic/integrations/pull/10406
Suggested change:
- link: https://github.com/elastic/integrations/pull/10406
+ link: https://github.com/elastic/integrations/pull/10443
Closed, as the work was done in a different PR.
Proposed commit message
Adding a new [OTEL Kubernetes] Cluster Overview dashboard to the k8s integration.
Checklist
- changelog.yml file.
How to test this PR locally
- kind create cluster
- elastic/integrations/packages/kubernetes
- elastic-package build
- kubectl apply -f elastic-otel-collector_all.yaml
- elastic-package stack up -d -v --version=8.15.0-SNAPSHOT
- [OTEL Kubernetes] Cluster Overview
Screenshots