Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adding Prometheus metric for Routes health #2597

Merged
merged 1 commit into from
Sep 18, 2023

Conversation

universam1
Copy link
Contributor

@universam1 universam1 commented Sep 14, 2023

ref: #2595

adding Prometheus metrics for Routes health of routes-total and routes-last-update gauges that will allow to defines alerts if Skipper instances drift.

@webframp
Copy link

We definitely don't expect this to be merged as is but wanted to get a PR up to make sure we're on the right track with how to wire the counters in.

@universam1
Copy link
Contributor Author

@szuecs @AlexanderYastrebov Wiring up the metric into the routing is a challenge, not a good idea at the moment. Happy to your feedback how a proper access would look like?

@AlexanderYastrebov AlexanderYastrebov marked this pull request as draft September 14, 2023 15:39
proxy/proxy.go Outdated Show resolved Hide resolved
@universam1 universam1 force-pushed the promRoutesHealth branch 2 times, most recently from 104d03b to a8566a5 Compare September 15, 2023 14:18
@universam1 universam1 changed the title Draft: adding Prometheus metric for RoutesTotal adding Prometheus metric for Routes health Sep 15, 2023
@universam1 universam1 marked this pull request as ready for review September 15, 2023 14:22
@universam1
Copy link
Contributor Author

universam1 commented Sep 15, 2023

The metrics, combined with a diff threshold seem quite useful:
image

for Routes health of `routes.total` and
`routes.updated_timestamp` gauges
that will facilitate to define alerts if Skipper instances drift

Signed-off-by: Samuel Lang <gh@lang-sam.de>
@AlexanderYastrebov
Copy link
Member

👍

1 similar comment
@RomanZavodskikh
Copy link
Contributor

👍

@AlexanderYastrebov AlexanderYastrebov merged commit a685c95 into zalando:master Sep 18, 2023
6 checks passed
@universam1 universam1 deleted the promRoutesHealth branch September 18, 2023 11:37
@universam1
Copy link
Contributor Author

Thank you @AlexanderYastrebov @szuecs @RomanZavodskikh for quick release!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants