Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement retry functionality? #3

Open
atc0005 opened this issue Dec 10, 2021 · 1 comment
Open

Implement retry functionality? #3

atc0005 opened this issue Dec 10, 2021 · 1 comment
Assignees
Labels
app/check_statuspage_components config documentation Improvements or additions to documentation enhancement New feature or request
Milestone

Comments

@atc0005
Copy link
Owner

atc0005 commented Dec 10, 2021

Similar to what we're doing in the go-teams-notify project, allow user-specified number of retries using a user-specified delay between retry attempts.

It will be important to get the default values setup so that the total attempts falls under the default plugin timeout value used by Nagios, Icinga, etc.

@atc0005 atc0005 added the enhancement New feature or request label Dec 10, 2021
@atc0005 atc0005 added this to the Future milestone Dec 10, 2021
@atc0005 atc0005 self-assigned this Dec 10, 2021
@atc0005 atc0005 added documentation Improvements or additions to documentation config app/check_statuspage_components labels Aug 21, 2023
@atc0005
Copy link
Owner Author

atc0005 commented Aug 21, 2023

With the DUO auth issues today (https://status.duo.com/incidents/rw7g0q7ztj8f) I'm seeing a lot of flapping between UNKNOWN and WARNING states.

The UNKNOWN state notifications have this output:

Info:

UNKNOWN: Failed to process JSON feed from URL: failed to decode JSON data

Additional Info:

ERRORS

The upstream https://metastatuspage.com/ Statuspage dashboard does not indicate any problems, so without digging deeper it is unclear whether it is an issue with the Nagios "console" or timeouts for the status.duo.com site/API.

Not longer after I filed this GH issue I realized that implementing a retry directly in the plugin was perhaps not needed. We could instead set the retry behavior within the service check before declaring a state change.

I'm divided on that point; added some additional context as noted above for further consideration.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
app/check_statuspage_components config documentation Improvements or additions to documentation enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant