Status page for *.decred.org #1163

Open
peterzen opened this issue Jan 9, 2024 · 7 comments

Comments

@peterzen
Member

peterzen commented Jan 9, 2024

It would be great to set up a status page that monitors the health of:

https://decred.org
https://matrix.decred.org
https://dex.decred.org
... others?

It should monitor:

  • port availability
  • HTTP return code == 200
  • certificate validity (with a warning when a cert is about to expire)
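
A minimal Go sketch of those three checks, for illustration only. The names `checkHost`, `certStatus` and `Result`, the use of port 443, and the 14-day warning window are all assumptions made for this example and are not taken from any existing Decred code.

```go
package main

import (
	"fmt"
	"net"
	"net/http"
	"time"
)

// Result holds the outcome of the checks for one host (hypothetical type).
type Result struct {
	Host       string
	PortOpen   bool
	HTTPStatus int
	CertStatus string
}

// certStatus classifies a certificate by the time remaining until it expires.
func certStatus(remaining, warnWindow time.Duration) string {
	switch {
	case remaining <= 0:
		return "expired"
	case remaining <= warnWindow:
		return "warning"
	default:
		return "ok"
	}
}

// checkHost probes TCP port 443, requests https://host/ and inspects the
// leaf certificate presented on that connection.
func checkHost(host string, timeout, warnWindow time.Duration) (Result, error) {
	res := Result{Host: host}

	// 1. Port availability: a plain TCP connect.
	conn, err := net.DialTimeout("tcp", net.JoinHostPort(host, "443"), timeout)
	if err != nil {
		return res, fmt.Errorf("port check: %w", err)
	}
	conn.Close()
	res.PortOpen = true

	// 2. HTTP status. The client follows redirects, so this is the status of
	// the final response. An already expired certificate fails here with a
	// verification error.
	client := &http.Client{Timeout: timeout}
	resp, err := client.Get("https://" + host + "/")
	if err != nil {
		return res, fmt.Errorf("http check: %w", err)
	}
	defer resp.Body.Close()
	res.HTTPStatus = resp.StatusCode

	// 3. Certificate expiry, read from the TLS state of the response.
	if resp.TLS != nil && len(resp.TLS.PeerCertificates) > 0 {
		leaf := resp.TLS.PeerCertificates[0]
		res.CertStatus = certStatus(time.Until(leaf.NotAfter), warnWindow)
	}
	return res, nil
}

func main() {
	hosts := []string{"decred.org", "matrix.decred.org", "dex.decred.org"}
	for _, h := range hosts {
		res, err := checkHost(h, 10*time.Second, 14*24*time.Hour)
		if err != nil {
			fmt.Printf("%s: DOWN (%v)\n", h, err)
			continue
		}
		fmt.Printf("%s: port=%v http=%d cert=%s\n", h, res.PortOpen, res.HTTPStatus, res.CertStatus)
	}
}
```

A host would count as healthy when `PortOpen` is true, `HTTPStatus` is 200 and `CertStatus` is "ok".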
@xaur
Contributor

xaur commented Jan 9, 2024

Also:

  • https://dcrdata.decred.org
  • https://api.decred.org (for the VSP API)

Maybe the monitoring code could also keep an eye on all the VSP hosts (14 currently).

But this might be quite a bit of work. Is there existing easy to use Go code that you just feed a list of hosts and it does the job?
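
For a sense of scale, feeding a list of hosts to a checker can be sketched in a few lines of Go. This is only an illustration: `checkAll`, `CheckFunc` and `tcpCheck` are hypothetical names, and the check itself (a TCP connect to port 443) is an assumed stand-in for whatever health check is actually wanted.

```go
package main

import (
	"fmt"
	"net"
	"sync"
	"time"
)

// CheckFunc probes one host and reports whether it looks healthy.
type CheckFunc func(host string) bool

// checkAll runs check against every host concurrently and returns a map
// from host to result.
func checkAll(hosts []string, check CheckFunc) map[string]bool {
	var (
		mu      sync.Mutex
		wg      sync.WaitGroup
		results = make(map[string]bool, len(hosts))
	)
	for _, h := range hosts {
		wg.Add(1)
		go func(host string) {
			defer wg.Done()
			ok := check(host)
			// The map is shared between goroutines, so guard the write.
			mu.Lock()
			results[host] = ok
			mu.Unlock()
		}(h)
	}
	wg.Wait()
	return results
}

// tcpCheck is one possible CheckFunc: a TCP connect to port 443 (assumed).
func tcpCheck(host string) bool {
	conn, err := net.DialTimeout("tcp", net.JoinHostPort(host, "443"), 5*time.Second)
	if err != nil {
		return false
	}
	conn.Close()
	return true
}

func main() {
	hosts := []string{"decred.org", "matrix.decred.org", "dex.decred.org"}
	for host, up := range checkAll(hosts, tcpCheck) {
		fmt.Printf("%s: up=%v\n", host, up)
	}
}
```

Because the check is passed in as a function, the same loop could cover the VSP hosts by swapping in a different `CheckFunc`.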

And should it really be in dcrweb or somewhere else?

@peterzen
Member Author

peterzen commented Jan 10, 2024

First stab at a status page, set up using Uptime Kuma.

Public status page:
https://is.decred.online/status/overview

For internal use:
https://is.decred.online/status/all
https://is.decred.online/status/certificates

Further plans:

  • Decred colors, branding
  • notifications pushed to Matrix
  • Maybe monitoring other aspects of the network, e.g. LN, VSP servers

Comments, suggestions welcome.

@peterzen
Member Author

> * https://dcrdata.decred.org
> * https://api.decred.org (for the [VSP API](https://github.com/decred/dcrwebapi))

Thanks, added to the PoC.

> Maybe the monitoring code could also keep an eye on all the VSP hosts (14 currently).

Good idea, I'll look into it.

> But this might be quite a bit of work. Is there existing easy to use Go code that you just feed a list of hosts and it does the job?

The PoC is built on Uptime Kuma, which is a decent platform for implementing this.

> And should it really be in dcrweb or somewhere else?

It has to be separate from, and not dependent in any way on, the systems it is monitoring, so that when there's an outage it can reliably detect it (and, ideally, send notifications).

@xaur
Contributor

xaur commented Jan 11, 2024

Ok thanks. I guess discussing this status page is fine here, since decred.org is the face of everything and when the status page is up we should definitely link to it from decred.org.

@matthawkins90

Amazing. I love Uptime Kuma. This should possibly go into a public repo that people can review/make PRs to.

Sidenote, why can't I visit https://api.decred.org/ even though Uptime Kuma says it's alive? I keep getting 404s in my browser.

@peterzen
Member Author

> Amazing. I love Uptime Kuma. This should possibly go into a public repo that people can review/make PRs to.

Yeah, if this gets adopted then the configuration/customizations would go in a repo.

> Sidenote, why can't I visit https://api.decred.org/ even though Uptime Kuma says it's alive? I keep getting 404s in my browser.

It's a JSON endpoint at https://api.decred.org/?c=vsp
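
A rough Go sketch of querying that endpoint, for illustration. It assumes the response is a JSON object whose top-level keys are the VSP hostnames; that shape is an assumption made here and should be checked against the dcrwebapi source. `vspHosts` and `fetchVSPHosts` are hypothetical helper names.

```go
package main

import (
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"sort"
	"time"
)

// vspHosts returns the sorted top-level keys of a JSON object.
// Assumption: the API response is an object keyed by VSP hostname.
func vspHosts(body []byte) ([]string, error) {
	var raw map[string]json.RawMessage
	if err := json.Unmarshal(body, &raw); err != nil {
		return nil, err
	}
	hosts := make([]string, 0, len(raw))
	for h := range raw {
		hosts = append(hosts, h)
	}
	sort.Strings(hosts)
	return hosts, nil
}

// fetchVSPHosts downloads the JSON document at url and extracts the hosts.
func fetchVSPHosts(url string) ([]string, error) {
	client := &http.Client{Timeout: 10 * time.Second}
	resp, err := client.Get(url)
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return nil, fmt.Errorf("unexpected status %s", resp.Status)
	}
	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return nil, err
	}
	return vspHosts(body)
}

func main() {
	hosts, err := fetchVSPHosts("https://api.decred.org/?c=vsp")
	if err != nil {
		fmt.Println("fetch failed:", err)
		return
	}
	for _, h := range hosts {
		fmt.Println(h)
	}
}
```

If the assumption about the response shape holds, the resulting host list could be fed straight into a monitor for the VSP servers.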

@xaur
Contributor

xaur commented Mar 3, 2024

@peterzen https://is.decred.online/ is down for me right now
