Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tracking issue: Improve parachain liveliness observability #196

Open
4 tasks
sandreim opened this issue Jan 6, 2023 · 0 comments
Open
4 tasks

Tracking issue: Improve parachain liveliness observability #196

sandreim opened this issue Jan 6, 2023 · 0 comments

Comments

@sandreim
Copy link
Collaborator

sandreim commented Jan 6, 2023

The Problem

We currently rely only on node side metrics to observe the liveliness of the network. Mainly, we look at things like parachain block times, approval checking, disputes and finality lag which we are able to determine only if we have access to metrics scraped from validators and collators. This works really nice for test networks where we manage both the validators and collators. We want to build some tooling to enable additional observability via RPC . This would not obsolete any of the node metrics, as we would still rely on those when debugging.

Plan

We need to implement tracking of all parachains in the parachain commander which is currently limited to only one parachain. This full tracking should be the deafult mode for running the tool in Prometheus mode while CLI remains unchanged to tracking one parachain.

Metrics

The following metrics will be computed from the parchain inherent data from each relay chain block:

  • parachain block times (measure in relay chain blocks)
  • relay chain block times (via inherent timestamps)
  • availability health (bitfield count and 1 bits when core occupied)
  • backing
  • DMP/UMP/HRMP throughput
  • dispute initiation and conclusion times
  • dispute initiation per validator (via address label)

Deployment

We want to deploy this for both prod and test networks: Kusama, Polkadot, Westend, Rococo and Versi.
Additionally to the implementation we need to create Grafana dashboards that will be available in the polkadot-introspector repo.

Milestones

  • all metrics implemented
  • dashboards created and published
  • deployed on Polkadot/Kusama/Westend/Rococo
  • alerting and paging configured

Project tracking board

https://github.com/orgs/paritytech/projects/70/views/1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: In Progress
Status: In progress
Development

No branches or pull requests

1 participant