-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MGDAPI-5085 - Add alerting for MCG on GCP #3075
MGDAPI-5085 - Add alerting for MCG on GCP #3075
Conversation
4dd146d
to
0451230
Compare
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## mgdapi-3425-gcp #3075 +/- ##
================================================
Coverage 72.56% 72.57%
================================================
Files 104 104
Lines 29241 29247 +6
================================================
+ Hits 21219 21225 +6
Misses 7281 7281
Partials 741 741
|
01d3a57
to
ea94686
Compare
ea94686
to
1ac6788
Compare
e2e flaky - clusters failed to provision |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I followed the verification instructions and the alerts worked as expected.
- NooBaa and MCG alerts were created and did not fire on fresh RHOAM install
- After scaling down the required deployments, statefulsets and deleting a pod the alerts below started firing
- Finally, scaling up the deployments and statefulsets resolved the previously firing alerts
/test rhoam-e2e |
e2e failed to provision cluster... |
Thank you for addressing the changes I suggested, @adam-cattermole! I'm happy to approve the pull request now 👍🏻 |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: KevFan The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/test multitenant-rhoam-e2e |
Issue link
MGDAPI-5085
What
Add alerts for various MCG resources - requires rebase once #3052 is merged.
Verification steps
Provision a GCP CCS cluster - if you do not have an access key, request one from myself - place in root of delorean repo and run:
Once provisioned we can deploy RHOAM from this branch:
Navigate to alerting in
redhat-rhoam-observability
namespace:Networking -> Routes -> Prometheus (location) -> Alerts
Verify that the alerts are listed and are not firing.
RHOAMMCGOperatorMetricsServiceEndpointDown
RHOAMMCGOperatorRhmiRegistryCsServiceEndpointDown
NooBaaCorePod
NooBaaDBPod
NooBaaDefaultBackingStorePod
NooBaaEndpointPod
NooBaaS3Endpoint
NooBaaBucketCapacityOver85Percent
NooBaaBucketCapacityOver95Percent
Scale down:
Delete:
After a few minutes, the following alerts should be firing:
RHOAMMCGOperatorMetricsServiceEndpointDown
NooBaaCorePod
NooBaaDBPod
NooBaaDefaultBackingStorePod
NooBaaEndpointPod
NooBaaS3Endpoint
Scaling the all of the deployments back up again should result in the alerts to stop.