Slack Receives Incorrect or Duplicate Alerts from Jenkins #18198

Closed
2 tasks
Tracked by #18129
7hunderbird opened this issue May 22, 2024 · 2 comments
Labels: CMS Team (CMS Product team that manages both editor exp and devops), Needs refining (Issue status)

7hunderbird commented May 22, 2024

User Story or Problem Statement

Today we've noticed that there are distracting and inaccurate alerts coming into the #cms-notifications Slack channel:

  1. Alerts for jobs that don't exist.
  2. Duplicate alerts.

Alerts for jobs that don't exist

[Screenshot: CleanShot 2024-05-22 at 13 37 41]

When this issue emerged, the most recent build in the cms-db-sanitize job was number 103121.

As you can see in the figure above, the errors linked to build number 103187. Each of these build numbers was about 60 builds ahead of the most recent build and led to a 404 (page not found) because the build had not run yet.

When we looked at the live status of this job, it was succeeding, so the main issue here is the noise and distraction that these alerts cause. 😢

Duplicate alerts

At first the notification that the prod deploy was about to go out was duplicated.

[Screenshot: CleanShot 2024-05-22 at 13 37 15]

Then, when the prod deploy had determined that there was no new version to ship out, the "ABORT" notification was ALSO duplicated, which is why I created this ticket.

[Screenshot: CleanShot 2024-05-22 at 13 42 39]

Not only did we get duplicate errors from the prod deploy "ABORT" notification, but the first one also had the problem of linking to the wrong job that was somehow in the future.

Description or Additional Context

Steps for Implementation

Acceptance Criteria

  • There are no more duplicate alerts coming into the #cms-notifications Slack channel
  • There are no alerts for non-existent jobs
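
As a rough illustration of how the two criteria above might be checked, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the `Alert` record, the `audit_alerts` helper, the `prod-deploy` job name, and the idea that alerts and latest build numbers are already available as plain data. It does not use the Jenkins or Slack APIs. An alert counts as a duplicate if the same job already sent the same message, and as non-existent if its build number is higher than the latest build known for that job.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alert:
    """Hypothetical record of one Slack alert (not a real Jenkins/Slack type)."""
    job: str
    build_number: int
    message: str


def audit_alerts(alerts, latest_builds):
    """Split alerts into duplicates and alerts for builds that do not exist yet.

    latest_builds maps a job name to its most recent real build number.
    Duplicates are keyed on (job, message) only, because a duplicated alert
    may carry a different (wrong) build number than the original.
    """
    seen = set()
    duplicates = []
    nonexistent = []
    for alert in alerts:
        key = (alert.job, alert.message)
        if key in seen:
            duplicates.append(alert)
        seen.add(key)
        # A build number beyond the latest known build would link to a 404.
        if alert.build_number > latest_builds.get(alert.job, 0):
            nonexistent.append(alert)
    return duplicates, nonexistent


# Example data: build numbers for cms-db-sanitize are taken from this issue;
# the prod-deploy job name and its build numbers are made up.
alerts = [
    Alert("cms-db-sanitize", 103187, "failed"),
    Alert("prod-deploy", 10, "ABORT"),
    Alert("prod-deploy", 10, "ABORT"),
]
latest_builds = {"cms-db-sanitize": 103121, "prod-deploy": 10}

duplicates, nonexistent = audit_alerts(alerts, latest_builds)
```

Both returned lists would need to be empty for the acceptance criteria to hold over the sampled alerts.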
7hunderbird added the "Needs refining" (Issue status) label May 22, 2024
gracekretschmer-metrostar added the "CMS Team" (CMS Product team that manages both editor exp and devops) label May 22, 2024

7hunderbird (author) commented

A search in Slack shows a period around April 17th and 18th with repeated "cms/cms-db-sanitize failed" errors.

[Screenshot: CleanShot 2024-05-22 at 13 52 24]

7hunderbird (author) commented

With the other Jenkins server turned off, the duplicate alerts from Jenkins are gone and this is resolved.
