
Suppress errors while RQ is retrying #1074

Closed

BobReid opened this issue Apr 8, 2021 · 7 comments · Fixed by #1076
Comments

@BobReid
Contributor

BobReid commented Apr 8, 2021

I am using RQ as my job queue. In some of my use cases, retries and optimistic locking are used to smooth out contentious writes to the database and out-of-order execution.

This results in a lot of false positives in Sentry. These issues fix themselves after a retry, so I do not want them reported to Sentry until all retries are exhausted. Currently there is no way to suppress the exception, due to the way the Sentry integration monkey-patches RQ: you need to inspect the RQ job instance's status to determine whether or not retries have been exhausted.
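For context, a minimal sketch of that pattern using RQ's built-in `Retry` (the job function and its arguments here are hypothetical):

```python
from redis import Redis
from rq import Queue, Retry

def write_record(record_id):
    # Hypothetical job: raises on an optimistic-lock conflict and
    # succeeds on a later attempt once the contention clears.
    ...

queue = Queue(connection=Redis())

# RQ re-runs the job up to three times, 10 seconds apart, before
# marking it as failed; only that final failure should reach Sentry.
queue.enqueue(write_record, 42, retry=Retry(max=3, interval=10))
```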

I am happy to contribute the change if it has a chance of being accepted.

I see three possible solutions:

  1. Add a setting to the RQ integration along the lines of failed_jobs_only. This setting would default to false to maintain backwards compatibility.
  2. Provide a hook mechanism that passes the job in and lets users decide whether or not to capture the event.
  3. Stuff some job information, such as the status, into the hint so it can be inspected and filtered via the standard before_send hook.

IMO 1 or 3 is the way to go: 1 is the simplest, and 3 is the most flexible. I am not sure whether stuffing arbitrary data into the hint like this is desirable from Sentry's standpoint.
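To make option 3 concrete, a sketch of the user-side filter it would enable, assuming the integration added a hypothetical `rq_job` entry to the hint (no such key exists today):

```python
import sentry_sdk

def before_send(event, hint):
    # "rq_job" is a hypothetical hint key for this proposal; the
    # current integration does not provide it.
    job = hint.get("rq_job")
    if job is not None and not job.is_failed:
        return None  # retries remain, so drop the event
    return event

sentry_sdk.init(dsn="...", before_send=before_send)
```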

@untitaker
Member

The Celery integration does not report these kinds of exceptions, so for the sake of consistency I wouldn't do it for RQ either (regardless of whether the behavior is controllable via an option).

@BobReid
Contributor Author

BobReid commented Apr 8, 2021

@untitaker

I have thrown together a quick PR. It can be as simple as checking whether the job has failed. Let me know your thoughts and I can write a test for it.

#1076
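For anyone skimming, a sketch of the kind of check described, built on RQ's `Job.is_failed` property (the helpers below are placeholders, not the PR's exact code):

```python
def _capture_exception(exc_info):
    ...  # placeholder for the SDK's internal capture helper

def old_handle_exception(worker, job, *exc_info, **kwargs):
    ...  # placeholder for the original RQ exception handler being wrapped

def sentry_patched_handle_exception(worker, job, *exc_info, **kwargs):
    # job.is_failed only becomes True once RQ has exhausted all retries,
    # so exceptions raised by intermediate attempts are not captured.
    if job.is_failed:
        _capture_exception(exc_info)
    return old_handle_exception(worker, job, *exc_info, **kwargs)
```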

@BobReid
Contributor Author

BobReid commented Apr 13, 2021

I have added a test to my PR. I had to add an ignore_logger call to the RQ integration; otherwise the exception was still captured by the logging integration.
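For readers following along, `ignore_logger` is the SDK helper referred to above. Roughly, it is used like this; the logger name shown is an assumption, so verify it against your RQ version:

```python
from sentry_sdk.integrations.logging import ignore_logger

# Keep the logging integration from also capturing the exception that
# RQ's worker logs on every failed attempt. "rq.worker" is assumed to
# be the logger RQ uses for this.
ignore_logger("rq.worker")
```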

@BobReid
Contributor Author

BobReid commented May 3, 2021

@untitaker any thoughts on the resolution I have proposed?

This is causing unnecessary disruptions for my team. I would prefer to fix this upstream in Sentry, but if not, we will have to look at a way to patch it in our own system.

@untitaker
Member

@BobReid CI is not green, so I have restarted it.

I think the fix is fine (and I really appreciate it; I will make sure to get it merged this week, sorry for the delay!), but in general I would always find a way to unblock yourself rather than depend on getting OSS contributions merged. If you need to escalate an issue, I would go through Sentry support, because support tickets actually have SLOs and resources attached.

@BobReid
Contributor Author

BobReid commented May 3, 2021

@untitaker I understand things won't always get fixed in a timely manner. In this case, it meant the team would not be able to rely on retries for smoothing out-of-order execution, which is a tool I want to be able to employ when necessary.

There didn't seem to be anything we could hook into in order to suppress the event. The problem is that the standard before_send hook did not have any job context to inspect to determine whether the event should be suppressed.

The only options left to explore were abandoning retries completely, or monkey-patching Sentry to apply my fix from our own code base. Both were less than ideal.

I appreciate you taking another look at this. Thanks again.

@vaal-

vaal- commented Dec 21, 2021

Hello,

I have the opposite situation: for me it would be more convenient if all retries were logged. So I wanted to ask: if I make a PR adding an option that enables this logging, would such a change be accepted? Or is it a matter of principle that these errors should not be logged?
