Unique job keys never disappear #161

Closed
kpheasey opened this issue Feb 4, 2016 · 14 comments

Comments

@kpheasey

kpheasey commented Feb 4, 2016

I'm using unique jobs with Sidekiq Pro (reliability enabled) and ActiveJob, and have set up my initializer according to the README.

Most of my jobs are created with Sidekiq; however, a large portion comes from ActiveJob.

The unique job keys are not removed from Redis.

Notice the large memory usage in the screenshot below; there are no jobs in the queue.

[Screenshot: Redis memory usage, 2016-02-04 9:35 AM]

Looking through the keys in Redis, I see a lot of:

my_namespace:uniquejobs:c6af61b6571bc422a78f6f3d043a2635
my_namespace:uniquejobs:38cfda2fd8f924da936175dde391dd3d
my_namespace:uniquejobs:4bff3c9a5b97670209a509c6d5dd95a5
my_namespace:uniquejobs:38cfda2fd8f924da936175dde391dd3d
@kpheasey
Author

kpheasey commented Feb 4, 2016

After looking further, I discovered that it's the uniquejobs hash in Redis that takes up the majority of the memory usage.

@mhenrixon
Owner

> For a quick fix, I've implemented a

A what now? :) Did you see you can clear the jobs by command line or console?

@kpheasey
Author

kpheasey commented Feb 4, 2016

Sorry, didn't finish my thought.

I believe the problem is because of Sidekiq Pro reliability and an autoscaling environment.

Reliability puts jobs in private queues. When the job is executed, the unique job is unlocked. However, when autoscaling brings down servers, the Sidekiq process disappears and the private queue is not processed. To get around that, there is a second process that finds old private queues and puts the jobs back into the main queue with RPOPLPUSH. Finally, when the job is executed by another Sidekiq process, the unique job is not unlocked.

I believe there may need to be another unique: :until_* lock type, something like :until_reliably_queued. This would unlock the unique job when the job has been moved to a private queue. However, this seems very environment- and application-specific, so I will build it on my own.

Closing the issue. Let me know if you want me to make a pull request.

@kpheasey kpheasey closed this as completed Feb 4, 2016
@warmwaffles

@kpheasey I'm interested in the solution you came up with. We are experiencing similar issues.

@mhenrixon
Owner

While waiting for a fix, you can use the console or command-line app to clear the keys.
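For reference, a minimal sketch of what clearing those keys looks like with plain redis-rb, assuming the my_namespace:uniquejobs:* pattern from the report above; the gem's own console/command-line helpers do the same thing, so treat this only as an illustration:

```ruby
# Sketch: delete stale sidekiq-unique-jobs lock keys directly from Redis.
# The key pattern and namespace are taken from the report above; adjust to taste.
require "redis"

redis = Redis.new # use the same connection settings as Sidekiq

cursor = "0"
loop do
  cursor, keys = redis.scan(cursor, match: "my_namespace:uniquejobs:*", count: 1000)
  redis.del(*keys) unless keys.empty?
  break if cursor == "0"
end

# The report also mentions a large "uniquejobs" hash; it can be dropped too:
redis.del("my_namespace:uniquejobs")
```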

@warmwaffles

Thanks @mhenrixon, if a fix is found, can you post here as well?

@mhenrixon
Owner

Sure thing! The only solution that I can see is if I could get access to the Pro source code to see how to hook into it, but I guess @mperham wouldn't just hand that out, so my hands are a little tied on that matter.

@ropiku

ropiku commented Feb 10, 2016

@mhenrixon If you bought Pro, then you have access to it: bundle show sidekiq-pro.

@mperham

mperham commented Feb 10, 2016

The unique jobs implementation in Sidekiq Enterprise requires a TTL to ensure data is expired quickly for exactly this reason: sidekiq_options unique_for: 10.minutes

https://github.com/mperham/sidekiq/wiki/Ent-Unique-Jobs#use

I'm not sure I understand the issue well enough to advise you. There is nothing to hook into with reliable enqueuing, since it uses a single atomic Redis command, RPOPLPUSH.
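To illustrate the Enterprise option mperham mentions, a worker would look roughly like this (CalculationWorker is a made-up name; the unique_for option is from the wiki page linked above, and 10.minutes assumes ActiveSupport is loaded):

```ruby
# Sidekiq Enterprise unique jobs: every lock carries a TTL, so a lock that is
# never explicitly released still expires on its own after unique_for elapses.
class CalculationWorker
  include Sidekiq::Worker
  sidekiq_options unique_for: 10.minutes

  def perform(record_id)
    # ... recalculate data for record_id ...
  end
end
```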

@mhenrixon
Owner

@ropiku I am not buying an Enterprise license to maintain an open source gem. That economy doesn't make sense to me, but if you buy a license for us, by all means go ahead :)

Thanks @mperham!

@warmwaffles @kpheasey would sidekiq_options unique_for: 10.minutes help you at all?

@ropiku

ropiku commented Feb 10, 2016

Sorry, I mistook you for kpheasey, who said he is running Pro. Will give this a try.

@warmwaffles

> would sidekiq_options unique_for: 10.minutes help you at all?

Unfortunately, we only have Pro, so I don't think uniqueness comes with that gem.

@kpheasey
Author

@warmwaffles I haven't gotten a chance to implement the solution yet. I believe it's possible to override a method, Sidekiq::Pro::ReliableFetch.retrieve_work() or Sidekiq::Pro::ReliableFetch::Retriever, to unlock the job at that point in time, similar to the overrides here: https://github.com/mhenrixon/sidekiq-unique-jobs/blob/9f184aacebe2d9395eef3c0ca84f89f07972c2e1/lib/sidekiq_unique_jobs/sidekiq_unique_ext.rb
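A very rough sketch of the override described above, assuming retrieve_work returns a unit of work carrying the raw job payload; Sidekiq Pro is closed source, so the class, the method signature, and the unlock helper shown here are all assumptions that would need to be checked against the installed versions:

```ruby
# Hypothetical sketch: release the unique lock as soon as reliable fetch hands
# the job to a worker, instead of waiting for the job to finish.
require "json"

module UnlockOnReliableFetch
  def retrieve_work
    work = super
    if work
      # Assumption: work.job is the raw JSON payload of the fetched job.
      payload = JSON.parse(work.job)
      # Assumption: a helper exists (or is written) that removes the
      # uniquejobs lock for this payload's digest.
      SidekiqUniqueJobs.unlock(payload)
    end
    work
  end
end

if defined?(Sidekiq::Pro::ReliableFetch)
  Sidekiq::Pro::ReliableFetch.prepend(UnlockOnReliableFetch)
end
```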

@kpheasey
Author

@warmwaffles I don't currently have the resources to look into implementing a new lock type that would unlock jobs when they have been fetched into a private queue.

We have a lot of pre-calculated data that relies on outside sources which are constantly changing. Previously we created a job to re-calculate data when the outside data changed.

Our solution involved creating a small, non-unique job to mark the calculations as expired. Then a second, unique job re-calculates them. This ensures that we correctly expire the cache and know what calculations are needed, even if the unique job is not unlocked correctly, because the state is saved to the database.
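A sketch of that two-job split with made-up class and model names (ExpireCalculationJob, RecalculateJob, Calculation); the unique: :until_executed option follows the sidekiq-unique-jobs style of the time and may differ by version:

```ruby
# Small, non-unique job: only flags the cached calculation as stale,
# then kicks off the unique recalculation job.
class ExpireCalculationJob
  include Sidekiq::Worker

  def perform(record_id)
    Calculation.where(record_id: record_id).update_all(expired: true)
    RecalculateJob.perform_async(record_id)
  end
end

# Unique job: recalculates whatever is currently flagged as expired.
# Even if this job's lock is never released, the expired flag in the
# database still records which calculations need work.
class RecalculateJob
  include Sidekiq::Worker
  sidekiq_options unique: :until_executed

  def perform(record_id)
    Calculation.where(record_id: record_id, expired: true).find_each(&:recalculate!)
  end
end
```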

Since the problem only happens after the environment has been scaled down, we clear the unique job locks when we merge the stale private queues. Here's a gist with our rake task for doing so: https://gist.github.com/kpheasey/9c9255c4ce20beeabde0
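The gist has the real task; below is only a hedged outline of the approach it describes (drain each stale private queue back onto the main queue with RPOPLPUSH, then drop the leftover lock keys). The private-queue key pattern and the destination queue are assumptions and depend on the Sidekiq Pro version:

```ruby
# Rough outline, not the actual gist: requeue jobs from stale private queues
# and clear any unique-job lock keys left behind.
namespace :sidekiq do
  task recover_private_queues: :environment do
    Sidekiq.redis do |redis|
      # Assumption: Sidekiq Pro reliability keeps per-process private queues
      # matching this pattern; verify against your Redis before running.
      redis.keys("queue:*_private").each do |private_queue|
        destination = "queue:default" # assumption: everything returns to the default queue
        # Atomically move jobs back, one at a time, until the private queue is empty.
        loop do
          break unless redis.rpoplpush(private_queue, destination)
        end
      end

      # Drop the orphaned unique-job lock keys (see the earlier snippet).
      redis.keys("uniquejobs:*").each { |key| redis.del(key) }
    end
  end
end
```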
