keep_jobs integer intervals are too large #55295

Open · squidpickles opened this issue Nov 13, 2019 · 6 comments

Labels: Confirmed (Salt engineer has confirmed bug/feature - often including an MCVE), Pending-Discussion (The issue or pull request needs more discussion before it can be closed or merged)
Milestone: Blocked

@squidpickles (Contributor)

Description of Issue

For a large installation (>3000 minions) running frequent operations, the job cache grows quite large. In our case, we don't need a job cache beyond about 5 minutes. It would be helpful to be able to specify keep_jobs as a fraction of an hour, to keep the cache small. (We keep ours in RAM to reduce disk IO.)

Setup

1 master, 3000 minions

Steps to Reproduce Issue

Set keep_jobs: 1 and run test.ping every 60 seconds; the job cache grows to 10 GB over 24 hours.
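
For reference, a minimal sketch of the master settings involved. keep_jobs: 1 is what the reproduction uses; the commented-out fractional value only illustrates the behaviour this issue is asking for (it is not accepted by 2019.2), and cachedir is shown because the description mentions keeping the cache in RAM.

```yaml
# /etc/salt/master -- sketch of the options discussed in this issue

# Current behaviour: keep_jobs is a number of hours, so 1 (one hour)
# is the smallest retention window the job cache can have.
keep_jobs: 1

# Requested behaviour (not valid in 2019.2): accept a fraction of an
# hour, e.g. roughly 5 minutes.
# keep_jobs: 0.0833

# The job cache lives under cachedir; pointing this at a RAM-backed
# filesystem is how the reporter avoids the disk IO.
cachedir: /var/cache/salt/master
```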

Versions Report

Salt Version:
           Salt: 2019.2.2

Dependency Versions:
           cffi: Not Installed
       cherrypy: Not Installed
       dateutil: 2.6.1
      docker-py: Not Installed
          gitdb: 2.0.3
      gitpython: 2.1.8
          ioflo: Not Installed
         Jinja2: 2.10
        libgit2: 0.26.0
        libnacl: Not Installed
       M2Crypto: Not Installed
           Mako: Not Installed
   msgpack-pure: Not Installed
 msgpack-python: 0.5.6
   mysql-python: Not Installed
      pycparser: Not Installed
       pycrypto: 2.6.1
   pycryptodome: Not Installed
         pygit2: 0.26.2
         Python: 3.6.8 (default, Oct  7 2019, 12:59:55)
   python-gnupg: 0.4.1
         PyYAML: 3.12
          PyZMQ: 16.0.2
           RAET: Not Installed
          smmap: 2.0.3
        timelib: Not Installed
        Tornado: 4.5.3
            ZMQ: 4.2.5

System Versions:
           dist: Ubuntu 18.04 bionic
         locale: UTF-8
        machine: x86_64
        release: 4.15.0-66-generic
         system: Linux
        version: Ubuntu 18.04 bionic
@xeacott (Contributor) commented Nov 14, 2019

Thanks for submitting this ticket as well as getting a PR together. Pinging @saltstack/team-core about this one so we can review the PR and address this. 😄

@xeacott added the fixed-pls-verify (fix is linked, bug author to confirm fix) and Pending-Discussion (The issue or pull request needs more discussion before it can be closed or merged) labels on Nov 14, 2019
@xeacott added this to the Approved milestone on Nov 14, 2019
@dwoz (Contributor) commented Nov 14, 2019

@squidpickles I'm wondering if there is not a more elegant way of accomplishing what you are trying to do. What is the motivation for running test.ping every 60 seconds?

@squidpickles (Contributor, Author)

We've noticed a number of hosts lose Salt connectivity despite being reachable via SSH, and this occasionally predicts localized network outages. Our systems team wrote a check for the monitoring system that uses test.ping to verify which hosts are responsive.
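
For context, a periodic master-side ping like the one described above could be wired up roughly as follows. This is a hypothetical sketch: it assumes the master scheduler is used with the salt.execute runner, and none of the names or values here come from the reporter's actual monitoring setup.

```yaml
# /etc/salt/master -- hypothetical 60-second connectivity check
# (illustrative sketch only, not the reporter's monitoring config)
schedule:
  connectivity_check:
    function: salt.execute   # runner that publishes a job to the targeted minions
    seconds: 60
    args:
      - '*'                  # target every minion
      - test.ping            # each run adds ~3000 returns to the job cache
```

Every run leaves a return from each minion in the job cache, which is why a one-hour floor on keep_jobs translates into gigabytes of cached results.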

stale bot commented Jan 7, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

If this issue is closed prematurely, please leave a comment and we will gladly reopen the issue.

stale bot added the stale label on Jan 7, 2020
@sagetherage added the Confirmed (Salt engineer has confirmed bug/feature - often including an MCVE) label on Jan 9, 2020
stale bot commented Jan 9, 2020

Thank you for updating this issue. It is no longer marked as stale.

stale bot removed the stale label on Jan 9, 2020
@sagetherage (Contributor)

@dwoz can you take a look at this one, please?

@sagetherage modified the milestones: Approved → Blocked, on Mar 26, 2020
@sagetherage removed the fixed-pls-verify (fix is linked, bug author to confirm fix) label on Mar 26, 2020

Projects: none yet
Development: no branches or pull requests
4 participants