
fix ExpiringDict caching solution #1200

Merged
ccostino merged 7 commits into main from grrr on Jul 22, 2024
Conversation

@terrazoon (Contributor) commented on Jul 19, 2024

Description

For various reasons we are currently forced to use an in-memory cache to store the job information used to populate the reports. Although we increased the cache time to 7 days and the maximum number of objects to 20,000, there are still intermittent problems retrieving the data being reported.
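
For context, a minimal sketch of the kind of cache configuration described above (illustrative only; the variable name is not taken from this repo's code):

```python
from expiringdict import ExpiringDict

# Illustrative only: an in-memory cache capped at 20,000 entries,
# each entry expiring after 7 days.
JOB_CACHE = ExpiringDict(max_len=20_000, max_age_seconds=7 * 24 * 60 * 60)
```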

We found two separate issues:

  1. Sometimes we cannot download a csv file (404) even though we know that csv was successfully uploaded and the message in the csv was successfully sent. This should not happen, and the reason it does is currently a mystery (it seems to happen on Fridays!). We are going to set this issue aside for the moment.

  2. Every time the app restarts, the in-memory cache is obviously wiped out, and it turns out the app restarts more frequently, with more dire consequences for report generation, than we anticipated. In addition, no in-memory solution can be shared across processes, so we will need a separate cache for every gunicorn worker.

So the solution proposed here is to run a 'repair' task periodically that checks the cache to make sure it contains all jobs currently in the bucket.

NOTE: The 'best practice' here would be to use redis or memcached, but until or unless we decide to go that route, an ExpiringDict that we rebuild on a schedule seems like a workable solution. It potentially requires more memory (although how much memory one instance of the cache uses is unknown, and probably not much).
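
As a rough illustration of that idea (a sketch only, not the actual implementation in app/aws/s3.py; the bucket name and the way job data is read back are placeholders), the scheduled repair task could look something like this:

```python
import boto3

s3 = boto3.client("s3")


def regenerate_job_cache(cache, bucket="example-csv-upload-bucket"):
    """Scheduled 'repair' task (sketch): make sure every job object in the
    bucket has an entry in the in-memory cache (e.g. the ExpiringDict above),
    repopulating anything that was lost when the app restarted."""
    paginator = s3.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket):
        for obj in page.get("Contents", []):
            key = obj["Key"]
            if key not in cache:
                # Cache miss: reload the object from the bucket.
                body = s3.get_object(Bucket=bucket, Key=key)["Body"]
                cache[key] = body.read().decode("utf-8")
```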

Security Considerations

N/A

@terrazoon self-assigned this Jul 19, 2024
@terrazoon marked this pull request as draft July 19, 2024 21:40
@terrazoon marked this pull request as ready for review July 22, 2024 14:21
@terrazoon requested review from ccostino and a team July 22, 2024 14:24
@xlorepdarkhelm (Contributor) left a comment


See my question below.

app/aws/s3.py (review thread, resolved)
app/aws/s3.py (review thread, outdated, resolved)
@xlorepdarkhelm (Contributor) left a comment


LGTM

@ccostino (Contributor) left a comment


Thanks, @terrazoon!

Let's see if this helps alleviate some of the pain at least and buys us more time and space to figure out our next steps with all of this.

@ccostino merged commit 26c6d39 into main on Jul 22, 2024
7 checks passed
@ccostino deleted the grrr branch July 22, 2024 21:15