Backport to 2 6 remove collection size feature hyrax 4207 #4237

Merged

Conversation

fritzfreiheit
Contributor

Backport of #4207 to v2.6.* / 2.x-stable

See: #4207

@samvera/hyrax-code-reviewers

mjgiarlo and others added 2 commits January 30, 2020 15:32
While looking into #4100, we noticed that `CharacterizeJob` not only saves a file set's `original_file`, but also reindexes the file set, *and* reindexes every collection to which the file set's work belongs. It appears that this feature was added way back in the Sufia days in order to support the use case of displaying on the collection show page the total size of all files uploaded to a collection's works.

Why is this problematic?

At large scale, this means touching a potentially huge portion of the repository—iterating over all files in all works in a collection—in order to display an arguably useful string to a subset of repository users who care about such information. Doing this in `CharacterizeJob` introduces significant potential delay in the ingest pipeline, increasing how long it takes for a deposit to finish and render in the Hyrax UI (e.g., by delaying the creation of derivatives).

We propose removing this feature for now and encourage folks who need this feature to contribute it back with a better-performing, more streamlined design.
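The scaling concern in the commit message above can be illustrated with a rough sketch in plain Ruby. It does not use Hyrax or its actual indexing code: `FileSetStub`, `WorkStub`, `CollectionStub`, `collection_size`, and `visits_when_recomputing_per_file` are all hypothetical names invented for this illustration. The sketch only assumes that the collection total is recomputed by walking every file once for each file that gets characterized.

```ruby
# Hypothetical stand-ins for repository objects; these are NOT Hyrax classes.
FileSetStub    = Struct.new(:id, :size_in_bytes)
WorkStub       = Struct.new(:id, :file_sets)
CollectionStub = Struct.new(:id, :works)

# Recomputes a collection's total size by visiting every file in every work.
# Returns [total_bytes, files_visited].
def collection_size(collection)
  visited = 0
  total = collection.works.sum do |work|
    work.file_sets.sum do |file_set|
      visited += 1
      file_set.size_in_bytes
    end
  end
  [total, visited]
end

# Models the assumed old behavior: each characterized file triggers a full
# recomputation of the collection size. Returns the total number of file
# visits across all of those recomputations.
def visits_when_recomputing_per_file(collection)
  file_count = collection.works.sum { |work| work.file_sets.size }
  (1..file_count).sum { collection_size(collection).last }
end

# A small collection: 3 works with 4 files of 10 bytes each.
works = Array.new(3) do |w|
  files = Array.new(4) { |f| FileSetStub.new("w#{w}-f#{f}", 10) }
  WorkStub.new("w#{w}", files)
end
collection = CollectionStub.new("c1", works)

total, visited = collection_size(collection)
puts "total bytes: #{total}, files visited: #{visited}"
puts "visits when recomputing per file: #{visits_when_recomputing_per_file(collection)}"
```

Under these assumptions the work grows with the square of the number of files in the collection, which is why doing it inside an ingest-time job becomes expensive for large collections.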
Contributor

jeremyf commented Jan 31, 2020

@fritzfreiheit thanks for the work on this. One question I have is whether this should be pushed to 2.x-stable. My understanding is that we want a branch that starts at tag v2.6.0 (which I recently made as 2.6-stable), and that we would then open a PR against the 2.6-stable branch. @no-reply does that match your understanding? If so, do we have documentation that covers this? If not, I'm more than happy to write up procedures for backporting.

@jeremyf jeremyf merged commit 97cf1e5 into 2.x-stable Jan 31, 2020
@jeremyf jeremyf deleted the backport-to-2-6-remove-collection-size-feature-hyrax-4207 branch January 31, 2020 16:29