Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add WACZ archives to collections #799

Closed
kuechensofa opened this issue Jan 19, 2023 · 1 comment
Closed

Add WACZ archives to collections #799

kuechensofa opened this issue Jan 19, 2023 · 1 comment

Comments

@kuechensofa
Copy link
Contributor

Is your feature request related to a problem? Please describe.

When receiving a web archive in WACZ format it currently isn't possible to add it to a collection using wb-manager.

Describe the solution you'd like

It should be possible to use wb-manager add COLLECTION WACZ-FILE to add the contents of a WACZ archive to a collection. wb-manager should unpack the WARC files of the archive and the indexes to a temporary directory, move the extracted WARC files to the collection and then add the WACZ indices to the collection index.

Describe alternatives you've considered

Currently it is possible to add the contents of a WACZ archive to a collection by unpacking the WACZ archive and manually adding its contents but it's a very manual process.

kuechensofa added a commit to kuechensofa/pywb that referenced this issue Jan 19, 2023
Add wacz archives to collections using wb-manager
tw4l pushed a commit that referenced this issue Feb 15, 2023
…wacz (#800)

Add WACZ support for `wb-manager add` by unpacking WACZ files with --uncompress-wacz.

A future commit will add pywb support for WACZ files without requiring them to be unpacked.
@tw4l
Copy link
Member

tw4l commented Apr 24, 2024

Hi @kuechensofa , sorry it took so long but this is finally being released in pywb 2.8.0 today! We've renamed the argument to wb-manager add --unpack-wacz but otherwise it's the same :) We're not planning WACZ support without unpacking in 2.9.

Thanks so much for this great contribution!

@tw4l tw4l closed this as completed Apr 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants