-
-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add WACZ archives to collections #799
Comments
kuechensofa
added a commit
to kuechensofa/pywb
that referenced
this issue
Jan 19, 2023
Add wacz archives to collections using wb-manager
8 tasks
tw4l
pushed a commit
that referenced
this issue
Feb 15, 2023
…wacz (#800) Add WACZ support for `wb-manager add` by unpacking WACZ files with --uncompress-wacz. A future commit will add pywb support for WACZ files without requiring them to be unpacked.
This was referenced Apr 23, 2024
Hi @kuechensofa , sorry it took so long but this is finally being released in pywb 2.8.0 today! We've renamed the argument to Thanks so much for this great contribution! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Is your feature request related to a problem? Please describe.
When receiving a web archive in WACZ format it currently isn't possible to add it to a collection using
wb-manager
.Describe the solution you'd like
It should be possible to use
wb-manager add COLLECTION WACZ-FILE
to add the contents of a WACZ archive to a collection.wb-manager
should unpack the WARC files of the archive and the indexes to a temporary directory, move the extracted WARC files to the collection and then add the WACZ indices to the collection index.Describe alternatives you've considered
Currently it is possible to add the contents of a WACZ archive to a collection by unpacking the WACZ archive and manually adding its contents but it's a very manual process.
The text was updated successfully, but these errors were encountered: