-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Validate PDF/A files #285
Comments
@mrivoal @loicjaouen @gfoo Lukas says we're not going to do this. You have to validate your own PDFs before putting them into Sipi. You could use https://verapdf.org/home/ for that. |
I guess we could do that by ourselves now. But at some point (and I guess sooner than later), when a user can upload a PDF to Sipi using Salsah or KUIRL, his PDF should be validated somehow, isn't it? So, the idea is that Sipi only hosts validated PDF/A? |
If I understood correctly, @lrosenth said each project is responsible for ensuring that its data is valid before upload/import. Perhaps a GUI could handle validation before submitting the file to Sipi. |
Hi to all from Corsica...
Validating a PDF/A is a complex process. I could imagine that I could use the ghostscript-library to di it, but it will be a lot of work. Acrobat allows to validate PDFs before upload. So I suggest for the moment its up to the user. Later - as i mentioned - we can add some validation...
Lukas
Von meinem iPhone gesendet
Am 05.07.2019 um 09:58 schrieb Benjamin Geer <notifications@github.com<mailto:notifications@github.com>>:
If I understood correctly, @lrosenth<https://github.com/lrosenth> said each project is responsible for ensuring that its data is valid before upload/import. Perhaps a GUI could handle validation before submitting the file to Sipi.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<#285>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ABJX3TCH3G77PDK5WUQESLDP535JHANCNFSM4G3265GA>.
|
I think that Sipi needs to do this. In the context of long-term preservation, this is a must-have. But as @lrosenth said, it will take a bit longer to actually implement. |
Ok, fine. We will do our own PDF validation for now. But at some point, Sipi should definitely check during the upload every file format we are going to accept. |
@lrosenth : a question regarding the PDF formats we are willing to store and preserve though Sipi.
(As we are going to convert some 1200 PDF files for Lumières.Lausanne, let's choose the right version/format!) Here are the comments from the CINES on PDF/A versions:
|
This is the only PDF/A validation library I found that looks serious and well-maintained:
https://verapdf.org/home/
The text was updated successfully, but these errors were encountered: