-
Notifications
You must be signed in to change notification settings - Fork 389
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PdfEmitter, Enhancement: Enable options to set the PDF/Version & archiving formats PDF/A (A1A & A1B) #1486
Comments
archiving formats PDF/A (A1A & A1B) (eclipse-birt#1486)
This is definitely a step into the right direction. I'm not sure about some details, though. First of all, would it be possible to also create PDF/A-3 (or at least in a later step?) Next, I'm not sure about PDF/A-1a (EDITED: correct is 1a (a for accessible) instead of 1b (b for basic). If the PDF is not properly tagged, it would be better to not allow declaring the PDF to be PDF/A-1b conformant. Regarding the title and language: Regarding font handling: |
I'm aware of the topic with PDF/A-3 and the ZUGFeRD-invoice-method of german governance agencies.
|
I cannot believe this. AFAIK calling You can use the PAC 2021 (PDF Accessibility Checker) to actually see the tag structure. |
I have not set the the structure will be a full representation of all elements which are included on the DPF-document. But this is not my target. These topics are steps for the next level. |
Even though the validators say that the generated PDF is valid PDF/A-1a, this is not the case. If we add an option to create (seemingly) PDF/A-1a, people would of course think that BIRT supports PDF/A-1a. This topic is very important for the future of BIRT and thus I think the output should not "lie" by stating it's PDF/A-1a conformant when it isn't really. So, in a first step we should only support PDF/A-1b ("b" for basic profile). Adding support for PDF/A-1b (and for PDF/UA, by the way) will be easy once we can create tagged PDF. |
@hvbtup The PDF/UA conformance needs according to your explanations more details and the full tagged structure. The PDF/UA stanards is a subset of PDF/A1a and PDF/UA is more restrictive instead of PDF/A1a. So I checked also a PDF/A1a with the wrong "dc:title" definition with the added Title but the unnoted language-attribute. Currently my point of view is that the PDF/A1a version is a valid option. But you can test the created PDF/A1a documents from my demo-zip-file and you will finde, |
I have tested some further PDF/A-validators and here is a result which represents the difference between PDF/A-1a & PDF/UA-1. So I think our PDF/A-1a based on openPDF is a valid option. Additional tested validators |
I don't own a copy of the EN ISO 19005 standard, so probably you are right and it only means that the reading order is OK. I tested your document This tool says:
The validator does not complain about the missing tag structure, although this structure is empty (as can be seen with PAC 2021), so that's ok for me. The same test result also for PDF/A-1b conformance. Probably many documents claiming PDF/A1-a (or PDF/A1-b) conformance aren't actually conformant, so while we know that there are issues left (like the |
@wimjongman |
This issue is the release note. |
I prefer that we add the new option on the release notes and additional on it we should not that the PDF/A* will be genereated with "openPDF version 1.3.30". So it is full transparent which library is used to generete the PDF/A. I agree that we should add the hints PDF/A1b = ok, PDF/A1a with limitation based openPDF. At the end with my experience in the past and you Henning that PDF/A is complex and one of the main issue is to have a good generator of PDF/A and also to have a good validator. I tested with "https://www.pdf-online.com/osa/validate.aspx" agaian and there is noted the "cm"-operator and the issue with the "dc:title". So we have know 9 validators of PDF/A1a with 7time=ok, 1times = faild due to "cm", 1times failed due to "cm" and "dc:title" (if it used). And all this differences of a standard which was published 2005. |
🙈 |
One of the main problems with standards like EN ISO 19005 is that they are not open source. For example, I was not able to find a statement that |
static constants access) (eclipse-birt#1486)
The target is to support different PDF-versions and PDF archive formats PDF/A (A1A & A1B)
Based on openPDF version 1.3.30 the following features will be available:
• setting of the PDF-version: 1.3 till 1.7
• setting of the PDF/A-version: A1A & A1B
• settong of the PDF/A-version: PDF.X32002
The default of the PDF creation won't be changed.
The new options will be activated only with the configuration of the according user properties.
The following user properties will be available on the PdfEmitter:
PdfEmitter.Version, String, enumeration
• supported values: 1.3. 1.4, 1.5, 1.6, 1.7
• default: 1.5
PdfEmitter.Conformance, String, enumeration
• supported values: PDF.A1A, PDF.A1B, PDF.X32002
• default: PDFXNONE
• PDF.A1A & PDF.A1B overwrite the configured PDF-version with PDF-version 1.4
PdfEmitter.IccProfileFile, String, file path
• path to the ICC color profile
• default: sRGB IEC61966-2.1
PdfEmitter.IccColorType, String, enumeration
• supported values: RGB, CMYK
• default: RGB
PdfEmitter.PDFA.AddDocumentTitle, Boolean
• false, avoid the title at document / true, add the title at document (openPDF 1.3.30: PDF/A will be invalid)
• default: false
• Workaround: openPDF 1.3.30 issue on creating the PDF/A title, the tag "dc:title" is invalid due to missing language-atrribute at the title content
PdfEmitter.PDFA.FallbackFont, String, font file name
• path of an alternative installed font name which can be used instead of an not mebeddable font
• default: null
The PDF/A results are tested and validated with 3 different PDF/A validators:
• pdfforge.org
• avepdf.com
• veraPDF
For the testing the results of 3 different basic test cases was used incl. images (direct & background images),
the basic tests are:
• Version: PDF/A1A
• Version: PDF/A1A with fallback font of unembeddable fonts
• Version: PDF/A1B with fallback font of unembeddable fonts
(• Version: PDF with configured PDF-version is tested and validated with the AdobeReader details dialog.)
First results of the new options, PDF/A1A with fallback font to be a valid PDF/A document:
pdf_enhancement_version_PDF_A1A_FBfont.pdf
Attached the following test reports are given:
• 4 test reports of:
pdf_enhancement_test_reports.zip
Attached the following documentation is given:
• 4 pdf documents of
pdf_enhancement_documents.zip
Attached the following PDF/A validation documentation is given:
• 3 screens of each PDF/A-validator of the results of the PDF/A-documents
pdf_enhancement_validation_documentation.zip
The text was updated successfully, but these errors were encountered: