ROB: Soft failure for flate encode image mode 1 with wrong LUT size #2900

stefan6419846 · 2024-10-12T17:21:13Z

Closes #2889.

codecov · 2024-10-12T17:29:38Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.44%. Comparing base (c8bfa5f) to head (db2d79c).
Report is 7 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2900   +/-   ##
=======================================
  Coverage   96.44%   96.44%           
=======================================
  Files          52       52           
  Lines        8726     8728    +2     
  Branches     1589     1589           
=======================================
+ Hits         8416     8418    +2     
  Misses        182      182           
  Partials      128      128

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

pubpub-zz · 2024-10-12T20:57:10Z

Is it possible to build tests from the pdf provided in the issue ?

stefan6419846 · 2024-10-13T08:37:58Z

While it is possible, I honestly do not see the need for this here. The existing test already runs on an artificial 3x3 pixel image and covers all edge cases for the LUT. Adding another test does not really have any benefit IMHO and just requires downloading more data.

pubpub-zz · 2024-10-13T08:43:05Z

While it is possible, I honestly do not see the need for this here. The existing test already runs on an artificial 3x3 pixel image and covers all edge cases for the LUT. Adding another test does not really have any benefit IMHO and just requires downloading more data.

I understand and agree with what you mean. My point is just I prefer to use actual data to be able to compare pypdf behavior versus other libraries/viewers.

stefan6419846 · 2024-10-18T13:16:34Z

I would argue that this is not what tests are about - tests should be written to ensure that everything covered by them is working correctly and does not break for whatever reason. If we are able to ensure this with a small dummy image, this is much easier than downloading at least 1 MB of data beforehand and essentially verifying that no error is raised by default - with the small dummy image, we are able to verify the extraction result directly from within code as well.

Where are we doing a comparison at the moment? We might have a table or written list and some benchmarking, but nothing which would indicate this from a test perspective. Nothing prevents us to use actual data in the case that we explicitly need comparison code later on.

pubpub-zz · 2024-10-18T17:16:59Z

Sorry if you though I was waiting for something. Just a little overbusy

stefan6419846 · 2024-10-18T17:21:36Z

No worries - I just added on your comment and would have waited for the discussion result anyway.

@hpierre001

## What's new ### New Features (ENH) - Add `layout_mode_font_height_weight` argument to `PageObject.extract_text()` (#2920) by @hpierre001 ### Bug Fixes (BUG) - Fix font specificier for FreeText annotation (#2893) by @ssjkamei - Line breaks are not generated due to incorrect calculation of text leading (#2890) by @ssjkamei - Improve handling of spaces in text extraction (#2882) by @ssjkamei ### Robustness (ROB) - Soft failure for flate encode image mode 1 with wrong LUT size (#2900) by @stefan6419846 ### Documentation (DOC) - Use latest package versions (#2907) by @stefan6419846 - Correct example of reading FileAttachment annotation (#2906) by @j-t-1 ### Developer Experience (DEV) - Update pinned requirements (#2918) by @stefan6419846 - Make make_release.py compatible with Windows environment (#2894) by @pubpub-zz ### Maintenance (MAINT) - Remove references to outdated Python versions (#2919) by @stefan6419846 - Generalize the method of obtaining space_code (#2891) by @ssjkamei - Unnecessary character mapping process (#2888) by @ssjkamei - New LZW decoding implementation (#2887) by @MartinThoma ### Testing (TST) - Add LzwCodec for encoding (#2883) by @MartinThoma ### Code Style (STY) - Capitalize error messages (#2903) by @j-t-1 - Modify error messages in PdfWriter (#2902) by @j-t-1 [Full Changelog](5.0.1...5.1.0)

ROB: Soft failure for flate encode image mode 1 with wrong LUT size

db2d79c

Closes #2889.

pubpub-zz approved these changes Oct 18, 2024

View reviewed changes

stefan6419846 merged commit 80c3939 into main Oct 18, 2024
16 checks passed

stefan6419846 deleted the flate-mode1 branch October 18, 2024 17:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ROB: Soft failure for flate encode image mode 1 with wrong LUT size #2900

ROB: Soft failure for flate encode image mode 1 with wrong LUT size #2900

stefan6419846 commented Oct 12, 2024

codecov bot commented Oct 12, 2024 •

edited

Loading

pubpub-zz commented Oct 12, 2024

stefan6419846 commented Oct 13, 2024

pubpub-zz commented Oct 13, 2024

stefan6419846 commented Oct 18, 2024

pubpub-zz commented Oct 18, 2024

stefan6419846 commented Oct 18, 2024

ROB: Soft failure for flate encode image mode 1 with wrong LUT size #2900

ROB: Soft failure for flate encode image mode 1 with wrong LUT size #2900

Conversation

stefan6419846 commented Oct 12, 2024

codecov bot commented Oct 12, 2024 • edited Loading

Codecov Report

pubpub-zz commented Oct 12, 2024

stefan6419846 commented Oct 13, 2024

pubpub-zz commented Oct 13, 2024

stefan6419846 commented Oct 18, 2024

pubpub-zz commented Oct 18, 2024

stefan6419846 commented Oct 18, 2024

codecov bot commented Oct 12, 2024 •

edited

Loading