Problem in part of the translation #273

mauricioros · 2020-02-25T12:33:47Z

Hello!
First of all, thanks for making this amazing tool available!
Now to the problem: I'm creating a small application using PDFParser, but it cannot convert part of a PDF.

Can you help me?

I'm reading the PDF at the following URL:
https://dje.tjsp.jus.br/cdje/consultaSimples.do?cdVolume=14&nuDiario=2986&cdCaderno=12&nuSeqpagina=2993

After the first occurrence of "REQDO" it should output "Guaraná e Ramos Empreendimentos Imobiliários Ltda.". Instead, it outputs the following set of characters: "<001d0003002a00580044005500440051006900030048000300000000000000000000000000000000000000000000000000000000000000003000000000000000000030000000
440011 ["

The demo (https://www.pdfparser.org/demo) shows the same error.

Can you help me? Please!!!

Connum · 2020-09-25T19:10:47Z

The hexadecimal string at this position contains a newline character, which causes this issue. I'm working on a fix that strips any newlines from hexadecimal strings before trying to decode them.
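
To illustrate the idea, here is a minimal sketch in Python. It is not the library's actual code (PDFParser is written in PHP), and the function names `clean_hex_string` and `decode_hex_string` are made up for this example. It relies on the PDF rule that whitespace inside a hexadecimal string is ignored and that a missing final digit is treated as `0`. The sample input is invented and assumes the bytes are UTF-16BE text; the string from the reported PDF uses font-specific codes and would additionally need the font's character mapping.

```python
import re


def clean_hex_string(raw: str) -> str:
    """Return only the hex digits of a PDF hex string such as '<0048 0065>'."""
    body = raw.strip()
    # Drop the surrounding angle brackets if they are present.
    if body.startswith("<") and body.endswith(">"):
        body = body[1:-1]
    # Whitespace (spaces, tabs, line breaks) carries no meaning inside
    # a hex string, so remove it before decoding.
    digits = re.sub(r"\s+", "", body)
    if not re.fullmatch(r"[0-9A-Fa-f]*", digits):
        raise ValueError("not a hexadecimal string")
    # An odd number of digits means the last digit is followed by an implied 0.
    if len(digits) % 2:
        digits += "0"
    return digits


def decode_hex_string(raw: str) -> bytes:
    """Decode a PDF hex string to raw bytes, tolerating embedded line breaks."""
    return bytes.fromhex(clean_hex_string(raw))


# Hypothetical sample: a hex string broken across two lines.
sample = "<00480065\n006C006C006F>"
text = decode_hex_string(sample).decode("utf-16-be")
```

Without the whitespace-stripping step, the line break in `sample` would stop the string from being recognised as pure hexadecimal, which matches the symptom reported above, where the raw hex is printed instead of text.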


Connum · 2020-09-25T20:21:28Z

I just created a PR with a fix! There is another decoding issue when testing the provided file:

EXEQTE  : Infinit����)�D�V�K�L�R�Q���&�R�P�p�U�F�L�R���H���'�L�V�W�U�L�E�X�L�G�R�U�D���(�L�U�H�O�i
ADVOGADO  : 67978/SP - Cleodilson Luiz Sforzin

which will be fixed as well once #342 is merged.

k00ni · 2020-09-30T08:04:42Z

#342 was merged.

…eaks first (fix #273) (#346)
* process hexadecimal strings containing line breaks, but strip line breaks first (fix #273)
* remove binary symbold from test data string
* code linting


k00ni added the bug label May 26, 2020

Connum added a commit to Connum/pdfparser that referenced this issue Sep 25, 2020

process hexadecimal strings containing line breaks, but strip line br…

4455b0d

…eaks first (fix smalot#273)

Connum mentioned this issue Sep 25, 2020

process hexadecimal strings containing line breaks, but strip line breaks first (fix #273) #344

Closed

Connum mentioned this issue Sep 28, 2020

process hexadecimal strings containing line breaks, but strip line breaks first (fix #273) #346

Merged

k00ni closed this as completed Sep 30, 2020
