Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

preprocessing file missing #135

Merged
merged 1 commit into from
Apr 25, 2022
Merged

Conversation

TITC
Copy link
Collaborator

@TITC TITC commented Apr 23, 2022

this repo missing a file which could cause the below error

Error: Cannot find module './LaTeX-OCR/pix2tex/dataset/preprocessing/third_party/match-at/lib/matchAt.js'. Please verify that the package.json has a valid "main" entry

copy from im2markup

and I am not sure whether you encountered this issue or not?

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe7 in position 2270: invalid continuation byte

if you encountered it, I think it would be better to add a prompt that uses python2.7 on readme.


2022.4.24 11:41
I notice above error can reproduce at im2latex-100K's im2latex_formulas.lst file, but work fine at my own dataset.

@lukas-blecher
Copy link
Owner

I have not encountered this issue. Will investigate it.
Maybe it is related to something I noticed a while ago. When trying to normalize latex code with \sqrt[3]{...} it wont't work correctly resulting in \sqrt [ Object object ] { ... } or something like that.

@TITC
Copy link
Collaborator Author

TITC commented Apr 24, 2022

wield.

I re-download file from here. It will report such issue. Then I open it type CTRL S triggle save action even nothing be changed. Then everything is normal.

@lukas-blecher
Copy link
Owner

Hi I've just noticed the error was in the .gitignore. I do have the file locally which is why there was no error for me.
I'll quickly check if the files match and merge after

@lukas-blecher lukas-blecher merged commit d5b0d9c into lukas-blecher:main Apr 25, 2022
lukas-blecher added a commit that referenced this pull request Apr 25, 2022
@TITC TITC deleted the preprocessing branch April 26, 2022 07:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants