Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

calamari-predict truncates filename #308

Open
stefanCCS opened this issue Mar 1, 2022 · 3 comments
Open

calamari-predict truncates filename #308

stefanCCS opened this issue Mar 1, 2022 · 3 comments

Comments

@stefanCCS
Copy link

If the image to be "OCRed" has more than one '.' in filename, some parts of the resulting filename are truncated.
E.g.:
something.else.png --> something.pred.txt instead of something.else.pred.txt

@andbue
Copy link
Member

andbue commented Mar 1, 2022

Right, that's a little bit annoying, I've struggled with that myself before. In ocropus, the image file names contain information on preprocessing (e.g. 001.bin.png) that have to be ignored. If we change the current behaviour, we might brake support for legacy datasets. I don't know if ocr4all needs this - @chreul ?
Maybe we could either implement a command line switch to toggle file extension handling or just ignore a specific set of strings (bin, raw, nrm, maybe col?).

@maxnth
Copy link
Member

maxnth commented Mar 1, 2022

I don't know if ocr4all needs this

OCR4all currently indeed needs this but we could just use a small wrapper / postprocessing script for this (and the newly written back end manages files different anyways) so changing this wouldn't really be a problem for OCR4all.

@stefanCCS
Copy link
Author

Well, in my opinion the current behaviour is unexpected for newcomers like myself.
I (and I assume any other newcomer) like the idea to change this - any additional command line switch would be ok, of course.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants