Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tesseract 4.1 - Using PageSegMode.SingleBlockVertText does not work #490

Open
charlesw opened this issue Oct 12, 2019 · 3 comments
Open
Assignees
Labels

Comments

@charlesw
Copy link
Owner

The following unit test are failing in Tesseract 4.1 build (branch features\321-Tesseract-4):

  • EngineTests.CanParseText_UsingMode(SingleBlockVertText,"A line of text")

Preliminary investigation indicates the problem is with tesseract itself, however yet to be confirmed (haven't yet reproduced directly using tesseract).

@charlesw charlesw added the bug label Oct 12, 2019
@charlesw charlesw self-assigned this Oct 12, 2019
@Robban1980
Copy link

Completely green on tesseract, just started a project to implement it in a .Net app. When I test with Japanese and --psm 5 I get a line of text, with Japanese and --psm 6 I get a vertical line of text.
Could it be that the English data is not trained on vertical text that is causing the issue?

@charlesw
Copy link
Owner Author

charlesw commented Oct 21, 2019 via email

@charlesw
Copy link
Owner Author

Still broken in Tesseract 5.0 (#579) :(

charlesw added a commit that referenced this issue Jan 23, 2022
Known issues:
* #489 - Setting regions of interest doesn't work
* #490 - PageSegMode.SingleBlockVertText does not work
* #594 - Unz files don't work
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants