System.AccessViolationException: Attempted to read or write protected memory when performing OCR Extraction on pdf files #2680

Closed
divyangashah opened this issue Oct 1, 2019 · 5 comments

@divyangashah

Hi team,

When we call:

string getContent = currentPage.GetText();

it throws the error shown below for some PDF documents during OCR text extraction.
Please let us know if a fix is available.

Environment

Tesseract Version: 4.0.0.0-beta3
Platform: Windows 64-bit

Current Behavior:
The following error occurs for some documents:

[ERROR] 2019-09-30 14:14:51.43,Attempted to read or write protected memory. This is often an indication that other memory is corrupt.,(:0)
System.AccessViolationException: Attempted to read or write protected memory. This is often an indication that other memory is corrupt.
at Imaging.OCR.TExtraction.RecognizeandExtractText(String inputFile, String outputFile, Boolean isReduceToBitonal)
at Imaging.IG.ConsumerTest.FormMain.OCRExtraction()

Expected Behavior:
OCR text extraction should return the text content of the PDF file.

Suggested Fix:
The same issue existed in Tesseract 3.0 and was resolved in 3.0.2-alpha1.

@stweil
Contributor

stweil commented Oct 1, 2019

Please use a current Tesseract version, either 4.1.0 or the latest version from Git master.

Your error report also indicates that you are not running the Tesseract executable but some third-party software, so you should report any problems there.
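
One way to tell whether a crash comes from Tesseract itself or from a wrapper library is to run the stock `tesseract` executable on the same page. The following is a minimal sketch, not the reporter's code: `TesseractCli`, `BuildArguments`, and `RecognizePage` are hypothetical names, and it assumes that `tesseract` is on the PATH and that the failing PDF page has already been rendered to an image file, since Tesseract reads images rather than PDFs.

```csharp
using System;
using System.Diagnostics;
using System.Text;

// Hypothetical helper (not part of Tesseract or any wrapper): runs the
// tesseract command-line program on one page image and returns its text.
public static class TesseractCli
{
    // Builds the command line: "<image>" stdout -l <language>
    // Using "stdout" as the output base makes tesseract print the text
    // instead of writing a .txt file.
    public static string BuildArguments(string imagePath, string language)
    {
        return "\"" + imagePath + "\" stdout -l " + language;
    }

    public static string RecognizePage(string imagePath, string language = "eng")
    {
        var startInfo = new ProcessStartInfo
        {
            FileName = "tesseract",                    // assumption: tesseract is on PATH
            Arguments = BuildArguments(imagePath, language),
            RedirectStandardOutput = true,             // capture the recognized text
            StandardOutputEncoding = Encoding.UTF8,    // tesseract writes UTF-8 text
            UseShellExecute = false,                   // required for stream redirection
            CreateNoWindow = true
        };

        using (var process = Process.Start(startInfo))
        {
            if (process == null)
                throw new InvalidOperationException("Could not start tesseract.");

            // Read all output before waiting, so a full pipe cannot block the child.
            string text = process.StandardOutput.ReadToEnd();
            process.WaitForExit();

            if (process.ExitCode != 0)
                throw new InvalidOperationException(
                    "tesseract exited with code " + process.ExitCode);

            return text;
        }
    }
}
```

A call such as `TesseractCli.RecognizePage("page1.png")` (with `page1.png` a placeholder file name) would return the recognized text. If the executable handles the page without error, the fault more likely lies in the wrapper or its native binaries than in Tesseract.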

Please use the user forum for additional questions.

@stweil closed this as completed Oct 1, 2019
@stweil added the question label Oct 1, 2019
@divyangashah
Author

@stweil, could you please share the GitHub repo or a link where I can find either 4.1.0 or the latest version?

Thanks!

@stweil
Contributor

stweil commented Oct 1, 2019

@divyangashah
Author

Thanks, @stweil.
Actually, I am looking for a Tesseract.dll for .NET (C#).
Please share one if available.

@zdenop
Contributor

zdenop commented Oct 1, 2019

We provide only source code. You need to build the binary yourself or take it from another project.


3 participants