System.AccessViolationException: Attempted to read or write protected memory when performing OCR Extraction on pdf files #2680

divyangashah · 2019-10-01T10:55:59Z

Hi team,

When we are calling:

string getContent = currentPage.GetText();

It gives error below mentioned in some of the pdf documents when trying to perform OCR Text Extraction.
Please update the solution if it is available.

Environment

Tesseract Version: 4.0.0.0-beta3
Platform: Windows 64-bit

Current Behavior:
giving below error in some documents:

[ERROR] 2019-09-30 14:14:51.43,Attempted to read or write protected memory. This is often an indication that other memory is corrupt.,(:0)
System.AccessViolationException: Attempted to read or write protected memory. This is often an indication that other memory is corrupt.
at Imaging.OCR.TExtraction.RecognizeandExtractText(String inputFile, String outputFile, Boolean isReduceToBitonal)
at Imaging.IG.ConsumerTest.FormMain.OCRExtraction()

Expected Behavior:
It should give text content available in pdf file when perform OCR Text Extraction.

Suggested Fix:
Same issue was in tesseract 3.0v which was resolved in 3.0.2-alpha1

stweil · 2019-10-01T11:11:44Z

Please use a current Tesseract version, either 4.1.0 or the latest version from Git master.

Your error report also indicates that you don't run the Tesseract executable, but some third party software, so you should report any problems there.

Please use the user forum for additional questions.

divyangashah · 2019-10-01T15:34:34Z

@stweil can you please share the github repo. or link where can I find either 4.1.0 or the latest version.

Thanks!

stweil · 2019-10-01T15:37:35Z

https://github.com/tesseract-ocr/tesseract

divyangashah · 2019-10-01T15:43:25Z

Thanks @stweil
actually, I am looking for Tesseract.dll for .net c#
Please share if any available.

zdenop · 2019-10-01T18:49:13Z

We provide only source code. You need to create binary file for yourself or take it from other projects.

stweil closed this as completed Oct 1, 2019

stweil added the question label Oct 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

System.AccessViolationException: Attempted to read or write protected memory when performing OCR Extraction on pdf files #2680

System.AccessViolationException: Attempted to read or write protected memory when performing OCR Extraction on pdf files #2680

divyangashah commented Oct 1, 2019

stweil commented Oct 1, 2019

divyangashah commented Oct 1, 2019

stweil commented Oct 1, 2019

divyangashah commented Oct 1, 2019

zdenop commented Oct 1, 2019

System.AccessViolationException: Attempted to read or write protected memory when performing OCR Extraction on pdf files #2680

System.AccessViolationException: Attempted to read or write protected memory when performing OCR Extraction on pdf files #2680

Comments

divyangashah commented Oct 1, 2019

stweil commented Oct 1, 2019

divyangashah commented Oct 1, 2019

stweil commented Oct 1, 2019

divyangashah commented Oct 1, 2019

zdenop commented Oct 1, 2019