Skip to content

Latest commit

 

History

History
40 lines (28 loc) · 1.89 KB

ReadMe.md

File metadata and controls

40 lines (28 loc) · 1.89 KB

A .NET wrapper for tesseract-ocr.

Code License: Apache License 2.0
Site Content License (Documentation etc): [Creative Commons Attribution 3.0 Unported License](license" href="http://creativecommons.org/licenses/by/3.0/)

Warning - Prerelease software (alpha)

This is currently prerelease software as such the public API is subject to change.

Dependencies

Visual Studio 2008 SP1 x86 Runtime

Since tesseract and leptonica binaries are compiled with Visual Studio 2008 SP1 you'll need to ensure you have the Visual Studio 2008 SP1 Runtime installed. This can be found here.

Getting started quickly

Note: Compiling the project requires at least MS Visual Studio 11 Express for Desktop or SharpDevelop 4.2.

  1. Fork this project (see: https://help.github.com/articles/fork-a-repo)
  2. Ensure you have Visual Studio 2008 SP1 x86 runtime installed (see note above).
  3. Download language data files for tesseract 3.02 from http://code.google.com/p/tesseract-ocr/
  4. Build BaseApiTester project
  5. Copy language files into BaseApiTester\bin\[config]\tessdata
  6. Run BaseApiTester for example (work in progress)

To-do

Please help yourselves to one of the following tasks (or create a new one). Please leave a comment in the corresponding task to avoid duplication of work.

Task Num Task Status
| Tesseract - Core Apis 					| Active
| Tesseract - Regression test infrastructure			| Pending
| Leptonica - Scanned image preparation functions		| Pending
| Tesseract - API Cleanup and improvements			| Pending
| Tesseract - Create sample app					| Pending
| Tesseract - Dictionary lookup [not sure about this]		| Pending