A .NET wrapper for tesseract-ocr.
Code License: Apache License 2.0
Site Content License (Documentation etc): [Creative Commons Attribution 3.0 Unported License](http://creativecommons.org/licenses/by/3.0/)
This is currently prerelease software; as such, the public API is subject to change.
Since the tesseract and leptonica binaries are compiled with Visual Studio 2008 SP1, you'll need to ensure you have the Visual Studio 2008 SP1 Runtime installed. This can be found here.
Note: Compiling the project requires at least MS Visual Studio 11 Express for Desktop or SharpDevelop 4.2.
- Fork this project (see: https://help.github.com/articles/fork-a-repo)
- Ensure you have Visual Studio 2008 SP1 x86 runtime installed (see note above).
- Download language data files for tesseract 3.02 from http://code.google.com/p/tesseract-ocr/
- Build the BaseApiTester project
- Copy language files into `BaseApiTester\bin\[config]\tessdata`
- Run BaseApiTester for an example (work in progress)
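
Before running the tester, it can help to confirm that the language files actually landed in the `tessdata` folder. The following is a minimal sketch, not part of this project: the helper name `find_language_data` is hypothetical, the `Debug` configuration in the default path is an assumption (substitute your own `[config]`), and it assumes the language data ships as `*.traineddata` files (for example `eng.traineddata`).

```python
import sys
from pathlib import Path


def find_language_data(tessdata_dir):
    """Return the sorted language codes that have a .traineddata file in tessdata_dir.

    Hypothetical helper for illustration; returns an empty list if the
    folder does not exist.
    """
    folder = Path(tessdata_dir)
    if not folder.is_dir():
        return []
    # "eng.traineddata" -> "eng"
    return sorted(p.stem for p in folder.glob("*.traineddata"))


if __name__ == "__main__":
    # Assumed default: a Debug build; pass another path as the first argument.
    default_dir = r"BaseApiTester\bin\Debug\tessdata"
    target = sys.argv[1] if len(sys.argv) > 1 else default_dir
    languages = find_language_data(target)
    if languages:
        print("Language data found:", ", ".join(languages))
    else:
        print("No .traineddata files found in", target)
```

If the script reports no files, re-check the copy step above and make sure the folder name matches the build configuration you are running.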
Please help yourselves to one of the following tasks (or create a new one). Please leave a comment in the corresponding task to avoid duplication of work.
| Task Num | Task | Status |
|---|---|---|
| | Tesseract - Core Apis | Active |
| | Tesseract - Regression test infrastructure | Pending |
| | Leptonica - Scanned image preparation functions | Pending |
| | Tesseract - API Cleanup and improvements | Pending |
| | Tesseract - Create sample app | Pending |
| | Tesseract - Dictionary lookup [not sure about this] | Pending |