Optical Character Recognition Using DeepLearning

Text is everywhere! It is present in PDFs, docs as well as images. There are lots of applications where text data is useful for doing analytics. Such applications include receipts recognition, number plate detection, extracting the latex formulas from the images etc. General Computer Vision can be used for such task but it lacks in accuracy. In order to solve the low accuracy and variance problem, we use the state of the art deep neural networks.

This repository includes:

1. A TensorFlow implementation of the CNN+LSTM+CTC model for OCR.
2. supporting scripts to apply the RCNN appraoch for OCR.

Architecture

Instructions on How to run

Get the repository

git clone https://github.com/harshul1610/OCR.git

Get the NIST19 dataset

mkdir data
wget https://s3.amazonaws.com/nist-srd/SD19/by_class.zip
unzip by_class.zip
mv by_class NIST19

Get the Captcha data

cd OCR
python2 generate_captcha.py

Run the final notebook for training and testing

CNN_LSTM_CTC_OCR-captcha.ipynb

LICENSE

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.ipynb_checkpoints		.ipynb_checkpoints
images		images
.gitignore		.gitignore
CNN_LSTM_CTC_OCR-captcha.ipynb		CNN_LSTM_CTC_OCR-captcha.ipynb
Combine_Images_annotations_data.ipynb		Combine_Images_annotations_data.ipynb
LICENSE		LICENSE
LSTM_CTC_OCR-captcha.ipynb		LSTM_CTC_OCR-captcha.ipynb
LSTM_CTC_OCR.ipynb		LSTM_CTC_OCR.ipynb
README.md		README.md
captcha		captcha
generate_captcha.py		generate_captcha.py
generate_tfrecord.py		generate_tfrecord.py
label_cls_name.json		label_cls_name.json
make_annotations.ipynb		make_annotations.ipynb
make_pbtxt.ipynb		make_pbtxt.ipynb
ocr_classification.ipynb		ocr_classification.ipynb
xml_to_csv.py		xml_to_csv.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optical Character Recognition Using DeepLearning

Architecture

Instructions on How to run

LICENSE

About

Releases

Packages

Languages

License

harshuljain13/OCR

Folders and files

Latest commit

History

Repository files navigation

Optical Character Recognition Using DeepLearning

Architecture

Instructions on How to run

LICENSE

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages