Digit-Recognition

In this project, we train our own ANN with one hidden layer using backpropagation algorithm on MNIST dataset which later can help our machine to identify a given numeric digit.

More About Dataset

Information and documentation of the dataset can be found on: http://yann.lecun.com/exdb/mnist/

Initializing the dataset

Dataset consists the matrix, where each image is represented in the form 1x784

Follow the code in file imageProjection.m to project/visualize the image

Link to download dataset is given in the pdf file present in Theory folder

init() function initializes the dataset

We build a bigger training dataset which contains images from each class one by one i.e. we make a training dataset containing images arranged as 0,1,2....9. In order to make a uniform dataset, we take 5421[as the class having minimum (class 5) samples is equal to 5421] images from each training class as training dataset of 5421x10.

Arrangement like this is done so as to maintain degree of randomness in our dataset thus weight distribution becomes uniform for all types of digits

BackPropagation Algorithm

The backpropagation algorithm was originally introduced in the 1970s, but its importance wasn't fully appreciated until a famous 1986 paper by David Rumelhart, Geoffrey Hinton, and Ronald Williams. That paper describes several neural networks where backpropagation works far faster than earlier approaches to learning, making it possible to use neural nets to solve problems which had previously been insoluble. Today, the backpropagation algorithm is the workhorse of learning in neural networks.

This chapter is more mathematically involved than the rest of the book. If you're not crazy about mathematics you may be tempted to skip the chapter, and to treat backpropagation as a black box whose details you're willing to ignore. Why take the time to study those details?

Link for step-by-step backpropagation execution. The contents of the blog are also present in the theory folder. (Just in case ¯＼(º_o)/¯ )

Stopping Criteria

$Error_{total} = \frac{1}{2} (Output_{Expected} - Output_{Calculated})^2$
The above is the squared error function used to calculate the error. For each iteration, we check the error obtained in the previous iteration and the current iteration and the difference is then compared with a momentum value. If this goes below a value, we stop the process.

Results

Classwise accuracies achieved are:
Accuracy for class 0 is 98.469388 percent
Accuracy for class 1 is 98.149780 percent
Accuracy for class 2 is 95.930233 percent
Accuracy for class 3 is 95.049505 percent
Accuracy for class 4 is 94.602851 percent
Accuracy for class 5 is 93.609865 percent
Accuracy for class 6 is 96.764092 percent
Accuracy for class 7 is 94.844358 percent
Accuracy for class 8 is 96.201232 percent
Accuracy for class 9 is 94.449950 percent

The overall accuracy obtained is 95.85%

Module Explanation

DataSet/mnist_all.mat: Matfile containing processed MNIST dataset containing 10 classes (0-9) having 60000 training images and 10000 testing images.
DataSet/DigitRecognition.mat: Matfile containing the final weight matrices after completely training the model after the absolute error is unity. When tested using these weight matrices, we get the accuracy discussed above.
DataSet/trainSet.mat: Refer cotent under the heading Initializing The Dataset.
Code/imageProjection: Visualizing the dataset.
Code/init: Initializing the dataset as explained under the heading Initialing The Dataset.
Code/backPropagation: Declaring the weight matrix and creating the neural network. Calls the feedForward.m method and feeds the input using wait matrix to the hidden layer which further feeds forward to the output layer. Takes the total error at the output layer and corrects the error.In short, performs the backpropagation algorithm.
Code/feedForward: Method that uses the formula input*weight + bias and later uses the sigmoidal function to normalise the value.
Code/Test: Testing the 10000 test images over the trained weights obtained from the module backPropagation.m.
Code/DigitRecognition: Main Method that triggers all the other method.

Refer summary and statistics file for more details.

I would like to show my deepest and humble gratitude towards my mentor, Professor Akshay Rajput who helped me in all sorts. Much thanks! ಥ‿ಥ
Feel free to drop any suggestions at parthendo@geu.ac.in

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Digit-Recognition

More About Dataset

Initializing the dataset

BackPropagation Algorithm

Stopping Criteria

Results

Module Explanation

Files

README.md

Latest commit

History

README.md

File metadata and controls

Digit-Recognition

More About Dataset

Initializing the dataset

BackPropagation Algorithm

Stopping Criteria

Results

Module Explanation