Every data scientist eventually tackles MNIST. Here's my attempts.
- (0.75171) LeNet set to minimize cross-entropy. Trained for 100 epochs.
- (0.98614) AlexNet with the same parameters.
- (0.98914) Modified GoogleNet with Batch Normalization instead of LRN, and an average pooling layer removed.
- (0.98743) Googlenet from 3 with adamax optimizer used.
- (Running) AlexNet with a grid search for parameters
Selected models have been uploaded in case anyone wants to play with them.