Problem Statement: In most real-world applications, labelled data is scarce. Suppose you are given the Fashion-MNIST dataset (https://github.com/zalandoresearch/fashion-mnist), but without any labels in the training set. The labels are held in a database, which you may query to reveal the label of any particular image it contains. Your task is to build a classifier to >90% accuracy on the test set, using the smallest number of queries to this database.
Approach: Use a variational autoencoder for unsupervised training on the full unlabeled dataset. Then train the classifier on the encodings of the autoencoder.
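Below is a minimal sketch of this two-stage idea in plain PyTorch, purely for illustration: the actual model in this repository is RDenseCNN-inspired (see below), and every layer size and class name here is an assumption, not the repo's code.

```python
import torch
import torch.nn as nn


class ConvVAE(nn.Module):
    """Minimal convolutional VAE for 28x28 grayscale images (illustrative)."""

    def __init__(self, latent_dim: int = 32):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1),   # 28x28 -> 14x14
            nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1),  # 14x14 -> 7x7
            nn.ReLU(),
            nn.Flatten(),
        )
        self.fc_mu = nn.Linear(64 * 7 * 7, latent_dim)
        self.fc_logvar = nn.Linear(64 * 7 * 7, latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 64 * 7 * 7),
            nn.ReLU(),
            nn.Unflatten(1, (64, 7, 7)),
            nn.ConvTranspose2d(64, 32, 3, stride=2, padding=1, output_padding=1),  # 7x7 -> 14x14
            nn.ReLU(),
            nn.ConvTranspose2d(32, 1, 3, stride=2, padding=1, output_padding=1),   # 14x14 -> 28x28
            nn.Sigmoid(),
        )

    def forward(self, x):
        h = self.encoder(x)
        mu, logvar = self.fc_mu(h), self.fc_logvar(h)
        # Reparameterization trick: sample z while keeping gradients flowing.
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        return self.decoder(z), mu, logvar


class LatentClassifier(nn.Module):
    """Stage 2: a small head trained on the pre-trained encoder's mean codes."""

    def __init__(self, vae: ConvVAE, num_classes: int = 10):
        super().__init__()
        self.vae = vae
        self.head = nn.Linear(vae.fc_mu.out_features, num_classes)

    def forward(self, x):
        # Freezing the encoder is one simple choice; fine-tuning it is also possible.
        with torch.no_grad():
            mu = self.vae.fc_mu(self.vae.encoder(x))
        return self.head(mu)
```

Stage 1 trains `ConvVAE` on all 60,000 unlabeled images with a reconstruction + KL loss; stage 2 only needs labels for the images fed through `LatentClassifier`.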
(Figure: sample images after reconstruction through the variational autoencoder.)
This project demonstrates:

- Convolutional variational autoencoder in PyTorch
- VS Code dev container with CUDA support
- PyTorch Lightning for training (see the sketch after this list). Vanilla PyTorch would be possible as well, but if a framework exists for the job, it is better to use it.
- Torchmetrics for calculating accuracy and F1 scores.
- Hydra for experiment management. All experiments are named after their deviation from the default configuration.
- TensorBoard for advanced visualization of the encoded labels and training statistics:
  - Scalars for training progress
  - Images for visualizing the reconstructed images from the autoencoder
  - Projector for visualizing the embeddings
- Ax for hyperparameter optimization
- flake8 for linting, black for formatting, isort for import sorting.
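To make the Lightning and Torchmetrics items concrete, here is a minimal sketch of how the classifier stage can be wired up. Class, metric, and log names are assumptions for illustration (using a recent Torchmetrics API), not the repo's actual code.

```python
import pytorch_lightning as pl
import torch
import torch.nn.functional as F
import torchmetrics


class ClassifierModule(pl.LightningModule):
    """Illustrative LightningModule for the supervised classifier stage."""

    def __init__(self, model: torch.nn.Module, lr: float = 1e-3):
        super().__init__()
        self.model = model
        self.lr = lr
        self.accuracy = torchmetrics.Accuracy(task="multiclass", num_classes=10)
        self.f1 = torchmetrics.F1Score(task="multiclass", num_classes=10)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = F.cross_entropy(self.model(x), y)
        self.log("train/loss", loss)  # appears under Scalars in TensorBoard
        return loss

    def validation_step(self, batch, batch_idx):
        x, y = batch
        logits = self.model(x)
        self.log("val/accuracy", self.accuracy(logits, y))
        self.log("val/f1", self.f1(logits, y))

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=self.lr)
```

A `pl.Trainer(max_epochs=...).fit(module, train_loader, val_loader)` call then drives the loop; Lightning's default logger writes the TensorBoard logs.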
The architecture is inspired by RDenseCNN and the SAM optimizer.
Side note: Usually, I use poetry or conda as a package manager. However, the NVIDIA Docker container comes with an existing environment that suffers from an `InvalidVersionSpec` error. Conda is also incredibly slow in this case, so I am using `pip install` to add the missing dependencies to the Docker image:
```
(base) vscode@54a0b6598fe8:/workspaces/efficient-mnist-learner$ conda env export
InvalidVersionSpec: Invalid version '1.10.0+cu113<1.11.0': invalid character(s)
```
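A minimal example of that workaround, assuming the missing packages are collected in the `requirements.txt` used by the Docker setup below:

```bash
pip install -r requirements.txt
```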
| Number of training labels | Accuracy on validation |
|---|---|
| 1000 | 84.74% |
| 2000 | 86.25% |
| 5000 | 89.37% |
| 7500 | 90.42% |
| 10000 | 91.03% |
| 20000 | 91.65% |
| 30000 | 93.04% |
| 60000 | 93.56% |
Pre-training the autoencoder on the whole unlabeled dataset yields a boost of 2-3% over an otherwise identical model without this pre-training.
Open the project in the VS Code dev container and select the `base` conda environment that ships with the container. This is tested with CUDA 11.4 on Ubuntu 20.04 LTS. If you don't want to use VS Code, you can run it in Docker as well:
```bash
cp requirements.txt .devcontainer/
docker build .devcontainer/ -t devcontainer
docker run --rm -it -v $(pwd):/workspace devcontainer bash
```
You can run an experiment from the command line:

```bash
python -m eml
```
Parameters can be overridden as follows:

```bash
python -m eml variational_sigma=0.001 unsupervised_epochs=10 classifier_epochs=10
```
More details can be found in the Hydra documentation. All configuration options are described in `Config.py`.
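As a rough illustration of how such a Hydra entry point can look: the field names below mirror the overrides shown above, but the default values and the structure are assumptions, not the repo's actual `Config.py` (and a recent Hydra version is assumed).

```python
from dataclasses import dataclass

import hydra
from hydra.core.config_store import ConfigStore


@dataclass
class Config:
    # Field names mirror the overrides above; default values are illustrative.
    variational_sigma: float = 0.01
    unsupervised_epochs: int = 20
    classifier_epochs: int = 20


# Register the structured config so Hydra can validate command-line overrides.
cs = ConfigStore.instance()
cs.store(name="config", node=Config)


@hydra.main(config_path=None, config_name="config", version_base=None)
def main(cfg: Config) -> None:
    # Hydra has already merged any command-line overrides into cfg here.
    print(cfg.variational_sigma, cfg.unsupervised_epochs, cfg.classifier_epochs)


if __name__ == "__main__":
    main()
```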