
Machine Learning Gladiator - Battle Between Algorithms

Repository containing comparisons between different out-of-the-box machine learning algorithms in some classic datasets.

Table of Contents

  • The Concept
  • The Data
      • MNIST
      • Fashion MNIST
      • Titanic
      • Wine Quality
  • Roadmap

The Concept

The name is a playful take on a well-established concept: trying different classic algorithms on a dataset to see how each one behaves.

It's an attempt to answer some questions:

  • How do different models compare to each other under equal conditions?
  • What's the impact of fine-tuning the models (using Grid Search or other methods)?
  • What's the impact of data normalization on different models?
  • Is it OK to use scikit-learn's models out of the box?
  • When should Machine Learning be used at all? Are there situations where it's better not to use ML models?

The idea for this project came from Elite Data Science's blog, in this particular list of projects. The concept of a gladiator, with several algorithms competing on similar terms, is fairly common, so there are other sources that use the term.
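As an illustration of the kind of comparison these questions call for, here is a minimal sketch contrasting an out-of-the-box scikit-learn model with the same model fine-tuned via Grid Search. The dataset (sklearn's digits) and the hyperparameter grid are stand-ins chosen to keep the snippet self-contained, not the exact choices used in the notebooks.

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.metrics import accuracy_score
from sklearn.svm import SVC

# A stand-in dataset; the repository's notebooks use MNIST, Titanic, etc.
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y
)

# Out-of-the-box model: default hyperparameters, no tuning.
baseline = SVC().fit(X_train, y_train)
print("default SVC:", accuracy_score(y_test, baseline.predict(X_test)))

# The same model fine-tuned with Grid Search, to measure the impact of tuning.
grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10], "gamma": ["scale", 0.01]}, cv=3)
grid.fit(X_train, y_train)
print("tuned SVC:  ", accuracy_score(y_test, grid.predict(X_test)))
```

The key point is that both variants see the exact same train/test split, so any difference in accuracy is attributable to tuning alone.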

The Data

Below is a brief overview of the datasets, the models, and the final scores. More information, further conclusions, and analysis can be found in the READMEs inside each subfolder of this repository.

MNIST

The first dataset to which I applied this concept is the well-known MNIST dataset. Since the classes are balanced, the metric used to evaluate the models is accuracy. The models created were:

  • Support Vector Classifier
  • Random Forest
  • K-Nearest Neighbors Classifier
  • MLP Classifier
  • Gradient Boosting Classifier
  • Logistic Regression
  • Perceptron
  • Ridge Classifier
  • Ridge Classifier CV
  • Bernoulli Naive Bayes
  • Gaussian Naive Bayes
  • Decision Tree

MNIST Models' Scores

One surprise is how well the KNN model did in comparison with other, more robust methods.

Models' accuracy on test data (MNIST), using out-of-the-box sklearn algorithms with no optimization. The training data was a subset of 5,000 random images from the MNIST dataset, and the test data was a subset of 2,000 random images.
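A rough sketch of that setup is below. The 5,000/2,000 random subsets match the description above; the use of fetch_openml and the particular pair of classifiers are illustrative assumptions, not the exact notebook code.

```python
import numpy as np
from sklearn.datasets import fetch_openml
from sklearn.neighbors import KNeighborsClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

# MNIST: 70,000 28x28 images, flattened to 784 features each.
X, y = fetch_openml("mnist_784", version=1, return_X_y=True, as_frame=False)

# Random subsets: 5,000 images for training, 2,000 for testing.
rng = np.random.default_rng(42)
idx = rng.permutation(len(X))
train_idx, test_idx = idx[:5000], idx[5000:7000]

# Out-of-the-box models, default hyperparameters only.
for model in (KNeighborsClassifier(), RandomForestClassifier()):
    model.fit(X[train_idx], y[train_idx])
    acc = accuracy_score(y[test_idx], model.predict(X[test_idx]))
    print(type(model).__name__, f"{acc:.3f}")
```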

The notebook is available in the MNIST subfolder and also on Google Colab.


Fashion MNIST

The models used to predict the class of the Fashion MNIST's clothes are:

  • Deep Neural Network
  • Convolutional Neural Network
  • CNN with Transfer Learning from the VGG16 Model

Fashion MNIST Models' Scores

No surprises here. The CNN model did better than the DNN, and the Transfer Learning model did (marginally) better than the CNN.

Comparison between models on the Fashion MNIST dataset: all models were trained, validated, and tested with the same datasets.
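For reference, here is a minimal Keras sketch of the plain CNN variant; the architecture, number of epochs, and other hyperparameters are illustrative assumptions rather than the exact configuration from the notebook.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Fashion MNIST: 60,000 training and 10,000 test images, 28x28 grayscale, 10 classes.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.fashion_mnist.load_data()
x_train = x_train[..., None] / 255.0   # add channel dimension, scale to [0, 1]
x_test = x_test[..., None] / 255.0

model = models.Sequential([
    layers.Conv2D(32, 3, activation="relu", input_shape=(28, 28, 1)),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=5, validation_split=0.1)
print(model.evaluate(x_test, y_test))
```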

The notebook is available in the Fashion MNIST subfolder and also on Google Colab.


Titanic

For this classic dataset, the models created were:

  • Dummy Classifier - a scikit-learn estimator used purely to establish a baseline for the real models.
  • Bernoulli Naive Bayes
  • K-Nearest Neighbors Classifier
  • SGD Classifier
  • Logistic Regression
  • Ridge Classifier
  • Support Vector Classifier
  • Decision Tree
  • Random Forest
  • XGBoost Classifier
  • Neural Network (Deep Neural Network)

Titanic Models' Scores

Here is the final plot, a comparison of the validation accuracy scores with both normalized and unnormalized data:

Comparison Between the Validation Accuracy Using Normalized and Unnormalized Data
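A sketch of how such a normalized-vs-unnormalized comparison can be wired up with scikit-learn pipelines is shown below. Loading the data through fetch_openml, the chosen feature columns, and LogisticRegression as the example model are all assumptions made to keep the snippet self-contained; they are not the exact preprocessing used in the Titanic notebook.

```python
from sklearn.datasets import fetch_openml
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler, OneHotEncoder
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Titanic data from OpenML; keep a small, illustrative set of features.
X, y = fetch_openml("titanic", version=1, as_frame=True, return_X_y=True)
num_cols = ["age", "fare", "sibsp", "parch"]
cat_cols = ["pclass", "sex", "embarked"]
X = X[num_cols + cat_cols]

def build_model(normalize):
    # Numeric branch: impute missing values, optionally standardize.
    num_steps = [("impute", SimpleImputer(strategy="median"))]
    if normalize:
        num_steps.append(("scale", StandardScaler()))
    pre = ColumnTransformer([
        ("num", Pipeline(num_steps), num_cols),
        ("cat", Pipeline([
            ("impute", SimpleImputer(strategy="most_frequent")),
            ("onehot", OneHotEncoder(handle_unknown="ignore")),
        ]), cat_cols),
    ])
    return Pipeline([("pre", pre), ("clf", LogisticRegression(max_iter=1000))])

# Same model, same folds; the only difference is the normalization step.
for normalize in (False, True):
    scores = cross_val_score(build_model(normalize), X, y, cv=5)
    print("normalized:" if normalize else "unnormalized:", scores.mean())
```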


Wine Quality

The Wine Quality Dataset is related to red and white variants of the Portuguese "Vinho Verde" wine. The data can be found on Kaggle (only the Red Wine variant) and in the UCI Machine Learning Repository.

The task can be framed as either classification or regression. For this comparison, I chose to treat it as a classification problem: predicting whether a wine is good (quality equal to or above 7) or bad (otherwise).

The dataset is not balanced: there are many more examples of normal wines than of excellent or poor ones. The authors of the dataset also state that "we are not sure if all input variables are relevant. So it could be interesting to test feature selection methods." A robust EDA could therefore improve the results (or outright replace the ML models).

Wine Quality Models' Scores

This is a good example of a scenario where the use of Machine Learning may not be the best path to take. Using out-of-the-box algorithms and only performing normalization on the data (without an extensive exploratory data analysis or feature engineering) yielded the results below. The high scores of the dummy classifier indicate that the logic behind the classification could be extracted without training and deploying a sophisticated machine learning model.
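The sketch below shows how the binarized target and the dummy baseline can be set up. The UCI download URL, the 5-fold cross-validation, and Random Forest as the comparison model are assumptions for illustration, not the exact notebook code.

```python
import pandas as pd
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

# Red wine variant from the UCI repository (semicolon-separated CSV);
# the URL may change, so adjust to a local copy if needed.
url = ("https://archive.ics.uci.edu/ml/machine-learning-databases/"
       "wine-quality/winequality-red.csv")
df = pd.read_csv(url, sep=";")

# Binarize the target: good wine if quality >= 7, bad otherwise.
X = df.drop(columns="quality")
y = (df["quality"] >= 7).astype(int)

# Baseline: always predict the majority class ("bad"). Because the classes
# are imbalanced, this alone already scores high accuracy.
dummy = DummyClassifier(strategy="most_frequent")
print("dummy:", cross_val_score(dummy, X, y, cv=5).mean())

# A real model with normalization only (no EDA / feature engineering).
model = make_pipeline(StandardScaler(), RandomForestClassifier())
print("random forest:", cross_val_score(model, X, y, cv=5).mean())
```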

Wine Quality Models' Accuracy Comparison in Validation and Training (Cross Validation)

Roadmap

I plan to do similar analyses with the House Prices and TMDB Box Office datasets.
