
Classical prediction of future data using models trained by labeled data


KizMan-23/supervised_models


A supervised model is a machine learning model that is trained on pre-labelled data so that it can make predictions on unseen data.

TYPES OF SUPERVISED TECHNIQUES

  1. Classification
  2. Regression

Supervised_Models contains projects that use one of these two supervised techniques to predict on unseen data. In nba_data and nba_cleaning, an NBA dataset is loaded into a Jupyter notebook and, using Python libraries such as pandas, prepared and cleaned so that it is usable for further model training and prediction.

nba_cleaning ss2 ss3

Cleaning the dataset is a very important stage: it makes the data uniform, so the data used for training and testing the models is consistent.
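A minimal sketch of this kind of cleaning step with pandas is shown here. The column names (`Player`, `PTS`, `Year`) and the sample rows are hypothetical stand-ins, not the actual NBA dataset, and filling missing values with 0 is only one possible choice.

```python
import pandas as pd

# Hypothetical raw rows; the real NBA dataset's columns may differ.
raw = pd.DataFrame({
    "Player": ["A. Example", "B. Sample", "B. Sample", "C. Demo"],
    "PTS": ["25.1", "18.4", "18.4", None],  # numbers stored as text, one missing
    "Year": [2021, 2021, 2021, 2022],
})

def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop exact duplicate rows and make the PTS column numeric."""
    out = df.drop_duplicates().copy()
    # Convert text to numbers; unparseable or missing entries become NaN.
    out["PTS"] = pd.to_numeric(out["PTS"], errors="coerce")
    # Assumption: treat missing points as 0 (dropping the rows is another option).
    out["PTS"] = out["PTS"].fillna(0.0)
    return out.reset_index(drop=True)

cleaned = clean(raw)
print(cleaned)
```

After this step every row has the same types and no missing values, so the same frame can be split into training and test sets.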

mvp_prediction trains a regression model to predict the next Most Valuable Player (MVP) lineup for the NBA. The model is trained on past data and metrics for previous MVPs, in relation to their teams' performances. Training used Linear SVR and KNeighbors regression from the scikit-learn framework to determine which model performs better at predicting the NBA MVP from past data. The models were fine-tuned with GridSearchCV to find the best model parameters, thereby improving the accuracy of the model.
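The comparison described above could be sketched roughly as follows. This is an illustration on synthetic data, not the notebook's actual code: it assumes scikit-learn's `LinearSVR` and `KNeighborsRegressor` classes, and the parameter grids, the scaling step, and the error metric are assumptions chosen for the example.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import LinearSVR

# Synthetic stand-in for player statistics (X) and MVP vote share (y).
rng = np.random.default_rng(0)
X = rng.normal(size=(120, 4))
y = X @ np.array([0.5, -0.2, 0.1, 0.0]) + rng.normal(scale=0.05, size=120)

# Each candidate is a (pipeline, parameter grid) pair; the grids are illustrative.
candidates = {
    "linear_svr": (
        make_pipeline(StandardScaler(), LinearSVR(max_iter=10000, random_state=0)),
        {"linearsvr__C": [0.1, 1.0, 10.0]},
    ),
    "knn": (
        make_pipeline(StandardScaler(), KNeighborsRegressor()),
        {"kneighborsregressor__n_neighbors": [3, 5, 9]},
    ),
}

results = {}
for name, (model, grid) in candidates.items():
    # 5-fold cross-validated grid search, scored by (negated) mean squared error.
    search = GridSearchCV(model, grid, cv=5, scoring="neg_mean_squared_error")
    search.fit(X, y)
    results[name] = (search.best_params_, -search.best_score_)

# The model with the lowest cross-validated error is taken as the better one.
best_name = min(results, key=lambda n: results[n][1])
print(best_name, results[best_name])
```

Keeping both models behind the same search loop means they are tuned and scored on identical folds, which makes the comparison fair.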

optimization

feature impt

backtesting the models

backtesting
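One common way to backtest a season-based model is to train on all earlier seasons and evaluate on the next one, repeating this for each season in turn. The sketch below assumes that approach; the `backtest` helper, the column names (`Year`, `PTS`, `W_pct`, `Share`), and the synthetic data are hypothetical and may not match the notebook.

```python
import numpy as np
import pandas as pd
from sklearn.metrics import mean_squared_error
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in: 30 players per season for seasons 2015 to 2020.
rng = np.random.default_rng(1)
years = np.repeat(np.arange(2015, 2021), 30)
stats = pd.DataFrame({
    "Year": years,
    "PTS": rng.uniform(5, 30, size=years.size),
    "W_pct": rng.uniform(0.2, 0.8, size=years.size),
})
stats["Share"] = 0.02 * stats["PTS"] * stats["W_pct"]  # made-up target

def backtest(df, model, predictors, target, start_index=3):
    """Train on every season before `year`, test on `year`, for each later season."""
    seasons = sorted(df["Year"].unique())
    errors = {}
    for year in seasons[start_index:]:
        train = df[df["Year"] < year]   # only past seasons, so no look-ahead
        test = df[df["Year"] == year]
        model.fit(train[predictors], train[target])
        preds = model.predict(test[predictors])
        errors[int(year)] = mean_squared_error(test[target], preds)
    return errors

errors = backtest(stats, KNeighborsRegressor(n_neighbors=5), ["PTS", "W_pct"], "Share")
print(errors)
```

Because each test season is predicted only from earlier seasons, the per-season errors approximate how the model would have performed had it been used at the time.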

The supervised_models repo contains other classical machine learning prediction work, such as Premier_league Prediction. The premier_league prediction is a model for predicting which team will win the next match. The model could provide better predictive information for game betting markets, provided its accuracy improves. The Premier League is the English football league; it comprises 20 teams, each of which plays 38 matches within the league calendar to determine the Premier League champion. The model is trained on a dataset I scraped from a football website, covering several years of league standing tables and teams' performances.

premier league 1

In training the model, different regression techniques from scikit-learn were tested to determine which technique gives the best accuracy on the dataset.

pr lg models

Setting the target for the model prediction

pr_lg target
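As an illustration of setting a target for match prediction, the sketch below encodes a win as 1 and anything else as 0. The `team` and `result` columns and the W/D/L coding are assumptions made for this example; the scraped dataset may use different names and encodings.

```python
import pandas as pd

# Hypothetical match rows; one row per team per match.
matches = pd.DataFrame({
    "team": ["Team A", "Team B", "Team C", "Team D"],
    "result": ["W", "D", "L", "W"],  # assumed coding: Win / Draw / Loss
})

# Target is 1 when the team won the match, 0 for a draw or a loss.
matches["target"] = (matches["result"] == "W").astype(int)
print(matches)
```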

pr lg preds

Spotify Trend is an analysis of artists' tracks and albums released over the years and their performance across different streaming platforms. The analysis and modeling help to understand the reach of a track and the possible return on investment for an artist in making such tracks. The analysis also gives insight into which platforms can matter more to artists in reaching more listeners and consumers.

sportify

Student Performance is a comprehensive, in-depth analysis of a dataset of students' academic performance, grouping the students according to their Grade Classes. For every student, there are factors that could affect classroom performance positively or negatively. In this analysis, I looked into the different factors that influence students' grades, such as parental support, volunteering, participation in extracurricular activities, etc.

students corr

I also developed a regression model to predict the grade level of the students based on several of the factors that affect them. More importantly, I applied the monotonic constraint of the HistGradientBoostingRegressor to the model to control the direction in which each feature influences the target.

mono_cst
