Team Members:
- Aniket Kanoujia (2021377)
- Ashutosh Gera (2021026)
- Piyush Kumar (2021180)
- Tushar Chandra (2021211)
Dataset: Spotify 1.2M+
Important Files:
tracks_featues.csv
: Original dataset (deleted due to size constraints)final.csv
: Processed datasettsne1.2.csv
: T-SNE of the whole dataset (took around 15 hrs to run 😅).
For any queries, feel free to reach out! 🚀
Our project grooves to the beat of creating a Music Recommendation System using Machine Learning. 🕺 We stand out by spinning unsupervised models that predict song recommendations based on user preferences, creating a more personalized and diverse music listening experience. 🎧
In a world filled with music, the quest for discovering new artists and tunes is never-ending. Our Music Recommendation System is designed to decode the complexities of recommendation algorithms, focusing on content-based filtering and real-time learning. Unlike mainstream platforms, our project strives to deliver a personalized music recommendation system for individual users, enhancing their overall music listening experience.
Our project addresses the limitations of existing music recommendation systems, such as limited diversity, popularity bias, and challenges in incorporating user feedback. By leveraging machine learning on audio features, we aim to strike a balance between personalized recommendations and introducing diversity.
We draw inspiration from various studies, including:
- Dynamic K-means Clustering Algorithm for Music Recommendation
- Personalized Music Recommendation model based on Machine Learning
- Music Recommendation System Using Machine Learning
- Automated Playlist Generation from Personal Music Libraries
These studies influence our methodology, focusing on dynamic clustering, context-aware recommendation, and correlation analysis.
We utilize the Spotify 1.2M+ dataset, containing audio features of over 1.2 million songs obtained with the Spotify API.
After preprocessing, our dataset includes 16 features, such as danceability, energy, and tempo. We analyze trends across generations and create visualizations, including word clouds and feature trend plots.
Our project employs content-based filtering using machine learning algorithms on various audio features. We explore collaborative filtering, content-based filtering, and hybrid filtering.
We implement various machine learning algorithms, including:
- K Nearest Neighbours
- Cosine Similarity
- K Means Clustering
- Gaussian Mixture Model
- DBSCAN
- Agglomerative Clustering
- Spectral Clustering
We provide insights into each algorithm's application and use cases.
We generate playlists using cluster similarity and feature similarity with content-based filtering. The output songs are obtained through cosine similarity, K Means, Gaussian Mixture Model, DBSCAN, Agglomerative Clustering, and Spectral Clustering.
We present the top recommended songs for a given input song using different algorithms. There is a partial overlap between recommended songs, indicating the effectiveness of our algorithms in suggesting songs with high feature similarity.
Our project successfully implements various unsupervised machine learning algorithms to recommend songs based on user input. The partial overlap in recommendations validates the efficacy of our algorithms in providing diverse yet personalized suggestions.