Skip to content

Extracted all unique tracks from Million Playlist Dataset and added features to every track using Spotify API.

License

Notifications You must be signed in to change notification settings

mujsigai/mpd-track-dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MPD TRACK DATASET 🎵

Extracted all unique tracks from Million Playlist Dataset and added audio features to every track using Spotify API.

Total Unique Songs: 2,261,644

Useage of dataset

The dataset is in the form of SQL table. You can find the data on kaggle here. Use this while respecting Spotify's Policies. We are not responsible for anything done by you using this dataset.

Replicate the extraction

  1. Install the Million Playlist Dataset from here.
  2. Copy the data folder into the root project folder.
  3. Run per_json.py -> sql_data.py -> easy_extraction.py.
  4. The last step will take weeks due to rate limits by spotify.

Future plans

Million Playlist Dataset only has songs till 2018. We plan on looking for more songs from 2018 to 2024.

About

Extracted all unique tracks from Million Playlist Dataset and added features to every track using Spotify API.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages