Skip to content

Implementation of the Shazam paper, "An Industrial-Strength Audio Search Algorithm" - Wang 2003

Notifications You must be signed in to change notification settings

ruarim/audio_identification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio Identification

Audio identification is a class of information retrieval systems designed to match a query with a set of database documents. The objective is to find the most similar track in a database given a short audio snippet.

This implementation utilizes Wang's fingerprinting method (Wang 2003), leveraging frequency pairs and time difference hashes to efficiently match query audio with the database.

Audio Identification Diagram

Performance

Requirements

The following Python libraries are required:

  • Numpy
  • Librosa
  • Matplotlib
  • Skimage

Install the required packages via pip:

pip install numpy librosa matplotlib scikit-image

References

  • An Industrial-Strength Audio Search Algorithm (Wang, 2003)

About

Implementation of the Shazam paper, "An Industrial-Strength Audio Search Algorithm" - Wang 2003

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages