Skip to content

tanmayy24/Multi-task-Learning-for-sound-event-detection

Repository files navigation

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

Domestic environment sound event detection task leveraging some distinctive high-level acoustic characteristics of various sound events to assist the SED model training, without requiring additional labeled data.


DCASE Task 4

Multitask Learning DCASE Task 4 recipe:

Challenge website here

Installation Notes

You want to run the MTL DCASE 2022 Task 4 system

Go to ./recipes/dcase2022_task4_baseline and follow the instructions there in the README.md

In the recipe, we provide a conda script that creates a suitable conda environment with all dependencies, including pytorch with GPU support in order to run the recipe. There are also instructions for data download and preparation.

You need only desed_task package for other reasons

Run python setup.py install to install the desed_task package

Citation

If this work is helpful, please feel free to cite the following paper:

"Tanmay Khandelwal and Rohan Kumar Das, “A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds”, in Proc. Interspeech 2023, Dublin, Ireland, August 2023."
To access the paper: Arxiv

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published