Author: Toni Heittola, Tampere University Email, Homepage, GitHub
This repository contains code examples for the tutorial presented in ICASSP 2019 conference by Tuomas Virtanen, Annamaria Mesaros, and Toni Heittola.
Slides for the tutorial can be found here.
- Sound classification using acoustic scene classification as example application, system is based convolutional neural networks (CNN) architecture (
code_examples/sound_classification_example.ipynb
) - Sound event detection system based on convolutional recurrent neural networks (CRNN) architecture (
code_examples/sound_event_detection_example.ipynb
)
Example systems are implemented in Python (version 3.x) with Keras (Tensorflow backend) and published as Jupyter notebooks. Jupyter notebooks allow either just viewing or interactive code execution through web browser based interface.
-
To view notebooks just click them above.
-
For interactive code execution:
- Clone or download the repository
- Make sure Jupyter is installed (
pip install jupyter
), e.g. Anaconda Python has it pre-installed - Make sure pre-requisites are installed, e.g. by running command
pip install -r requirements.txt
insidecode_examples
directory - Start notebook from command prompt with command
jupyter notebook
(inside thecode_examples
directory), this opens browser interface. - Open notebook file (*.ipynb) from the list
- Notebook is constructed from code and text cells, start from the top and execute cell by cell starting from first cell with
shift+enter
Note : Training of these systems is computationally rather heavy, and GPU accelerated training is advisable. (Make sure to install GPU enabled Tensorflow version). Directory code_examples/data/
contains pre-trained models (along with training history), and system outputs for the test set to allow only partial code execution.
- First public release
This software is released under the terms of the MIT License.