This project explores the feasibility of using voice cloning as a solution for anonymizing the voices of child speakers. The aim is to protect the privacy of minors by altering their voices while retaining speech utility (including intelligibility and naturalness).
Follow these steps to set up the environment:
-
Create a new conda environment:
conda create -n child_speaker_anonymization python=3.10
-
Activate the environment:
conda activate child_speaker_anonymization
-
Install required dependencies:
pip install -r requirements.txt
If you use OpenMind, the data for our first experiments is stored here: <TODO: ADD>
. Soon(ish) we will set up a datalad repo for taking care of data versioning (TODO).
To run the project, simply execute:
python main.py
Access the project files and related data here.
-[] SETUP DATALAD WITH GOOGLE DRIVE (SUBDATASETS) - for now you can find it at /nese/mit/group/sig/projects/fabiocat/children_voice_anonymization
-[] VOICE CLONING PIPELINE
-[] ASR BENCHMARKING
-[] SER BENCHMARKING