Please press ⭐ button and/or cite papers if you feel helpful.
This repository contains the codebase for the wav2graph paper:
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech
https://www.arxiv.org/abs/2408.04174
In the wav2graph paper, we introduce the first framework for supervised learning knowledge graph from speech data. This repository provides the necessary scripts, configurations, and setup instructions to reproduce the experiments discussed in the paper.
To set up the environment and run the experiments, follow the steps below:
Before you start, create a Python virtual environment and install the required dependencies.
pip install -r requirements.txt
You will need a Hugging Face API token to access certain resources used in this project. Insert your Hugging Face token into the relevant YAML configuration files.
Once the environment is set up and the configurations are complete, you can run the experiments using the provided script.
sh run.sh
@misc{leduc2024wav2graphframeworksupervisedlearning,
title={wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech},
author={Khai Le-Duc and Quy-Anh Dang and Tan-Hanh Pham and Truong-Son Hy},
year={2024},
eprint={2408.04174},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2408.04174},
}
Core developers:
Khai Le-Duc
University of Toronto, Canada
Email: duckhai.le@mail.utoronto.ca
GitHub: https://github.com/leduckhai
Quy-Anh Dang
VNU University of Science, Vietnam
GitHub: https://github.com/QuyAnh2005
Facebook: https://www.facebook.com/anh.q.dang.5