YuanGongND

Follow

Yuan Gong YuanGongND

Follow

Research Scientist, MIT CSAIL

385 followers · 2 following

MIT
Cambridge, MA
06:21 (UTC -05:00)
yuangongnd.github.io

Achievements

Achievements

Pinned Loading

ltu ltu Public

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 382 36
whisper-at whisper-at Public

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 320 27
gopt gopt Public

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Python 150 27
cav-mae cav-mae Public

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 232 23
ssast ssast Public

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 364 61
ast ast Public

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1.2k 214