I am a research scientist at TikTok, based in Singapore.
I currently work on audio-driven talking face generation, text-to-speech, and music generation. If you are interested in academic collaboration, please feel free to email me at ren.yi@bytedance.com. We are hiring interns!
I received my bachelor's degree from Chu Kochen Honors College, Zhejiang University (浙江大学竺可桢学院) and my master's degree from the Department of Computer Science and Technology, Zhejiang University (浙江大学计算机科学与技术学院), advised by Zhou Zhao (赵洲). I also collaborate closely with Xu Tan (谭旭), Tao Qin (秦涛) and Tie-Yan Liu (刘铁岩) from Microsoft Research Asia.
I won the Baidu Scholarship (10 candidates worldwide each year) and the ByteDance Scholars Program (10 candidates worldwide each year) in 2020, and was selected as one of the Top 100 AI Chinese New Stars and as an AI Chinese New Star Outstanding Scholar (10 candidates worldwide each year).
My research interests include speech synthesis, neural machine translation and automatic music generation. I have published 50+ papers at top international AI conferences such as NeurIPS, ICML, ICLR and KDD.
To promote communication within the Chinese ML & NLP community, we (together with 11 other young scholars worldwide) founded the MLNLP community in 2021. I am honored to be one of the chairs of the MLNLP committee.
- Personal Pages: https://rayeren.github.io (updated recently🔥)
- LinkedIn: https://www.linkedin.com/in/rayeren
- Google Scholar: https://scholar.google.com/citations?user=4FA6C0AAAAAJ
- DBLP: https://dblp.org/pid/75/6568-6.html
- 2024.03: 🎉 Two papers were accepted by ICLR 2024
- 2023.05: 🎉 Five papers were accepted by ACL 2023
- 2023.01: DiffSinger was featured in a very popular video (2 million+ views) on Bilibili!
- 2023.01: I joined TikTok as a speech research scientist in Singapore!
- 2022.02: I released a modern and responsive academic personal homepage template. Stars and forks are welcome!
My full paper list is available on my personal homepage.
ICLR 2021
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech, Yi Ren, Chenxu Hu, Xu Tan, et al.
NeurIPS 2019
FastSpeech: Fast, Robust and Controllable Text to Speech, Yi Ren, Yangjun Ruan, Xu Tan, et al.
ICLR 2024
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis, Ziyue Jiang, Jinglin Liu, Yi Ren, et al.
AAAI 2022
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism, Jinglin Liu, Chengxi Li, Yi Ren, et al.
NeurIPS 2021
PortaSpeech: Portable and High-Quality Generative Text-to-Speech, Yi Ren, Jinglin Liu, Zhou Zhao
ICML 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models, Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, et al.
ICLR 2023
Bag of Tricks for Unsupervised Text-to-Speech, Yi Ren, Chen Zhang, Shuicheng Yan
ACL 2022
Learning the Beauty in Songs: Neural Singing Voice Beautifier, Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao
NeurIPS 2022
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech, Ziyue Jiang, Zhe Su, Zhou Zhao, Qian Yang, Yi Ren, et al.
ICLR 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis, Zhenhui Ye, Tianyun Zhong, Yi Ren, et al.
ICLR 2023
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis, Zhenhui Ye, Ziyue Jiang, Yi Ren, et al.
ACL 2023
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation, Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, et al.
ICLR 2023
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation, Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, et al.
ACL 2020
SimulSpeech: End-to-End Simultaneous Speech to Text Translation, Yi Ren, et al.
ICLR 2019
Multilingual Neural Machine Translation with Knowledge Distillation, Xu Tan, Yi Ren, et al.
ACM-MM 2020
PopMAG: Pop Music Accompaniment Generation, Yi Ren, Jinzheng He, Xu Tan, et al.
ICLR 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds, Luping Liu, Yi Ren, et al.