Yichong Leng

Cited by

	All	Since 2019
Citations	741	741
h-index	14	14
i10-index	18	18

320

160

240

2019202020212022202320248 12 29 95 290 305

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Xu TanPrincipal Researcher and Research Manager, MicrosoftVerified email at microsoft.com
Tao QinSenior Principal Research Manager, Microsoft ResearchVerified email at microsoft.com
Sheng ZhaoMicrosoftVerified email at microsoft.com
Jin XuQwen Team, Alibaba GroupVerified email at alibaba-inc.com
Kaitao SongSenior Researcher, Microsoft ResearchVerified email at microsoft.com
Kai Shen (沈锴)Zhejiang UniversityVerified email at zju.edu.cn
Zeqian JuUniversity of Science and Technology of ChinaVerified email at mail.ustc.edu.cn
Zehua ChenPostDoc at Tsinghua University | Ph.D. from Imperial CollegeVerified email at imperial.ac.uk
Junliang GuoMicrosoft ResearchVerified email at microsoft.com
Xiang-Yang Li (李向阳)ACM Fellow, IEEE Fellow; Professor, CS @USTC, China;Verified email at ustc.edu.cn
Hande DongTencentVerified email at tencent.com

Yichong Leng

University of Science and Technology of China

Verified email at mail.ustc.edu.cn

Speech Processing NLP


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Naturalspeech: End-to-end text-to-speech synthesis with human-level quality X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	135	2024
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian arXiv preprint arXiv:2304.09116, 2023	106	2023
MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network Y Leng, X Tan, S Zhao, F Soong, XY Li, T Qin ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	83	2021
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition Y Leng, X Tan, L Zhu, J Xu, R Luo, L Liu, T Qin, XY Li, E Lin, TY Liu Advances in Neural Information Processing Systems 34, 2021	61*	2021
Prompttts: Controllable text-to-speech with text descriptions Z Guo, Y Leng, Y Wu, S Zhao, X Tan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	60	2023
Binauralgrad: A two-stage conditional diffusion probabilistic model for binaural audio synthesis Y Leng, Z Chen, J Guo, H Liu, J Chen, X Tan, D Mandic, L He, X Li, T Qin, ... Advances in Neural Information Processing Systems 35, 23689-23700, 2022	48	2022
Fastcorrect 2: Fast error correction on multiple candidates for automatic speech recognition Y Leng, X Tan, R Wang, L Zhu, J Xu, W Liu, L Liu, T Qin, XY Li, E Lin, ... Findings of EMNLP 2021, 2021	34	2021
Unsupervised pivot translation for distant languages Y Leng, X Tan, T Qin, XY Li, TY Liu ACL 2019, 2019	31	2019
Analyzing and mitigating interference in neural architecture search J Xu, X Tan, K Song, R Luo, Y Leng, T Qin, TY Liu, J Li International Conference on Machine Learning, 24646-24662, 2022	29	2022
Naturalspeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... arXiv preprint arXiv:2403.03100, 2024	27	2024
Microsoft Research Asia's systems for WMT19 Y Xia, X Tan, F Tian, F Gao, W Chen, Y Fan, L Gong, Y Leng, R Luo, ... arXiv preprint arXiv:1911.06191, 2019	26	2019
Prompttts 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... arXiv preprint arXiv:2309.02285, 2023	16	2023
Softcorrect: Error correction with soft detection for automatic speech recognition Y Leng, X Tan, W Liu, K Song, R Wang, XY Li, T Qin, E Lin, TY Liu Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13034 …, 2023	16	2023
Speech-t: Transducer for text to speech and beyond J Chen, X Tan, Y Leng, J Xu, G Wen, T Qin, TY Liu Advances in Neural Information Processing Systems 34, 6621-6633, 2021	16	2021
Resgrad: Residual denoising diffusion probabilistic models for text to speech Z Chen, Y Wu, Y Leng, J Chen, H Liu, X Tan, Y Cui, K Wang, L He, S Zhao, ... arXiv preprint arXiv:2212.14518, 2022	14	2022
Transcormer: Transformer for sentence scoring with sliding language modeling K Song, Y Leng, X Tan, Y Zou, T Qin, D Li Advances in Neural Information Processing Systems 35, 11160-11174, 2022	11	2022
Mask the correct tokens: An embarrassingly simple approach for error correction K Shen, Y Leng, X Tan, S Tang, Y Zhang, W Liu, E Lin arXiv preprint arXiv:2211.13252, 2022	11	2022
A study of multilingual neural machine translation X Tan, Y Leng, J Chen, Y Ren, T Qin, TY Liu arXiv preprint arXiv:1912.11625, 2019	11	2019
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension Q Yang, J Xu, W Liu, Y Chu, Z Jiang, X Zhou, Y Leng, Y Lv, Z Zhao, ... arXiv preprint arXiv:2402.07729, 2024	2	2024
Extract and Attend: Improving Entity Translation in Neural Machine Translation Z Zeng, R Wang, Y Leng, J Guo, X Tan, T Qin, T Liu arXiv preprint arXiv:2306.02242, 2023	2	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors