MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network Y Leng, X Tan, S Zhao, F Soong, XY Li, T Qin ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 38 | 2021 |
Naturalspeech: End-to-end text to speech synthesis with human-level quality X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ... arXiv preprint arXiv:2205.04421, 2022 | 26 | 2022 |
Unsupervised pivot translation for distant languages Y Leng, X Tan, T Qin, XY Li, TY Liu ACL 2019, 2019 | 23 | 2019 |
Microsoft Research Asia's Systems for WMT19 Y Xia, X Tan, F Tian, F Gao, W Chen, Y Fan, L Gong, Y Leng, R Luo, ... arXiv preprint arXiv:1911.06191, 2019 | 21 | 2019 |
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition Y Leng, X Tan, L Zhu, J Xu, R Luo, L Liu, T Qin, XY Li, E Lin, TY Liu Advances in Neural Information Processing Systems 34, 2021 | 19* | 2021 |
Analyzing and mitigating interference in neural architecture search J Xu, X Tan, K Song, R Luo, Y Leng, T Qin, TY Liu, J Li International Conference on Machine Learning, 24646-24662, 2022 | 12 | 2022 |
Fastcorrect 2: Fast error correction on multiple candidates for automatic speech recognition Y Leng, X Tan, R Wang, L Zhu, J Xu, W Liu, L Liu, T Qin, XY Li, E Lin, ... Findings of EMNLP 2021, 2021 | 10 | 2021 |
A study of multilingual neural machine translation X Tan, Y Leng, J Chen, Y Ren, T Qin, TY Liu arXiv preprint arXiv:1912.11625, 2019 | 9 | 2019 |
Binauralgrad: A two-stage conditional diffusion probabilistic model for binaural audio synthesis Y Leng, Z Chen, J Guo, H Liu, J Chen, X Tan, D Mandic, L He, X Li, T Qin, ... Advances in Neural Information Processing Systems 35, 23689-23700, 2022 | 5 | 2022 |
Speech-t: Transducer for text to speech and beyond J Chen, X Tan, Y Leng, J Xu, G Wen, T Qin, TY Liu Advances in Neural Information Processing Systems 34, 6621-6633, 2021 | 5 | 2021 |
A study on the efficacy of model pre-training in developing neural text-to-speech system G Zhang, Y Leng, D Tan, Y Qin, K Song, X Tan, S Zhao, T Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2021 | 2 | 2021 |
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition Y Leng, X Tan, W Liu, K Song, R Wang, XY Li, T Qin, E Lin, TY Liu arXiv preprint arXiv:2212.01039, 2022 | 1 | 2022 |
Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction K Shen, Y Leng, X Tan, S Tang, Y Zhang, W Liu, E Lin arXiv preprint arXiv:2211.13252, 2022 | 1 | 2022 |
PromptTTS: Controllable Text-to-Speech with Text Descriptions Z Guo, Y Leng, Y Wu, S Zhao, X Tan arXiv preprint arXiv:2211.12171, 2022 | 1 | 2022 |
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech Z Chen, Y Wu, Y Leng, J Chen, H Liu, X Tan, Y Cui, K Wang, L He, S Zhao, ... arXiv preprint arXiv:2212.14518, 2022 | | 2022 |
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling K Song, Y Leng, X Tan, Y Zou, T Qin, D Li arXiv preprint arXiv:2205.12986, 2022 | | 2022 |