Lei He

Cited by

	All	Since 2019
Citations	2815	2463
h-index	26	24
i10-index	51	46

Citations per year
Year	Citations
2015	8
2016	93
2017	71
2018	134
2019	121
2020	198
2021	308
2022	355
2023	794
2024	663

Public access (based on funding mandates): 2 articles

Principal Scientist Manager, Microsoft

Verified email at microsoft.com

artificial intelligence, human language processing, speech synthesis, speech recognition, pronunciation assessment


Title	Authors	Venue	Cited by	Year
Neural codec language models are zero-shot text to speech synthesizers	C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...	arXiv preprint arXiv:2301.02111, 2023	379	2023
Learning latent representations for style control and transfer in end-to-end speech synthesis	YJ Zhang, S Pan, L He, ZH Ling	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	274	2019
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network	P Wang, Y Qian, FK Soong, L He, H Zhao	arXiv preprint arXiv:1510.06168, 2015	160	2015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis	Y Fan, Y Qian, FK Soong, L He	2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	155	2015
NaturalSpeech: End-to-end text-to-speech synthesis with human-level quality	X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ...	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	135	2024
A unified tagging solution: Bidirectional LSTM recurrent neural network with word embedding	P Wang, Y Qian, FK Soong, L He, H Zhao	arXiv preprint arXiv:1511.00215, 2015	119	2015
Developing RNN-T models surpassing high-performance hybrid models with customization capability	J Li, R Zhao, Z Meng, Y Liu, W Wei, S Parthasarathy, V Mazalov, Z Wang, ...	arXiv preprint arXiv:2007.15188, 2020	112	2020
NaturalSpeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers	K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian	arXiv preprint arXiv:2304.09116, 2023	106	2023
Robust sequence-to-sequence acoustic modeling with stepwise monotonic attention for neural TTS	M He, Y Deng, L He	arXiv preprint arXiv:1906.00672, 2019	97	2019
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling	Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...	arXiv preprint arXiv:2303.03926, 2023	90	2023
Word embedding for recurrent neural network based TTS synthesis	P Wang, Y Qian, FK Soong, L He, H Zhao	2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	74	2015
Conversational end-to-end TTS for voice agents	H Guo, S Zhang, FK Soong, L He, L Xie	2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021	61	2021
Improving prosody with linguistic and BERT derived features in multi-speaker based Mandarin Chinese neural TTS	Y Xiao, L He, H Ming, FK Soong	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	60	2020
A new GAN-based end-to-end TTS training algorithm	H Guo, FK Soong, L He, L Xie	arXiv preprint arXiv:1904.04775, 2019	59	2019
DelightfulTTS: The Microsoft speech synthesis system for Blizzard Challenge 2021	Y Liu, Z Xu, G Wang, K Chen, B Li, X Tan, J Li, L He, S Zhao	arXiv preprint arXiv:2110.12612, 2021	58	2021
AdaSpeech 4: Adaptive text to speech in zero-shot scenarios	Y Wu, X Tan, B Li, L He, S Zhao, R Song, T Qin, TY Liu	arXiv preprint arXiv:2204.00436, 2022	54	2022
BinauralGrad: A two-stage conditional diffusion probabilistic model for binaural audio synthesis	Y Leng, Z Chen, J Guo, H Liu, J Chen, X Tan, D Mandic, L He, X Li, T Qin, ...	Advances in Neural Information Processing Systems 35, 23689-23700, 2022	48	2022
Speaker and language factorization in DNN-based TTS synthesis	Y Fan, Y Qian, FK Soong, L He	2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	46	2016
Learning distributed word representations for bidirectional LSTM recurrent neural network	P Wang, Y Qian, FK Soong, L He, H Zhao	Proceedings of the 2016 Conference of the North American Chapter of the …, 2016	40	2016
Exploiting syntactic features in a parsed tree to improve end-to-end TTS	H Guo, FK Soong, L He, L Xie	arXiv preprint arXiv:1904.04764, 2019	38	2019
