Follow
Jonathan Shen
Jonathan Shen
Verified email at google.com
Title
Cited by
Cited by
Year
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
2018 IEEE international conference on acoustics, speech and signal …, 2018
33232018
Transfer learning from speaker verification to multispeaker text-to-speech synthesis
Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...
Advances in neural information processing systems 31, 2018
10002018
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
arXiv preprint arXiv:1810.07217, 2018
2822018
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
2122019
Parallel tacotron: Non-autoregressive and controllable tts
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1312021
SATzilla2012: Improved algorithm selection based on cost-sensitive classification models
L Xu, F Hutter, J Shen, HH Hoos, K Leyton-Brown
Proceedings of SAT Challenge, 57-58, 2012
1262012
Non-attentive tacotron: Robust and controllable neural tts synthesis including unsupervised duration modeling
J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu
arXiv preprint arXiv:2010.04301, 2020
1022020
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS
Y Jia, H Zen, J Shen, Y Zhang, Y Wu
arXiv preprint arXiv:2103.15060, 2021
842021
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu
arXiv preprint arXiv:2103.14574, 2021
732021
Neural program synthesis with priority queue training
DA Abolafia, M Norouzi, J Shen, R Zhao, QV Le
arXiv preprint arXiv:1801.03526, 2018
722018
Synthesizing speech from text using neural networks
Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ...
US Patent 10,971,170, 2021
522021
In teacher we trust: Learning compressed models for pedestrian detection
J Shen, N Vesdapunt, VN Boddeti, KM Kitani
arXiv preprint arXiv:1612.00478, 2016
392016
Examining scaling and transfer of language model architectures for machine translation
B Zhang, B Ghorbani, A Bapna, Y Cheng, X Garcia, J Shen, O Firat
International Conference on Machine Learning, 26176-26192, 2022
132022
Synthesis of speech from text in a voice of a target speaker using neural networks
Y Jia, Z Chen, Y Wu, J Shen, R Pang, RJ Weiss, IL Moreno, F Ren, ...
US Patent 11,488,575, 2022
72022
Training text-to-speech systems from synthetic data: A practical approach for accent transfer tasks
L Finkelstein, H Zen, N Casagrande, C Chan, Y Jia, T Kenter, A Petelin, ...
arXiv preprint arXiv:2208.13183, 2022
52022
Parallel tacotron non-autoregressive and controllable TTS
I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun
US Patent 11,908,448, 2024
32024
Modelling intonation in spectrograms for neural vocoder based text-to-speech
V Wan, J Shen, H Siilen, R Clark
Proc. SpeechProsody 2020, 945-949, 2020
22020
Text-to-speech using duration prediction
Y Zhang, I Elias, B Chun, Y Jia, Y Wu, M Chrzanowski, J Shen
US Patent App. 17/492,543, 2022
12022
Phonemes And Graphemes for Neural Text-to-Speech
Y Jia, B Chun, Y Zhang, J Shen, Y Wu
US Patent App. 18/746,809, 2024
2024
Parallel Tacotron Non-Autoregressive and Controllable TTS
I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun
US Patent App. 18/421,116, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20