Softflow: Probabilistic framework for normalizing flow on manifolds H Kim, H Lee, WH Kang, JY Lee, NS Kim Advances in Neural Information Processing Systems 33, 16388-16397, 2020 | 119 | 2020 |
Transfer learning framework for low-resource text-to-speech using a large-scale unlabeled speech corpus M Kim, M Jeong, BJ Choi, S Ahn, JY Lee, NS Kim arXiv preprint arXiv:2203.15447, 2022 | 25 | 2022 |
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech BJ Choi, M Jeong, JY Lee, NS Kim IEEE Signal Processing Letters 29, 2502-2506, 2022 | 15 | 2022 |
Reformer-TTS: Neural Speech Synthesis with Reformer Network. HR Ihm, JY Lee, BJ Choi, SJ Cheon, NS Kim INTERSPEECH, 2012-2016, 2020 | 11 | 2020 |
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis. JY Lee, SJ Cheon, BJ Choi, NS Kim, E Song INTERSPEECH, 917-921, 2018 | 7 | 2018 |
Gated recurrent attention for multi-style speech synthesis SJ Cheon, JY Lee, BJ Choi, H Lee, NS Kim Applied Sciences 10 (15), 5325, 2020 | 5 | 2020 |
Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis JY Lee, JS Bae, S Mun, J Lee, JH Lee, HY Cho, C Kim Interspeech 2023, 2023 | 4 | 2023 |
Into-tts: Intonation template based prosody control system J Lee, JY Lee, H Choi, S Mun, S Park, JS Bae, C Kim arXiv preprint arXiv:2204.01271, 2022 | 4 | 2022 |
Memory attention: Robust alignment using gating mechanism for end-to-end speech synthesis JY Lee, SJ Cheon, BJ Choi, NS Kim IEEE Signal Processing Letters 27, 2004-2008, 2020 | 4 | 2020 |
Efficient Parallel Audio Generation Using Group Masked Language Modeling M Jeong, M Kim, JY Lee, NS Kim IEEE Signal Processing Letters, 2024 | 3 | 2024 |
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction M Kim, M Jeong, BJ Choi, S Kim, JY Lee, NS Kim arXiv, 2024 | 2 | 2024 |
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space J Lee, JS Bae, S Mun, H Choi, JY Lee, HY Cho, C Kim arXiv preprint arXiv:2211.03078, 2022 | 2 | 2022 |
Latent Filling: Latent Space Data Augmentation for Zero-Shot Speech Synthesis JS Bae, JY Lee, JH Lee, S Mun, T Kang, HY Cho, C Kim IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024 | 1 | 2024 |
High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model JY Lee, M Jeong, M Kim, JH Lee, HY Cho, NS Kim Interspeech 2024, 2024 | | 2024 |
MELS-TTS : Multi-Emotion Multi-Lingual Multi-Speaker Text-To-Speech System Via Disentangled Style Tokens H Choi, JS Bae, JY Lee, S Mun, J Lee, HY Cho, C Kim IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024 | | 2024 |
Acoustic modeling and parameter generation using relevance vector machines for speech synthesis DH Hong, JY Lee, NS Kim 2015 23rd European Signal Processing Conference (EUSIPCO), 345-349, 2015 | | 2015 |
Speaker adaptation using relevance vector regression for HMM-based expressive TTS. DH Hong, JY Lee, SY Jang, NS Kim INTERSPEECH, 1216-1220, 2015 | | 2015 |
Speaker Adaptation Using Nonlinear Regression Techniques for HMM-Based Speech Synthesis DH Hong, SJ Kang, JY Lee, NS Kim 2014 Tenth International Conference on Intelligent Information Hiding and …, 2014 | | 2014 |