Suyoun Kim
Research Scientist, Facebook
Joint CTC-attention based end-to-end speech recognition using multi-task learning
S Kim, T Hori, S Watanabe
2017 IEEE international conference on acoustics, speech and signal …, 2017
Hybrid CTC/attention architecture for end-to-end speech recognition
S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017
Multi-channel speech recognition: LSTMs all the way through
H Erdogan, T Hayashi, JR Hershey, T Hori, C Hori, WN Hsu, S Kim, ...
CHiME-4 workshop, 1-4, 2016
Towards language-universal end-to-end speech recognition
S Kim, ML Seltzer
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Multimodal transfer deep learning with applications in audio-visual recognition
S Moon, S Kim, H Wang
NIPS workshop 2015, 2015
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion
D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ...
INTERSPEECH 2021, 2021
Improved training for online end-to-end speech recognition systems
S Kim, ML Seltzer, J Li, R Zhao
Dialog-context aware end-to-end speech recognition
S Kim, F Metze
2018 IEEE Spoken Language Technology Workshop (SLT), 434-440, 2018
Environmental noise embeddings for robust speech recognition
S Kim, B Raj, I Lane
arXiv preprint arXiv:1601.02553, 2016
Improving RNN transducer based ASR with auxiliary tasks
C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig
2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021
Recurrent models for auditory attention in multi-microphone distance speech recognition
S Kim, I Lane
ICLR workshop 2016, 2015
Gated embeddings in end-to-end speech recognition for conversational-context fusion
S Kim, S Dalmia, F Metze
ACL 2019, 2019
Impact of nano-scale through-silicon vias on the quality of today and future 3D IC designs
DH Kim, S Kim, SK Lim
International Workshop on System Level Interconnect Prediction, 1-8, 2011
Improved neural language model fusion for streaming recurrent neural network transducer
S Kim, Y Shangguan, J Mahadeokar, A Bruguier, C Fuegen, ML Seltzer, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Semantic distance: A new metric for asr performance analysis towards spoken language understanding
S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer
INTERSPEECH 2021, 2021
End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition
S Kim, IR Lane
Interspeech, 3867-3871, 2017
Cross-attention end-to-end asr for two-party conversations
S Kim, S Dalmia, F Metze
INTERSPEECH 2019, 2019
Evaluating user perception of speech recognition system quality with semantic distance metric
S Kim, D Le, W Zheng, T Singh, A Arora, X Zhai, C Fuegen, O Kalinli, ...
INTERSPEECH 2022, 2022
Deliberation Model for On-Device Spoken Language Understanding
D Le, A Shrivastava, P Tomasello, S Kim, A Livshits, O Kalinli, ML Seltzer
INTERSPEECH 2022, 2022
Situation informed end-to-end asr for chime-5 challenge
S Kim, S Dalmia, F Metze
CHiME5 workshop, 2018