Suyoun Kim

Cited by

	All	Since 2019
Citations	2554	2275
h-index	17	16
i10-index	19	18

560

280

140

420

20162017201820192020202120222023202419 58 175 264 332 483 490 556 149

Co-authors

Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Takaaki HoriAppleVerified email at apple.com
Mike SeltzerFacebookVerified email at fb.com
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)Verified email at google.com
Christian FuegenFacebook Inc.Verified email at fb.com
Siddharth DalmiaResearch Scientist, Google DeepMindVerified email at google.com
Ian LaneCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Yuan (June) ShangguanStaff Software Engineer, GoogleVerified email at google.com
Seungwhan MoonFacebook, Carnegie Mellon UniversityVerified email at fb.com
Haohan WangSchool of Information Sciences, University of Illinois Urbana-ChampaignVerified email at illinois.edu
Jonathan Le RouxMERLVerified email at merl.com
Chiori HoriMERLVerified email at merl.com
Hakan ErdoganGoogleVerified email at google.com
Wei-Ning HsuFacebook AI Research (FAIR)Verified email at csail.mit.edu
Zhong MengGoogleVerified email at google.com
Jinyu LiPartner Applied Science Manager, MicrosoftVerified email at microsoft.com
Rui ZhaomicrosoftVerified email at microsoft.com
Bhiksha RajCarnegie Mellon UniversityVerified email at cs.cmu.edu
Richard M. SternProfessor of Electrical Engineering and Computer Science, Carnegie Mellon UniversityVerified email at cs.cmu.edu

Suyoun Kim

Research Scientist, Facebook

Verified email at fb.com - Homepage

Speech Recognition Spoken Dialog System Conversational AI Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Joint CTC-attention based end-to-end speech recognition using multi-task learning S Kim, T Hori, S Watanabe 2017 IEEE international conference on acoustics, speech and signal …, 2017	1016	2017
Hybrid CTC/attention architecture for end-to-end speech recognition S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	821	2017
Multi-channel speech recognition: LSTMs all the way through H Erdogan, T Hayashi, JR Hershey, T Hori, C Hori, WN Hsu, S Kim, ... CHiME-4 workshop, 1-4, 2016	86	2016
Towards language-universal end-to-end speech recognition S Kim, ML Seltzer 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	81	2018
Multimodal transfer deep learning with applications in audio-visual recognition S Moon, S Kim, H Wang NIPS workshop 2015, 2015	70*	2015
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ... INTERSPEECH 2021, 2021	67	2021
Improved training for online end-to-end speech recognition systems S Kim, ML Seltzer, J Li, R Zhao INTERSPEECH, 2018	49	2018
Dialog-context aware end-to-end speech recognition S Kim, F Metze 2018 IEEE Spoken Language Technology Workshop (SLT), 434-440, 2018	47	2018
Environmental noise embeddings for robust speech recognition S Kim, B Raj, I Lane arXiv preprint arXiv:1601.02553, 2016	45	2016
Improving RNN transducer based ASR with auxiliary tasks C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig 2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021	41	2021
Recurrent models for auditory attention in multi-microphone distance speech recognition S Kim, I Lane ICLR workshop 2016, 2015	33	2015
Gated embeddings in end-to-end speech recognition for conversational-context fusion S Kim, S Dalmia, F Metze ACL 2019, 2019	29	2019
Impact of nano-scale through-silicon vias on the quality of today and future 3D IC designs DH Kim, S Kim, SK Lim International Workshop on System Level Interconnect Prediction, 1-8, 2011	28	2011
Improved neural language model fusion for streaming recurrent neural network transducer S Kim, Y Shangguan, J Mahadeokar, A Bruguier, C Fuegen, ML Seltzer, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	24	2021
Semantic distance: A new metric for asr performance analysis towards spoken language understanding S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer INTERSPEECH 2021, 2021	23	2021
End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition. S Kim, IR Lane, S Kim, I Lane Interspeech, 3867-3871, 2017	20	2017
Cross-attention end-to-end asr for two-party conversations S Kim, S Dalmia, F Metze INTERSPEECH 2019, 2019	18	2019
Evaluating user perception of speech recognition system quality with semantic distance metric S Kim, D Le, W Zheng, T Singh, A Arora, X Zhai, C Fuegen, O Kalinli, ... INTERSPEECH 2022, 2022	16	2022
Deliberation Model for On-Device Spoken Language Understanding D Le, A Shrivastava, P Tomasello, S Kim, A Livshits, O Kalinli, ML Seltzer INTERSPEECH 2022, 2022	11	2022
Situation informed end-to-end asr for chime-5 challenge S Kim, S Dalmia, F Metze CHiME5 workshop, 2018	9*	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors