Ching-Feng Yeh
Ching-Feng Yeh
Research Scientist, FAIR
Verified email at
Cited by
Cited by
Transformer-transducer: End-to-end speech recognition with self-attention
CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ...
arXiv preprint arXiv:1910.12977, 2019
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition
Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Domain Adversarial Training for Accented Speech Recognition
S Sun, CF Yeh, MY Hwang, M Ostendorf, L Xie
Acoustics, Speech and Signal Processing (ICASSP), 2018 IEEE International …, 2018
Training Augmentation with Adversarial Examples for Robust Speech Recognition
S Sun, CF Yeh, M Ostendorf, MY Hwang, L Xie
Streaming transformer-based acoustic models using self-attention with augmented memory
C Wu, Y Wang, Y Shi, CF Yeh, F Zhang
arXiv preprint arXiv:2005.08042, 2020
Alignment restricted streaming recurrent neural network transducer
J Mahadeokar, Y Shangguan, D Le, G Keren, H Su, T Le, CF Yeh, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2021
RNN-T for latency controlled ASR with improved beam search
M Jain, K Schubert, J Mahadeokar, CF Yeh, K Kalgaonkar, A Sriram, ...
arXiv preprint arXiv:1911.01629, 2019
An integrated framework for transcribing Mandarin-English code-mixed lectures with improved acoustic and language modeling
CF Yeh, CY Huang, LC Sun, LS Lee
2010 7th International Symposium on Chinese Spoken Language Processing, 214-219, 2010
Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms.
YN Chen, Y Huang, CF Yeh, LS Lee
Interspeech, 933-936, 2011
Aipnet: Generative adversarial pre-training of accent-invariant networks for end-to-end speech recognition
YC Chen, Z Yang, CF Yeh, M Jain, ML Seltzer
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Weak-attention suppression for transformer based speech recognition
Y Shi, Y Wang, C Wu, C Fuegen, F Zhang, D Le, CF Yeh, ML Seltzer
arXiv preprint arXiv:2005.09137, 2020
Spoken knowledge organization by semantic structuring and a prototype course lecture system for personalized learning
H Lee, SR Shiang, C Yeh, YN Chen, Y Huang, SY Kong, L Lee
IEEE/ACM transactions on audio, speech, and language processing 22 (5), 883-898, 2014
Superb@ slt 2022: Challenge on generalization and efficiency of self-supervised speech representation learning
T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 1096-1103, 2023
Improved spoken term detection by feature space pseudo-relevance feedback
C Chen, H Lee, C Yeh, L Lee
Eleventh Annual Conference of the International Speech Communication Association, 2010
Semantic distance: A new metric for asr performance analysis towards spoken language understanding
S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer
arXiv preprint arXiv:2104.02138, 2021
Benchmarking lf-mmi, ctc and rnn-t criteria for streaming asr
X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ...
2021 IEEE spoken language technology workshop (SLT), 46-51, 2021
Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures
CF Yeh, LC Sun, CY Huang, LS Lee
2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Y Wang, Y Shi, F Zhang, C Wu, J Chan, CF Yeh, A Xiao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
An improved framework for recognizing highly imbalanced bilingual code-switched lectures with cross-language acoustic modeling and frame-level language identification
CF Yeh, LS Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (7), 1144 …, 2015
The system can't perform the operation now. Try again later.
Articles 1–20