ESPnet: End-to-end speech processing toolkit S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018 | 1742 | 2018 |
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018 | 155 | 2018 |
The multilingual TEDx corpus for speech recognition and translation E Salesky, M Wiesner, J Bremerman, R Cattoni, M Negri, M Turchi, ... arXiv preprint arXiv:2102.01757, 2021 | 140 | 2021 |
Findings of the IWSLT 2022 Evaluation Campaign. A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ... Proceedings of the 19th International Conference on Spoken Language …, 2022 | 112 | 2022 |
Massively multilingual adversarial speech recognition O Adams, M Wiesner, S Watanabe, D Yarowsky arXiv preprint arXiv:1904.02210, 2019 | 89 | 2019 |
Multi-modal data augmentation for end-to-end ASR A Renduchintala, S Ding, M Wiesner, S Watanabe arXiv preprint arXiv:1803.10299, 2018 | 72 | 2018 |
The CHiME-7 DASR challenge: Distant meeting transcription with multiple devices in diverse scenarios S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ... arXiv preprint arXiv:2306.13734, 2023 | 58 | 2023 |
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ... Interspeech, 3597-3601, 2017 | 49 | 2017 |
A corpus for large-scale phonetic typology E Salesky, E Chodroff, T Pimentel, M Wiesner, R Cotterell, AW Black, ... arXiv preprint arXiv:2005.13962, 2020 | 32 | 2020 |
Topic identification for speech without ASR C Liu, J Trmal, M Wiesner, C Harman, S Khudanpur arXiv preprint arXiv:1703.07476, 2017 | 22 | 2017 |
Towards zero-shot code-switched speech recognition B Yan, M Wiesner, O Klejch, P Jyothi, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 20 | 2023 |
Pretraining by backtranslation for end-to-end ASR in low-resource settings M Wiesner, A Renduchintala, S Watanabe, C Liu, N Dehak, S Khudanpur arXiv preprint arXiv:1812.03919, 2018 | 18* | 2018 |
Automatic speech recognition and topic identification for almost-zero-resource languages M Wiesner, C Liu, L Ondel, C Harman, V Manohar, J Trmal, Z Huang, ... arXiv preprint arXiv:1802.08731, 2018 | 17 | 2018 |
Analysis of multilingual sequence-to-sequence speech recognition systems M Karafiát, MK Baskar, S Watanabe, T Hori, M Wiesner, J Černocký arXiv preprint arXiv:1811.03451, 2018 | 14 | 2018 |
End-to-end ASR to jointly predict transcriptions and linguistic annotations M Omachi, Y Fujita, S Watanabe, M Wiesner Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | 13 | 2021 |
Injecting text and cross-lingual supervision in few-shot learning from self-supervised models M Wiesner, D Raj, S Khudanpur ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
The CHiME-8 DASR challenge for generalizable and array agnostic distant automatic speech recognition and diarization S Cornell, T Park, S Huang, C Boeddeker, X Chang, M Maciejewski, ... arXiv preprint arXiv:2407.16447, 2024 | 8 | 2024 |
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop H Hermansky, L Burget, J Cohen, E Dupoux, N Feldman, J Godfrey, ... 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 7 | 2015 |
Bypass temporal classification: Weakly supervised automatic speech recognition with imperfect transcripts D Gao, M Wiesner, H Xu, LP Garcia, D Povey, S Khudanpur arXiv preprint arXiv:2306.01031, 2023 | 6 | 2023 |
JHU IWSLT 2022 dialect speech translation system description J Yang, A Hussein, M Wiesner, S Khudanpur Proceedings of the 19th International Conference on Spoken Language …, 2022 | 6 | 2022 |