Follow
Mengyue Wu
Title
Cited by
Cited by
Year
Multiple sound sources localization from coarse to fine
R Qian, D Hu, H Dinkel, M Wu, N Xu, W Lin
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
1532020
Audio caption: Listen and tell
M Wu, H Dinkel, K Yu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
782019
Towards duration robust weakly supervised sound event detection
H Dinkel, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 887-900, 2021
602021
Investigating local and global information for automated audio captioning with transfer learning
X Xu, H Dinkel, M Wu, Z Xie, K Yu
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
592021
What does a Car-ssette tape tell?
X Xu, H Dinkel, M Wu, K Yu
arXiv preprint arXiv:1905.13448v1, 2019
56*2019
LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation
S Chen, M Wu, KQ Zhu, LC Kunyao Lan, Zhiling Zhang
arXiv preprint arXiv:2305.13614, 2023
53*2023
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning.
X Xu, H Dinkel, M Wu, K Yu
DCASE, 225-229, 2020
502020
Depa: Self-supervised audio embedding for depression detection
P Zhang, M Wu, H Dinkel, K Yu
Proceedings of the 29th ACM international conference on multimedia, 135-143, 2021
492021
Voice activity detection in the wild: A data-driven approach using teacher-student training
H Dinkel, S Wang, X Xu, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1542-1555, 2021
462021
Can audio captions be evaluated with image caption metrics?
Z Zhou, Z Zhang, X Xu, Z Xie, M Wu, KQ Zhu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
382022
Building interpretable interaction trees for deep nlp models
D Zhang, H Zhang, H Zhou, X Bao, D Huo, R Chen, X Cheng, M Wu, ...
Proceedings of the AAAI conference on artificial intelligence 35 (16), 14328 …, 2021
382021
The SJTU system for DCASE2022 challenge task 6: Audio captioning with audio-text retrieval pre-training
X Xu, Z Xie, M Wu, K Yu
Tech. Rep., DCASE2022 Challenge, 2022
342022
Voice activity detection in the wild via weakly supervised sound event detection
H Dinkel, Y Chen, M Wu, K Yu
arXiv preprint arXiv:2003.12222, 2020
312020
Text-based depression detection on sparse data
H Dinkel, M Wu, K Yu
arXiv preprint arXiv:1904.05154, 2019
272019
Audio-text retrieval in context
S Lou, X Xu, M Wu, K Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
252022
Text-to-audio grounding: Building correspondence between captions and sound events
X Xu, H Dinkel, M Wu, K Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
232021
Decoupled dialogue modeling and semantic parsing for multi-turn text-to-SQL
Z Chen, L Chen, H Li, R Cao, D Ma, M Wu, K Yu
arXiv preprint arXiv:2106.02282, 2021
212021
Audio caption in a car setting with a sentence-level loss
X Xu, H Dinkel, M Wu, K Yu
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
202021
Psychiatric scale guided risky post screening for early detection of depression
Z Zhang, S Chen, M Wu, KQ Zhu
arXiv preprint arXiv:2205.09497, 2022
192022
Beyond the status quo: A contemporary survey of advances and challenges in audio captioning
X Xu, Z Xie, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
18*2023
The system can't perform the operation now. Try again later.
Articles 1–20