Follow
Siddharth Sigtia
Title
Cited by
Cited by
Year
An end-to-end neural network for polyphonic piano music transcription
S Sigtia, E Benetos, S Dixon
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (5), 927-939, 2016
4082016
Improved music feature learning with deep neural networks
S Sigtia, S Dixon
2014 IEEE international conference on acoustics, speech and signal …, 2014
1622014
Chime-home: A dataset for sound source recognition in a domestic environment
P Foster, S Sigtia, S Krstulovic, J Barker, MD Plumbley
2015 IEEE Workshop on Applications of Signal Processing to Audio and …, 2015
1022015
Unsupervised feature learning based on deep models for environmental audio tagging
Y Xu, Q Huang, W Wang, P Foster, S Sigtia, PJB Jackson, MD Plumbley
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (6), 1230 …, 2017
972017
Automatic environmental sound recognition: Performance versus computational cost
S Sigtia, AM Stark, S Krstulović, MD Plumbley
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (11 …, 2016
962016
Audio Chord Recognition with a Hybrid Recurrent Neural Network.
S Sigtia, N Boulanger-Lewandowski, S Dixon
ISMIR, 127-133, 2015
872015
A hybrid recurrent neural network for music transcription
S Sigtia, E Benetos, N Boulanger-Lewandowski, T Weyde, ASA Garcez, ...
2015 IEEE international conference on acoustics, speech and signal …, 2015
602015
Multi-Task Learning for Speaker Verification and Voice Trigger Detection
S Sigtia, E Marchi, S Kajarekar, D Naik, J Bridle
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
572020
Multi-task Learning for Voice Trigger Detection
S Sigtia, P Clark, R Haynes, H Richards, J Bridle
2020 IEEE International Conference on Acoustics, Speech and Signal …, 2020
57*2020
Efficient Voice Trigger Detection for Low Resource Hardware
S Sigtia, R Haynes, H Richards, E Marchi, J Bridle
Interspeech, 2092-2096, 2018
482018
An RNN-based music language model for improving automatic music transcription
S Sigtia, E Benetos, S Cherla, T Weyde, A Garcez, S Dixon
http://www. terasoft. com. tw/conf/ismir2014//proceedings …, 2014
482014
Generalised discriminative transform via curriculum learning for speaker recognition
E Marchi, S Shum, K Hwang, S Kajarekar, S Sigtia, H Richards, R Haynes, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
292018
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
S Adya, V Garg, S Sigtia, P Simha, C Dhir
INTERSPEECH, 2020
222020
Progressive voice trigger detection: Accuracy vs latency
S Sigtia, J Bridle, H Richards, P Clark, E Marchi, V Garg
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
142021
Streaming transformer for hardware efficient voice trigger detection and false trigger mitigation
V Garg, W Chang, S Sigtia, S Adya, P Simha, P Dighe, C Dhir
arXiv preprint arXiv:2105.06598, 2021
122021
A denoising autoencoder that guides stochastic search
AW Churchill, S Sigtia, C Fernando
arXiv preprint arXiv:1404.1614, 2014
112014
Learning to generate genotypes with neural networks
AW Churchill, S Sigtia, C Fernando
arXiv preprint arXiv:1604.04153, 2016
62016
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
D Wagner, A Churchill, S Sigtia, P Georgiou, M Mirsamadi, A Mishra, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Fully deep neural networks incorporating unsupervised feature learning for audio tagging
Y Xu, Q Huang, W Wang, P Foster, S Sigtia, PJ Jackson, MD Plumbley
arXiv preprint arXiv:1607.03681, 2016
42016
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models
D Wagner, A Churchill, S Sigtia, P Georgiou, M Mirsamadi, A Mishra, ...
arXiv preprint arXiv:2312.03632, 2023
22023
The system can't perform the operation now. Try again later.
Articles 1–20