Ego4d: Around the world in 3,000 hours of egocentric video K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 672 | 2022 |
Audio Event Detection using Weakly Labeled Data A Kumar, B Raj 24th ACM International Conference on Multimedia (ACM MM), 2016 | 201 | 2016 |
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes A Kumar, M Khadkevich, C Fugen IEEE International Conference on Acoustics, Speech and Signal Processing …, 2018 | 172 | 2018 |
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes A Kumar, M Khadkevich, C Fügen IEEE International Conference on Acoustics, Speech and Signal Processing …, 2018 | 172 | 2018 |
Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks A Kumar, D Florencio Interspeech, 2016 | 141 | 2016 |
Audio event detection from acoustic unit occurrence patterns A Kumar, P Dighe, R Singh, S Chaudhuri, B Raj 2012 IEEE international conference on acoustics, speech and signal …, 2012 | 76 | 2012 |
A closer look at weak label learning for audio events A Shah, A Kumar, AG Hauptmann, B Raj arXiv preprint arXiv:1804.09288, 2018 | 63 | 2018 |
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording B Elizalde, A Kumar, A Shah, A Badlani, E Vincent, B Raj, I Lane Workshop on Detection and Classification of Acoustic Scenes and Events …, 2016 | 60* | 2016 |
Deep cnn framework for audio event recognition using weakly labeled web data A Kumar, B Raj NIPS Workshop on Machine Learning for Audio, 2017 | 55 | 2017 |
Content Based Representations Of Audio Using Siamese Neural Networks P Manocha, R Badlani, A Kumar, A Shah, B Elizalde, B Raj IEEE International Conference on Acoustics, Speech and Signal Processing …, 2018 | 52 | 2018 |
Audio Event and Scene Recognition: A Unified Approach using Strongly and Weakly Labeled Data A Kumar, B Raj International Joint Conference on Neural Networks (IJCNN), 2017 | 49 | 2017 |
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition A Kumar, VK Ithapu International Conference on Machine Learning (ICML), 2020, 2020 | 48 | 2020 |
Informedia@ TrecVID 2014: MED and MER SI Yu, L Jiang, Z Xu, Z Lan, S Xu, X Chang, X Li, Z Mao, C Gan, Y Miao, ... TREC Video Retrieval Evaluation 2014, 2014 | 43 | 2014 |
Remixit: Continual self-training of speech enhancement models via bootstrapped remixing E Tzinis, Y Adi, VK Ithapu, B Xu, P Smaragdis, A Kumar IEEE Journal of Selected Topics in Signal Processing 16 (6), 1329-1341, 2022 | 42 | 2022 |
Multi-Channel Speech Enhancement using Graph Neural Networks P Tzirakis, A Kumar, J Donley IEEE International Conference on Acoustics, Speech and Signal Processing …, 2021 | 40 | 2021 |
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation K Tan, B Xu, A Kumar, E Nachmani, Y Adi IEEE Signal Processing Letters, 2020 | 38 | 2020 |
Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data H Fayek, A Kumar 29th International Joint Conference on Artificial Intelligence (IJCAI), 2020 | 38 | 2020 |
NORESQA--A Framework for Speech Quality Assessment using Non-Matching References P Manocha, B Xu, A Kumar Advances in neural information processing systems, 2021 | 37 | 2021 |
TPARN: Triple-path attentive recurrent network for time-domain multichannel speech enhancement A Pandey, B Xu, A Kumar, J Donley, P Calamia, DL Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 34 | 2022 |
NELS-Never-Ending Learner of Sounds B Elizalde, R Badlani, A Shah, A Kumar, B Raj NIPS Workshop on Machine Learning for Audio, 2018 | 34* | 2018 |