CNN architectures for large-scale audio classification S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ... 2017 ieee international conference on acoustics, speech and signal …, 2017 | 2957 | 2017 |
Ava active speaker: An audio-visual dataset for active speaker detection J Roth, S Chaudhuri, O Klejch, R Marvin, A Gallagher, L Kaver, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 175 | 2020 |
Non-negative matrix factorization based compensation of music for automatic speech recognition. B Raj, T Virtanen, S Chaudhuri, R Singh Interspeech, 717-720, 2010 | 158 | 2010 |
Associating faces with voices for speaker diarization within videos S Chaudhuri, K Hoover US Patent 10,497,382, 2019 | 89 | 2019 |
Audio event detection from acoustic unit occurrence patterns A Kumar, P Dighe, R Singh, S Chaudhuri, B Raj 2012 IEEE international conference on acoustics, speech and signal …, 2012 | 76 | 2012 |
Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification. S Chaudhuri, M Harvilla, B Raj Interspeech, 2265-2268, 2011 | 76 | 2011 |
Engaging collaborative learners with helping agents S Chaudhuri, R Kumar, I Howley, CP Rosé Artificial Intelligence in Education, 365-372, 2009 | 56 | 2009 |
Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen K Hoover, S Chaudhuri, C Pantofaru, I Sturdy, M Slaney 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 48* | 2018 |
Ava-speech: A densely labeled dataset of speech activity in movies S Chaudhuri, J Roth, DPW Ellis, A Gallagher, L Kaver, R Marvin, ... arXiv preprint arXiv:1808.00606, 2018 | 47 | 2018 |
Unsupervised structure discovery for semantic analysis of audio S Chaudhuri, B Raj Advances in Neural Information Processing Systems 25, 2012 | 34 | 2012 |
It’s not easy being green: Supporting collaborative “green design” learning S Chaudhuri, R Kumar, M Joshi, E Terrell, F Higgs, V Aleven, ... Intelligent Tutoring Systems: 9th International Conference, ITS 2008 …, 2008 | 33 | 2008 |
An HMM based part-of-speech tagger and statistical chunker for 3 Indian languages GMR Sastry, S Chaudhuri, PN Reddy Shallow Parsing for South Asian Languages 13, 2007 | 25 | 2007 |
Unsupervised hierarchical structure induction for deeper semantic analysis of audio S Chaudhuri, B Raj 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 24 | 2013 |
Unsupervised word discovery from phonetic input using nested pitman-yor language modeling O Walter, R Haeb-Umbach, S Chaudhuri, B Raj ICRA Workshop on Autonomous Learning, 2013 | 22 | 2013 |
Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia. S Chaudhuri, R Singh, B Raj INTERSPEECH, 1728-1731, 2012 | 19 | 2012 |
Automatic smoothed captioning of non-speech sounds from audio F Wang, S Chaudhuri, D Ellis, N Reale US Patent 10,037,313, 2018 | 17 | 2018 |
Structured Models for Semantic Analysis of Audio Content S Chaudhuri PhD thesis, Carnegie Mellon University. 46, 47, 2013 | 17* | 2013 |
Learning contextual relevance of audio segments using discriminative models over AUD sequences S Chaudhuri, B Raj 2011 IEEE Workshop on Applications of Signal Processing to Audio and …, 2011 | 16 | 2011 |
Helping agents in VMT Y Cui, R Kumar, S Chaudhuri, G Gweon, CP Rosé Studying virtual math teams, 335-354, 2009 | 16 | 2009 |
VMT-Basilica: an environment for rapid prototyping of collaborative learning environments with dynamic support. R Kumar, S Chaudhuri, IK Howley, CP Rosé CSCL (2), 192-194, 2009 | 14 | 2009 |