Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering S Adya, V Garg, S Sigtia, P Simha, C Dhir arXiv preprint arXiv:2008.02323, 2020 | 22 | 2020 |
Progressive Voice Trigger Detection: Accuracy vs Latency S Sigtia, J Bridle, H Richards, P Clark, E Marchi, V Garg ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 14 | 2021 |
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation V Garg, W Chang, S Sigtia, S Adya, P Simha, P Dighe, C Dhir arXiv preprint arXiv:2105.06598, 2021 | 12 | 2021 |
Streaming on-device detection of device directed speech from voice and touch-based invocation OO Rudovic, A Bindal, V Garg, P Simha, P Dighe, S Kajarekar ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Leveraging large language models for exploiting ASR uncertainty P Dighe, Y Su, S Zheng, Y Liu, V Garg, X Niu, A Tewfik ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models V Garg, O Rudovic, P Dighe, AH Abdelaziz, E Marchi, S Adya, C Dhir, ... arXiv preprint arXiv:2203.15975, 2022 | 6 | 2022 |
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types O Rudovic, W Chang, V Garg, P Dighe, P Simha, J Berkowitz, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Streaming Anchor Loss: Augmenting Supervision with Temporal Significance UO Sarawgi, J Berkowitz, V Garg, A Kundu, M Cho, SS Buddi, S Adya, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness S Kumar, SS Buddi, UO Sarawgi, V Garg, S Ranjan, AH Abdelaziz, ... arXiv preprint arXiv:2406.09443, 2024 | | 2024 |
Streaming Anchor Loss: Augmenting Supervision with Temporal Significance J Berkowitz, V Garg, A Kundu, M Cho, SS Buddi, S Adya, A Tewfik arXiv preprint arXiv:2310.05886, 2023 | | 2023 |
Streaming Anchor Loss: Augmenting Supervision with Temporal Significance U Oggy Sarawgi, J Berkowitz, V Garg, A Kundu, M Cho, S Srujana Buddi, ... arXiv e-prints, arXiv:2310.05886, 2023 | | 2023 |
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study A Brueggeman, T Higuchi, M Delfarah, S Shum, V Garg arXiv preprint arXiv:2309.16060, 2023 | | 2023 |
A Deep Motion Vector Approach to Video Object Segmentation V Garg | | 2019 |