关注
Vineet Garg
Vineet Garg
在 apple.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
S Adya, V Garg, S Sigtia, P Simha, C Dhir
arXiv preprint arXiv:2008.02323, 2020
222020
Progressive Voice Trigger Detection: Accuracy vs Latency
S Sigtia, J Bridle, H Richards, P Clark, E Marchi, V Garg
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
142021
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
V Garg, W Chang, S Sigtia, S Adya, P Simha, P Dighe, C Dhir
arXiv preprint arXiv:2105.06598, 2021
122021
Streaming on-device detection of device directed speech from voice and touch-based invocation
OO Rudovic, A Bindal, V Garg, P Simha, P Dighe, S Kajarekar
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
82022
Leveraging large language models for exploiting asr uncertainty
P Dighe, Y Su, S Zheng, Y Liu, V Garg, X Niu, A Tewfik
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
62024
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
V Garg, O Rudovic, P Dighe, AH Abdelaziz, E Marchi, S Adya, C Dhir, ...
arXiv preprint arXiv:2203.15975, 2022
62022
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types
O Rudovic, W Chang, V Garg, P Dighe, P Simha, J Berkowitz, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
UO Sarawgi, J Berkowitz, V Garg, A Kundu, M Cho, SS Buddi, S Adya, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness
S Kumar, SS Buddi, UO Sarawgi, V Garg, S Ranjan, AH Abdelaziz, ...
arXiv preprint arXiv:2406.09443, 2024
2024
Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
J Berkowitz, V Garg, A Kundu, M Cho, SS Buddi, S Adya, A Tewfik
arXiv preprint arXiv:2310.05886, 2023
2023
Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
U Oggy Sarawgi, J Berkowitz, V Garg, A Kundu, M Cho, S Srujana Buddi, ...
arXiv e-prints, arXiv: 2310.05886, 2023
2023
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study
A Brueggeman, T Higuchi, M Delfarah, S Shum, V Garg
arXiv preprint arXiv:2309.16060, 2023
2023
A Deep Motion Vector Approach to Video Object Segmentation
V Garg
2019
系统目前无法执行此操作,请稍后再试。
文章 1–13