Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder

A Bittar, P Dixon, M Samragh, K Nishu… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Using a vision-inspired keyword spotting framework, we propose an architecture with
input-dependent dynamic depth capable of processing streaming audio. Specifically, we extend a …