Audio-visual voice activity detection using diffusion maps
The performance of traditional voice activity detectors significantly deteriorates in the
presence of highly nonstationary noise and transient interferences. One solution is to …
presence of highly nonstationary noise and transient interferences. One solution is to …
Voice activity detection for transient noisy environment based on diffusion nets
We address voice activity detection in acoustic environments of transients and stationary
noises, which often occur in real-life scenarios. We exploit unique spatial patterns of speech …
noises, which often occur in real-life scenarios. We exploit unique spatial patterns of speech …
A deep architecture for audio-visual voice activity detection in the presence of transients
We address the problem of voice activity detection in difficult acoustic environments
including high levels of noise and transients, which are common in real life scenarios. We …
including high levels of noise and transients, which are common in real life scenarios. We …
Kernel-based sensor fusion with application to audio-visual voice activity detection
In this paper, we address the problem of multiple view data fusion in the presence of noise
and interferences. Recent studies have approached this problem using kernel methods, by …
and interferences. Recent studies have approached this problem using kernel methods, by …
Kernel method for voice activity detection in the presence of transients
Voice activity detection in the presence of transient interferences is a challenging problem
since transients are often detected incorrectly as speech by existing detectors. In this paper …
since transients are often detected incorrectly as speech by existing detectors. In this paper …
Underwater object classification using scattering transform of sonar signals
N Saito, DS Weber - Wavelets and Sparsity XVII, 2017 - spiedigitallibrary.org
In this paper, we apply the scattering transform (ST)—a nonlinear map based off of a
convolutional neural network (CNN)—to classification of underwater objects using sonar …
convolutional neural network (CNN)—to classification of underwater objects using sonar …
[图书][B] On Interpreting Sonar Waveforms via the Scattering Transform
DS Weber - 2022 - search.proquest.com
Abstract The Scattering Transform (ST) is a formalization of some potential properties that
have made convolutional neural networks effective at a wide variety of image and signal …
have made convolutional neural networks effective at a wide variety of image and signal …
Sequential audio-visual correspondence with alternating diffusion kernels
A fundamental problem in multimodal signal processing is to quantify relations between two
different signals with respect to a certain phenomenon. In this paper, we address this …
different signals with respect to a certain phenomenon. In this paper, we address this …
[PDF][PDF] Scattering vs. discrete cosine transform features in visual speech processing.
Appearance-based feature extraction constitutes the dominant approach for visual speech
representation in a variety of problems, such as automatic speechreading, visual speech …
representation in a variety of problems, such as automatic speechreading, visual speech …
Kernel method for speech source activity detection in multi-modal signals
We consider a problem setup, in which a desired speech source is measured by a
microphone and by a video camera in an interfering environment. We assume that the …
microphone and by a video camera in an interfering environment. We assume that the …