Audio-visual voice activity detection using diffusion maps

D Dov, R Talmon, I Cohen - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
The performance of traditional voice activity detectors significantly deteriorates in the
presence of highly nonstationary noise and transient interferences. One solution is to …

Voice activity detection for transient noisy environment based on diffusion nets

A Ivry, B Berdugo, I Cohen - IEEE Journal of Selected Topics in …, 2019 - ieeexplore.ieee.org
We address voice activity detection in acoustic environments of transients and stationary
noises, which often occur in real-life scenarios. We exploit unique spatial patterns of speech …

A deep architecture for audio-visual voice activity detection in the presence of transients

I Ariav, D Dov, I Cohen - Signal Processing, 2018 - Elsevier
We address the problem of voice activity detection in difficult acoustic environments
including high levels of noise and transients, which are common in real life scenarios. We …

Kernel-based sensor fusion with application to audio-visual voice activity detection

D Dov, R Talmon, I Cohen - IEEE Transactions on Signal …, 2016 - ieeexplore.ieee.org
In this paper, we address the problem of multiple view data fusion in the presence of noise
and interferences. Recent studies have approached this problem using kernel methods, by …

Kernel method for voice activity detection in the presence of transients

D Dov, R Talmon, I Cohen - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org
Voice activity detection in the presence of transient interferences is a challenging problem
since transients are often detected incorrectly as speech by existing detectors. In this paper …

Underwater object classification using scattering transform of sonar signals

N Saito, DS Weber - Wavelets and Sparsity XVII, 2017 - spiedigitallibrary.org
In this paper, we apply the scattering transform (ST)—a nonlinear map based off of a
convolutional neural network (CNN)—to classification of underwater objects using sonar …

[BOOK][B] On Interpreting Sonar Waveforms via the Scattering Transform

DS Weber - 2022 - search.proquest.com
The Scattering Transform (ST) is a formalization of some potential properties that
have made convolutional neural networks effective at a wide variety of image and signal …

Sequential audio-visual correspondence with alternating diffusion kernels

D Dov, R Talmon, I Cohen - IEEE Transactions on Signal …, 2018 - ieeexplore.ieee.org
A fundamental problem in multimodal signal processing is to quantify relations between two
different signals with respect to a certain phenomenon. In this paper, we address this …

[PDF] Scattering vs. discrete cosine transform features in visual speech processing

E Marcheret, G Potamianos, J Vopicka, V Goel - AVSP, 2015 - isca-archive.org
Appearance-based feature extraction constitutes the dominant approach for visual speech
representation in a variety of problems, such as automatic speechreading, visual speech …

Kernel method for speech source activity detection in multi-modal signals

D Dov, R Talmon, I Cohen - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
We consider a problem setup, in which a desired speech source is measured by a
microphone and by a video camera in an interfering environment. We assume that the …