Multimodal kernel method for activity detection of sound sources

CC Chang, NA Ubina, SC Cheng, HY Lan, KC Chen… - Sensors, 2022 - mdpi.com

Monitoring the status of cultured fish is an essential task for precision aquaculture using a
smart underwater imaging device as a non-intrusive way of sensing to monitor freely …

Cited by 23 · Related articles · All 12 versions

AUC optimization for deep learning-based voice activity detection

XL Zhang, M Xu - EURASIP Journal on Audio, Speech, and Music …, 2022 - Springer

Voice activity detection (VAD) based on deep neural networks (DNN) has demonstrated
good performance in adverse acoustic environments. Current DNN-based VAD optimizes a …

Cited by 42 · Related articles · All 11 versions

DNAE-GAN: Noise-free acoustic signal generator by integrating autoencoder and generative adversarial network

PH Kuo, ST Lin, J Hu - International Journal of Distributed …, 2020 - journals.sagepub.com

Linear predictive coding is an extremely effective voice generation method that operates
through a simple process. However, linear predictive coding–generated voices have limited …

Cited by 15 · Related articles · All 4 versions

Sequential audio-visual correspondence with alternating diffusion kernels

D Dov, R Talmon, I Cohen - IEEE Transactions on Signal …, 2018 - ieeexplore.ieee.org

A fundamental problem in multimodal signal processing is to quantify relations between two
different signals with respect to a certain phenomenon. In this paper, we address this …

Cited by 8 · Related articles · All 3 versions

One-Shot Distributed Node-Specific Signal Estimation with Non-Overlapping Latent Subspaces in Acoustic Sensor Networks

P Didier, P Behmandpoor… - … on Acoustic Signal …, 2024 - ieeexplore.ieee.org

A one-shot algorithm called iterationless DANSE (iDANSE) is introduced to perform
distributed adaptive node-specific signal estimation (DANSE) in a fully connected wireless …

Influence of adaptive thresholding on peaks detection in audio data

T Maka - Multimedia Tools and Applications, 2020 - Springer

Many audio analysis systems employ a peak-picking procedure to produce the final decision.
A typical scheme uses a thresholding function to minimise detection errors where its form …

Cited by 3 · Related articles · All 5 versions

Robust audiovisual liveness detection for biometric authentication using deep joint embedding and dynamic time warping

A Aides, DOV David, H Aronowitz - 2018 IEEE International …, 2018 - ieeexplore.ieee.org

We address the problem of liveness detection in audiovisual recordings for preventing
spoofing attacks in biometric authentication systems. We assume that liveness is detected …

Cited by 2 · Related articles · All 4 versions

[PDF] DUAL-LEVEL SEGMENTATION METHOD FOR FEATURE EXTRACTION ENHANCEMENT STRATEGY IN SPEECH EMOTION RECOGNITION

NAB ZAIDAN - 2022 - eprints.utm.my

The speech segmentation approach could be one of the significant factors contributing to a
Speech Emotion Recognition (SER) system's overall performance. An utterance may contain …