A two-mode underwater smart sensor object for precision aquaculture based on AIoT technology

CC Chang, NA Ubina, SC Cheng, HY Lan, KC Chen… - Sensors, 2022 - mdpi.com
Monitoring the status of culture fish is an essential task for precision aquaculture using a
smart underwater imaging device as a non-intrusive way of sensing to monitor freely …

AUC optimization for deep learning-based voice activity detection

XL Zhang, M Xu - EURASIP Journal on Audio, Speech, and Music …, 2022 - Springer
Voice activity detection (VAD) based on deep neural networks (DNN) has demonstrated
good performance in adverse acoustic environments. Current DNN-based VAD optimizes a …

DNAE-GAN: Noise-free acoustic signal generator by integrating autoencoder and generative adversarial network

PH Kuo, ST Lin, J Hu - International Journal of Distributed …, 2020 - journals.sagepub.com
Linear predictive coding is an extremely effective voice generation method that operates
through a simple process. However, linear predictive coding–generated voices have limited …

Sequential audio-visual correspondence with alternating diffusion kernels

D Dov, R Talmon, I Cohen - IEEE Transactions on Signal …, 2018 - ieeexplore.ieee.org
A fundamental problem in multimodal signal processing is to quantify relations between two
different signals with respect to a certain phenomenon. In this paper, we address this …

One-Shot Distributed Node-Specific Signal Estimation with Non-Overlapping Latent Subspaces in Acoustic Sensor Networks

P Didier, P Behmandpoor… - … on Acoustic Signal …, 2024 - ieeexplore.ieee.org
A one-shot algorithm called iterationless DANSE (iDANSE) is introduced to perform
distributed adaptive node-specific signal estimation (DANSE) in a fully connected wireless …

Influence of adaptive thresholding on peaks detection in audio data

T Maka - Multimedia Tools and Applications, 2020 - Springer
Many audio analysis systems employ a peak-picking procedure to produce the final decision.
A typical scheme uses a thresholding function to minimise detection errors where its form …

Robust audiovisual liveness detection for biometric authentication using deep joint embedding and dynamic time warping

A Aides, DOV David, H Aronowitz - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
We address the problem of liveness detection in audiovisual recordings for preventing
spoofing attacks in biometric authentication systems. We assume that liveness is detected …

DUAL-LEVEL SEGMENTATION METHOD FOR FEATURE EXTRACTION ENHANCEMENT STRATEGY IN SPEECH EMOTION RECOGNITION

NAB ZAIDAN - 2022 - eprints.utm.my
The speech segmentation approach could be one of the significant factors contributing to a
Speech Emotion Recognition (SER) system's overall performance. An utterance may contain …