An overview of deep-learning-based audio-visual speech enhancement and separation
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …
extract either one or more target speech signals, respectively, from a mixture of sounds …
Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges
Speech is a powerful medium of communication that always convey rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …
information, such as gender, accent, and other unique characteristics of a speaker. These …
Ssd-6d: Making rgb-based 3d detection and 6d pose estimation great again
We present a novel method for detecting 3D model instances and estimating their 6D poses
from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover …
from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover …
End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …
known to reside in specific subbands and temporal segments. Various approaches can be …
[图书][B] Адаптивная фильтрация сигналов: теория и алгоритмы
В Джиган - 2022 - books.google.com
В книге рассматриваются основные разновидности адаптивных фильтров и их
применение в радиотехнических системах и системах связи. Дается представление о …
применение в радиотехнических системах и системах связи. Дается представление о …
Classification of Indian classical music with time-series matching deep learning approach
Music is a heavenly way of expressing feelings about the world. The language of music has
vast diversity. For centuries, people have indulged in debates to stratisfy between Western …
vast diversity. For centuries, people have indulged in debates to stratisfy between Western …
Design of intelligent diabetes mellitus detection system using hybrid feature selection based XGBoost classifier
In this work, a non-invasive diabetes mellitus detection system is proposed based on the
wristband photoplethysmography (PPG) signal and basic physiological parameters (PhyP) …
wristband photoplethysmography (PPG) signal and basic physiological parameters (PhyP) …
Survey on speech emotion recognition: Features, classification schemes, and databases
Recently, increasing attention has been directed to the study of the emotional content of
speech signals, and hence, many systems have been proposed to identify the emotional …
speech signals, and hence, many systems have been proposed to identify the emotional …
An overview of text-independent speaker recognition: From features to supervectors
T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …
emphasis on text-independent recognition. Speaker recognition has been studied actively …
The expectation-maximization algorithm
TK Moon - IEEE Signal processing magazine, 1996 - ieeexplore.ieee.org
A common task in signal processing is the estimation of the parameters of a probability
distribution function. Perhaps the most frequently encountered estimation problem is the …
distribution function. Perhaps the most frequently encountered estimation problem is the …