An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges

R Jahangir, YW Teh, HF Nweke, G Mujtaba… - Expert Systems with …, 2021 - Elsevier
Speech is a powerful medium of communication that always convey rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …

Ssd-6d: Making rgb-based 3d detection and 6d pose estimation great again

W Kehl, F Manhardt, F Tombari… - Proceedings of the …, 2017 - openaccess.thecvf.com
We present a novel method for detecting 3D model instances and estimating their 6D poses
from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover …

End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection

H Tak, J Jung, J Patino, M Kamble, M Todisco… - arXiv preprint arXiv …, 2021 - arxiv.org
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …

[图书][B] Адаптивная фильтрация сигналов: теория и алгоритмы

В Джиган - 2022 - books.google.com
В книге рассматриваются основные разновидности адаптивных фильтров и их
применение в радиотехнических системах и системах связи. Дается представление о …

Classification of Indian classical music with time-series matching deep learning approach

AK Sharma, G Aggarwal, S Bhardwaj… - IEEE …, 2021 - ieeexplore.ieee.org
Music is a heavenly way of expressing feelings about the world. The language of music has
vast diversity. For centuries, people have indulged in debates to stratisfy between Western …

Design of intelligent diabetes mellitus detection system using hybrid feature selection based XGBoost classifier

A Prabha, J Yadav, A Rani, V Singh - Computers in Biology and Medicine, 2021 - Elsevier
In this work, a non-invasive diabetes mellitus detection system is proposed based on the
wristband photoplethysmography (PPG) signal and basic physiological parameters (PhyP) …

Survey on speech emotion recognition: Features, classification schemes, and databases

M El Ayadi, MS Kamel, F Karray - Pattern recognition, 2011 - Elsevier
Recently, increasing attention has been directed to the study of the emotional content of
speech signals, and hence, many systems have been proposed to identify the emotional …

An overview of text-independent speaker recognition: From features to supervectors

T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

The expectation-maximization algorithm

TK Moon - IEEE Signal processing magazine, 1996 - ieeexplore.ieee.org
A common task in signal processing is the estimation of the parameters of a probability
distribution function. Perhaps the most frequently encountered estimation problem is the …