Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

Olfactory coding in the insect brain: data and conjectures

CG Galizia - European Journal of Neuroscience, 2014 - Wiley Online Library
Much progress has been made recently in understanding how olfactory coding works in
insect brains. Here, I propose a wiring diagram for the major steps from the first processing …

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

H Tak, M Todisco, X Wang, J Jung, J Yamagishi… - arXiv preprint arXiv …, 2022 - arxiv.org
The performance of spoofing countermeasure systems depends fundamentally upon the use
of sufficiently representative training data. With this usually being limited, current solutions …

A survey of personality computing

A Vinciarelli, G Mohammadi - IEEE Transactions on Affective …, 2014 - ieeexplore.ieee.org
Personality is a psychological construct aimed at explaining the wide variety of human
behaviors in terms of a few, stable and measurable individual characteristics. In this respect …

[图书][B] Introduction to audio analysis: a MATLAB® approach

T Giannakopoulos, A Pikrakis - 2014 - books.google.com
Introduction to Audio Analysis serves as a standalone introduction to audio analysis,
providing theoretical background to many state-of-the-art techniques. It covers the essential …

[图书][B] Digital watermarking and steganography

I Cox, M Miller, J Bloom, J Fridrich, T Kalker - 2007 - books.google.com
Digital audio, video, images, and documents are flying through cyberspace to their
respective owners. Unfortunately, along the way, individuals may choose to intervene and …

Curriculum learning of multiple tasks

A Pentina, V Sharmanska… - Proceedings of the …, 2015 - openaccess.thecvf.com
Sharing information between multiple tasks enables algorithms to achieve good
generalization performance even from small amounts of training data. However, in a realistic …

A perceptually-motivated approach for low-complexity, real-time enhancement of fullband speech

JM Valin, U Isik, N Phansalkar, R Giri… - arXiv preprint arXiv …, 2020 - arxiv.org
Over the past few years, speech enhancement methods based on deep learning have
greatly surpassed traditional methods based on spectral subtraction and spectral estimation …

[图书][B] Fundamentals of multimedia

ZN Li, MS Drew, J Liu - 2004 - Springer
In the 17 years since the first edition of Fundamentals of Multimedia, the field and
applications of multimedia have flourished and are undergoing evermore rapid growth and …

Modulation spectra of natural sounds and ethological theories of auditory processing

NC Singh, FE Theunissen - The Journal of the Acoustical Society of …, 2003 - pubs.aip.org
The modulation statistics of natural sound ensembles were analyzed by calculating the
probability distributions of the amplitude envelope of the sounds and their time-frequency …