Perceptual coding of digital audio

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

Cited by 403 Related articles All 10 versions

[PDF] wiley.com

Olfactory coding in the insect brain: data and conjectures

CG Galizia - European Journal of Neuroscience, 2014 - Wiley Online Library

Much progress has been made recently in understanding how olfactory coding works in
insect brains. Here, I propose a wiring diagram for the major steps from the first processing …

Cited by 152 Related articles All 14 versions

[PDF] arxiv.org

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

H Tak, M Todisco, X Wang, J Jung, J Yamagishi… - arXiv preprint arXiv …, 2022 - arxiv.org

The performance of spoofing countermeasure systems depends fundamentally upon the use
of sufficiently representative training data. With this usually being limited, current solutions …

Cited by 145 Related articles All 9 versions

[PDF] gla.ac.uk

A survey of personality computing

A Vinciarelli, G Mohammadi - IEEE Transactions on Affective …, 2014 - ieeexplore.ieee.org

Personality is a psychological construct aimed at explaining the wide variety of human
behaviors in terms of a few, stable and measurable individual characteristics. In this respect …

Cited by 635 Related articles All 6 versions

[BOOK][B] Introduction to audio analysis: a MATLAB® approach

T Giannakopoulos, A Pikrakis - 2014 - books.google.com

Introduction to Audio Analysis serves as a standalone introduction to audio analysis,
providing theoretical background to many state-of-the-art techniques. It covers the essential …

Cited by 441 Related articles All 4 versions

[PDF] iau.ir

[BOOK][B] Digital watermarking and steganography

I Cox, M Miller, J Bloom, J Fridrich, T Kalker - 2007 - books.google.com

Digital audio, video, images, and documents are flying through cyberspace to their
respective owners. Unfortunately, along the way, individuals may choose to intervene and …

Cited by 3208 Related articles All 5 versions

[PDF] thecvf.com

Curriculum learning of multiple tasks

A Pentina, V Sharmanska… - Proceedings of the …, 2015 - openaccess.thecvf.com

Sharing information between multiple tasks enables algorithms to achieve good
generalization performance even from small amounts of training data. However, in a realistic …

Cited by 285 Related articles All 14 versions

[PDF] arxiv.org

A perceptually-motivated approach for low-complexity, real-time enhancement of fullband speech

JM Valin, U Isik, N Phansalkar, R Giri… - arXiv preprint arXiv …, 2020 - arxiv.org

Over the past few years, speech enhancement methods based on deep learning have
greatly surpassed traditional methods based on spectral subtraction and spectral estimation …

Cited by 97 Related articles All 11 versions

[PDF] academia.edu

[BOOK][B] Fundamentals of multimedia

ZN Li, MS Drew, J Liu - 2004 - Springer

In the 17 years since the first edition of Fundamentals of Multimedia, the field and
applications of multimedia have flourished and are undergoing evermore rapid growth and …

Cited by 646 Related articles All 15 versions

[PDF] academia.edu

Modulation spectra of natural sounds and ethological theories of auditory processing

NC Singh, FE Theunissen - The Journal of the Acoustical Society of …, 2003 - pubs.aip.org

The modulation statistics of natural sound ensembles were analyzed by calculating the
probability distributions of the amplitude envelope of the sounds and their time-frequency …

Cited by 531 Related articles All 11 versions