[PDF][PDF] Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.
As many acoustic signal processing methods, for example for source separation or noise
canceling, operate in the magnitude spectrogram domain, the problem of reconstructing a …
canceling, operate in the magnitude spectrogram domain, the problem of reconstructing a …
[图书][B] Designing audio effect plugins in C++: for AAX, AU, and VST3 with DSP theory
W Pirkle - 2019 - taylorfrancis.com
Designing Audio Effect Plugins in C++ presents everything you need to know about digital
signal processing in an accessible way. Not just another theory-heavy digital signal …
signal processing in an accessible way. Not just another theory-heavy digital signal …
Augmentation invariant discrete representation for generative spoken language modeling
Generative Spoken Language Modeling research focuses on optimizing speech Language
Models (LMs) using raw audio recordings without accessing any textual supervision. Such …
Models (LMs) using raw audio recordings without accessing any textual supervision. Such …
Speech time-scale modification with GANs
While listening to spoken content, it is often desired to vary the speech rate while preserving
the speaker's timbre and pitch. To date, advanced signal processing techniques are used to …
the speaker's timbre and pitch. To date, advanced signal processing techniques are used to …
Audio pitch shifting using the constant-Q transform
C Schörkhuber, A Klapuri, A Sontacchi - Journal of the Audio Engineering …, 2013 - aes.org
Pitch shifting of polyphonic music is usually performed by manipulating the time–frequency
representation of the input signal such that frequency is scaled by a constant and time …
representation of the input signal such that frequency is scaled by a constant and time …
NAST: Noise Aware Speech Tokenization for Speech Language Models
S Messica, Y Adi - arXiv preprint arXiv:2406.11037, 2024 - arxiv.org
Speech tokenization is the task of representing speech signals as a sequence of discrete
units. Such representations can be later used for various downstream tasks including …
units. Such representations can be later used for various downstream tasks including …
[PDF][PDF] PVSOLA: A phase vocoder with synchronized overlap-add
In this paper we present an original method mixing temporal and spectral processing to
reduce the phasiness in the phase vocoder. Phasiness is an inherent artifact of the phase …
reduce the phasiness in the phase vocoder. Phasiness is an inherent artifact of the phase …
Deep learning-based single-ended quality prediction for time-scale modified audio
Objective evaluation of audio processed with Time-Scale Modification (TSM) has recently
seen improvement with a labeled time-scaled audio dataset used to train an objective …
seen improvement with a labeled time-scaled audio dataset used to train an objective …
Apparatus, method and computer program for manipulating an audio signal comprising a transient event
Coie LLP (57) ABSTRACT An apparatus for manipulating an audio signal comprising a
transient event has a transient signal replacer configured to replace a transient signal …
transient event has a transient signal replacer configured to replace a transient signal …
Modular and adaptive control of sound processing
D Van Nort - 2010 - escholarship.mcgill.ca
La présente dissertation expose une recherche sur la création de systèmes pour le contrôle
de la synthèse et du traitement des sons. Les travaux portant sur le design d'instruments de …
de la synthèse et du traitement des sons. Les travaux portant sur le design d'instruments de …