Soundstream: An end-to-end neural audio codec
We present SoundStream, a novel neural audio codec that can efficiently compress speech,
music and general audio at bitrates normally targeted by speech-tailored codecs …
music and general audio at bitrates normally targeted by speech-tailored codecs …
Speech coding techniques and challenges: A comprehensive literature survey
M Anees - Multimedia Tools and Applications, 2024 - Springer
Speech coding is the process of compressing speech signals for transmission and storage
in communication systems. In recent years, speech coding has become increasingly …
in communication systems. In recent years, speech coding has become increasingly …
Noise robust automatic speech recognition: review and analysis
M Dua, Akanksha, S Dua - International Journal of Speech Technology, 2023 - Springer
Abstract Automatic Speech Recognition (ASR) system is an emerging technology used in
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …
Cross-scale vector quantization for scalable neural speech coding
Bitrate scalability is a desirable feature for audio coding in real-time communications.
Existing neural audio codecs usually enforce a specific bitrate during training, so different …
Existing neural audio codecs usually enforce a specific bitrate during training, so different …
Speech enhancement for low bit rate speech codec
Speech codec compresses the input signal into compact bit stream, which is then decoded
at the receiver to generate the best possible perceptual quality. This compression makes …
at the receiver to generate the best possible perceptual quality. This compression makes …
Neural feature predictor and discriminative residual coding for low-bitrate speech coding
Low and ultra-low-bitrate neural speech codecs achieved unprecedented coding gain by
generating speech signals from compact features. This paper introduces additional coding …
generating speech signals from compact features. This paper introduces additional coding …
TeNC: Low bit-rate speech coding with VQ-VAE and GAN
Speech coding aims at compressing digital speech signals with fewer bits and
reconstructing it back to raw signals, maintaining the speech quality as much as possible …
reconstructing it back to raw signals, maintaining the speech quality as much as possible …
Revisiting speech content privacy
In this paper, we discuss an important aspect of speech privacy: protecting spoken content.
New capabilities from the field of machine learning provide a unique and timely opportunity …
New capabilities from the field of machine learning provide a unique and timely opportunity …
Low-bitrate redundancy coding of speech using a rate-distortion-optimized variational autoencoder
Robustness to packet loss is one of the main ongoing challenges in real-time speech
communication. Deep packet loss concealment (PLC) techniques have recently …
communication. Deep packet loss concealment (PLC) techniques have recently …
Machine learning and deep learning as new tools for business analytics
Data scientists need to develop accurate and effective tools and techniques to handle a
huge amount of data. Therefore, machine learning and deep learning algorithms have come …
huge amount of data. Therefore, machine learning and deep learning algorithms have come …