SoundStream: An end-to-end neural audio codec

N Zeghidour, A Luebs, A Omran… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
We present SoundStream, a novel neural audio codec that can efficiently compress speech,
music and general audio at bitrates normally targeted by speech-tailored codecs …

Speech coding techniques and challenges: A comprehensive literature survey

M Anees - Multimedia Tools and Applications, 2024 - Springer
Speech coding is the process of compressing speech signals for transmission and storage
in communication systems. In recent years, speech coding has become increasingly …

Noise robust automatic speech recognition: review and analysis

M Dua, Akanksha, S Dua - International Journal of Speech Technology, 2023 - Springer
Automatic Speech Recognition (ASR) system is an emerging technology used in
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …

Cross-scale vector quantization for scalable neural speech coding

X Jiang, X Peng, H Xue, Y Zhang, Y Lu - arXiv preprint arXiv:2207.03067, 2022 - arxiv.org
Bitrate scalability is a desirable feature for audio coding in real-time communications.
Existing neural audio codecs usually enforce a specific bitrate during training, so different …

Speech enhancement for low bit rate speech codec

J Lin, K Kalgaonkar, Q He, X Lei - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Speech codec compresses the input signal into compact bit stream, which is then decoded
at the receiver to generate the best possible perceptual quality. This compression makes …

Neural feature predictor and discriminative residual coding for low-bitrate speech coding

H Yang, W Lim, M Kim - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Low and ultra-low-bitrate neural speech codecs achieved unprecedented coding gain by
generating speech signals from compact features. This paper introduces additional coding …

TeNC: Low bit-rate speech coding with VQ-VAE and GAN

Y Chen, S Yang, N Hu, L Xie, D Su - Companion Publication of the 2021 …, 2021 - dl.acm.org
Speech coding aims at compressing digital speech signals with fewer bits and
reconstructing it back to raw signals, maintaining the speech quality as much as possible …

Revisiting speech content privacy

J Williams, J Yamagishi, PG Noé, CV Botinhao… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we discuss an important aspect of speech privacy: protecting spoken content.
New capabilities from the field of machine learning provide a unique and timely opportunity …

Low-bitrate redundancy coding of speech using a rate-distortion-optimized variational autoencoder

JM Valin, J Büthe, A Mustafa - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Robustness to packet loss is one of the main ongoing challenges in real-time speech
communication. Deep packet loss concealment (PLC) techniques have recently …

Machine learning and deep learning as new tools for business analytics

A Bouguettaya, H Zarzour, A Kechida… - Handbook of Research …, 2022 - igi-global.com
Data scientists need to develop accurate and effective tools and techniques to handle a
huge amount of data. Therefore, machine learning and deep learning algorithms have come …