- 学术资源搜索

Generative adversarial networks for speech processing: A review

A Wali, Z Alamgir, S Karim, A Fawaz, MB Ali… - Computer Speech & …, 2022 - Elsevier

Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …

[PDF] arxiv.org

CMGAN: Conformer-based metric GAN for speech enhancement

R Cao, S Abdulatif, B Yang - arXiv preprint arXiv:2203.15149, 2022 - arxiv.org

Recently, convolution-augmented transformer (Conformer) has achieved promising
performance in automatic speech recognition (ASR) and time-domain speech enhancement …

被引用次数：110 相关文章所有 7 个版本

[PDF] ieee.org

Systematic review of advanced AI methods for improving healthcare data quality in post COVID-19 Era

M Isgut, L Gloster, K Choi… - IEEE Reviews in …, 2022 - ieeexplore.ieee.org

At the beginning of the COVID-19 pandemic, there was significant hype about the potential
impact of artificial intelligence (AI) tools in combatting COVID-19 on diagnosis, prognosis, or …

被引用次数：25 相关文章所有 5 个版本 Web of Science: 7

[PDF] arxiv.org

Cmgan: Conformer-based metric-gan for monaural speech enhancement

S Abdulatif, R Cao, B Yang - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org

In this work, we further develop the conformer-based metric generative adversarial network
(CMGAN) model 1 for speech enhancement (SE) in the time-frequency (TF) domain. This …

被引用次数：47 相关文章所有 8 个版本 Web of Science: 3

ComposeInStyle: Music composition with and without Style Transfer

S Mukherjee, M Mulimani - Expert Systems with Applications, 2022 - Elsevier

Every music composition has a composer at the core of its building block, molding it into a
style of their own. The creative compositional style of a composer varies dynamically with …

[PDF] arxiv.org

Learning to denoise historical music

Y Li, B Gfeller, M Tagliasacchi, D Roblek - arXiv preprint arXiv:2008.02027, 2020 - arxiv.org

We propose an audio-to-audio neural network model that learns to denoise old music
recordings. Our model internally converts its input into a time-frequency representation by …

被引用次数：18 相关文章所有 5 个版本

μ-law SGAN for generating spectra with more details in speech enhancement

H Li, Y Xu, D Ke, K Su - Neural Networks, 2021 - Elsevier

The goal of monaural speech enhancement is to separate clean speech from noisy speech.
Recently, many studies have employed generative adversarial networks (GAN) to deal with …

[PDF] arxiv.org

Single channel speech enhancement by colored spectrograms

S Gul, MS Khan, M Fazeel - arXiv preprint arXiv:2310.17142, 2023 - arxiv.org

Speech enhancement concerns the processes required to remove unwanted background
sounds from the target speech to improve its quality and intelligibility. In this paper, a novel …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Audio denoising for robust audio fingerprinting

K Akesbi - arXiv preprint arXiv:2212.11277, 2022 - arxiv.org

Music discovery services let users identify songs from short mobile recordings. These
solutions are often based on Audio Fingerprinting, and rely more specifically on the …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Investigating cross-domain losses for speech enhancement

S Abdulatif, K Armanious, JT Sajeev… - 2021 29th European …, 2021 - ieeexplore.ieee.org

Recent years have seen a surge in the number of available frameworks for speech
enhancement (SE) and recognition. Whether model-based or constructed via deep learning …

被引用次数：11 相关文章所有 5 个版本