JukeDrummer: Conditional beat-aware audio-domain drum accompaniment generation via transformer VQ-VAE

YK Wu, CY Chiu, YH Yang - arXiv preprint arXiv:2210.06007, 2022 - arxiv.org
This paper proposes a model that generates a drum track in the audio domain to play along
to a user-provided drum-free recording. Specifically, using paired data of drumless tracks …

DarkGAN: Exploiting knowledge distillation for comprehensible audio synthesis with GANs

J Nistal, S Lattner, G Richard - arXiv preprint arXiv:2108.01216, 2021 - arxiv.org
Generative Adversarial Networks (GANs) have achieved excellent audio synthesis quality in
the last years. However, making them operable with semantically meaningful controls …

A benchmarking initiative for audio-domain music generation using the FreeSound Loop Dataset

TM Hung, BY Chen, YT Yeh, YH Yang - arXiv preprint arXiv:2108.01576, 2021 - arxiv.org
This paper proposes a new benchmark task for generating musical passages in the audio
domain by using the drum loops from the FreeSound Loop Dataset, which are publicly re …

PVGAN: a pathological voice generation model incorporating a progressive nesting strategy

X Pan, T Feng, N Zhang - Journal of Voice, 2023 - Elsevier
The voice generation task is to solve the problem of limited samples in the voice dataset
using computer technology. By increasing the number of samples, the accuracy of voice …

Drum synthesis and rhythmic transformation with adversarial autoencoders

M Tomczak, M Goto, J Hockman - Proceedings of the 28th ACM …, 2020 - dl.acm.org
Creative rhythmic transformations of musical audio refer to automated methods for
manipulation of temporally-relevant sounds in time. This paper presents a method for joint …

StyleWaveGAN: Style-based synthesis of drum sounds with extensive controls using generative adversarial networks

A Lavault, A Roebel, M Voiry - arXiv preprint arXiv:2204.00907, 2022 - arxiv.org
In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a
variation of StyleGAN, a state-of-the-art image generator. By conditioning StyleWaveGAN on …

VQCPC-GAN: Variable-length adversarial audio synthesis using vector-quantized contrastive predictive coding

J Nistal, C Aouameur, S Lattner… - 2021 IEEE Workshop on …, 2021 - ieeexplore.ieee.org
Influenced by the field of Computer Vision, Generative Adversarial Networks (GANs) are
often adopted for the audio domain using fixed-size two-dimensional spectrogram …

Exploiting pre-trained feature networks for generative adversarial networks in audio-domain loop generation

YT Yeh, BY Chen, YH Yang - arXiv preprint arXiv:2209.01751, 2022 - arxiv.org
While generative adversarial networks (GANs) have been widely used in research on audio
generation, the training of a GAN model is known to be unstable, time consuming, and data …

A study of control methods for percussive sound synthesis based on GANs

A Ramires, J Juras, JD Parker… - Evangelista G, Holighaus …, 2022 - repositori.upf.edu
The process of creating drum sounds has seen significant evolution in the past decades.
The development of analogue drum synthesizers, such as the TR-808, and modern sound …

Physics-informed differentiable method for piano modeling

R Simionato, S Fasciani, S Holm - Frontiers in Signal Processing, 2024 - frontiersin.org
Numerical emulations of the piano have been a subject of study since the early days of
sound synthesis. High-accuracy sound synthesis of acoustic instruments employs physical …