PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective

A Riou, S Lattner, G Hadjeres, G Peeters - International Society for …, 2023 - hal.science
In this paper, we address the problem of pitch estimation using Self Supervised Learning
(SSL). The SSL paradigm we use is equivariance to pitch transposition, which enables our …

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation

YJ Luo, KW Cheuk, W Choi, T Uesaka… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing work on pitch and timbre disentanglement has been mostly focused on
single-instrument music audio, excluding the cases where multiple instruments are presented. To …

Disentangling Multi-instrument Music Audio for Source-level Pitch and Timbre Manipulation

YJ Luo, KW Cheuk, W Choi, WH Liao, K Toyama… - … NeurIPS 2024 Workshop … - openreview.net
Disentangling pitch and timbre from the audio of a musical instrument involves encoding
these two attributes as separate latent representations, allowing the synthesis of instrument …

Unsupervised Pitch-Timbre Disentanglement of Musical Instruments Using a Jacobian Disentangled Sequential Autoencoder

YJ Luo, S Ewert, S Dixon - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Disentangled representation learning seeks to align individual dimensions or separate
groups of coordinates of latent factors with attributes of observed data such that perturbing …