From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano

H Zhang, J Liang, S Dixon - arXiv preprint arXiv:2407.04518, 2024 - arxiv.org
Our study investigates an approach for understanding musical performances through the
lens of audio encoding models, focusing on the domain of solo Western classical piano …

Audio Conditioning for Music Generation via Discrete Bottleneck Features

S Rouard, Y Adi, J Copet, A Roebel… - arXiv preprint arXiv …, 2024 - arxiv.org
While most music generation models use textual or parametric conditioning (eg tempo,
harmony, musical genre), we propose to condition a language model based music …

Embedding Compression for Teacher-to-Student Knowledge Transfer

Y Ding, A Lerch - arXiv preprint arXiv:2402.06761, 2024 - arxiv.org
Common knowledge distillation methods require the teacher model and the student model
to be trained on the same task. However, the usage of embeddings as teachers has also …