Multitrack music transcription with a time-frequency perceiver

WT Lu, JC Wang, YN Hung - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Multitrack music transcription aims to transcribe a music audio input into the musical notes of
multiple instruments simultaneously. It is a very challenging task that typically requires a …

StemGen: A music generation model that listens

JD Parker, J Spijkervet, K Kosta… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
End-to-end generation of musical audio using deep learning techniques has seen an
explosion of activity recently. However, most models concentrate on generating fully mixed …

To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions

JC Wang, YN Hung, JBL Smith - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Conventional music structure analysis algorithms aim to divide a song into segments and to
group them with abstract labels (eg,'A','B', and 'C'). However, explicitly identifying the function …

Beat transformer: Demixed beat and downbeat tracking with dilated self-attention

J Zhao, G Xia, Y Wang - arXiv preprint arXiv:2209.07140, 2022 - arxiv.org
We propose Beat Transformer, a novel Transformer encoder architecture for joint beat and
downbeat tracking. Different from previous models that track beats solely based on the …

Seed-music: A unified framework for high quality and controlled music generation

Y Bai, H Chen, J Chen, Z Chen, Y Deng… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Seed-Music, a suite of music generation systems capable of producing high-
quality music with fine-grained style control. Our unified framework leverages both auto …

Music source separation with band-split rope transformer

WT Lu, JC Wang, Q Kong… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Music source separation (MSS) aims to separate a music recording into multiple musically
distinct stems, such as vocals, bass, drums, and more. Recently, deep learning approaches …

SSDPT: Self-supervised dual-path transformer for anomalous sound detection

J Bai, J Chen, M Wang, MS Ayub, Q Yan - Digital Signal Processing, 2023 - Elsevier
Anomalous sound detection for machine condition monitoring or structural health monitoring
is essential in the development of Industry 4.0. However, the anomalous sounds of …

All-in-one metrical and functional structure analysis with neighborhood attentions on demixed audio

T Kim, J Nam - 2023 IEEE Workshop on Applications of Signal …, 2023 - ieeexplore.ieee.org
Music is characterized by complex hierarchical structures. Developing a comprehensive
model to capture these structures has been a significant challenge in the field of Music …

An efficient hidden markov model with periodic recurrent neural network observer for music beat tracking

G Song, Z Wang - Electronics, 2022 - mdpi.com
In music information retrieval (MIR), beat tracking is one of the most fundamental tasks. To
obtain this critical component from rhythmic music signals, a previous beat tracking system …

Beat this! Accurate beat tracking without DBN postprocessing

F Foscarin, J Schlüter, G Widmer - arXiv preprint arXiv:2407.21658, 2024 - arxiv.org
We propose a system for tracking beats and downbeats with two objectives: generality
across a diverse music range, and high accuracy. We achieve generality by training on …