Transformation vs tradition: Artificial general intelligence (AGI) for arts and humanities

Z Liu, Y Li, Q Cao, J Chen, T Yang, Z Wu, J Hale… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in artificial general intelligence (AGI), particularly large language models
and creative image generation systems, have demonstrated impressive capabilities on …

Video-to-music recommendation using temporal alignment of segments

L Prétet, G Richard, C Souchier… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We study cross-modal recommendation of music tracks to be used as soundtracks for videos.
This problem is known as the music supervision task. We build on a self-supervised system …

Solos: A dataset for audio-visual music analysis

JF Montesinos, O Slizovskaia… - 2020 IEEE 22nd …, 2020 - ieeexplore.ieee.org
In this paper, we present a new dataset of music performance videos which can be used for
training machine learning methods for multiple tasks such as audio-visual blind source …

Audiovisual singing voice separation

B Li, Y Wang, Z Duan - arXiv preprint arXiv:2107.00231, 2021 - arxiv.org
Separating a song into vocal and accompaniment components is an active research topic,
and recent years witnessed an increased performance from supervised training using deep …

Metric learning for video to music recommendation

L Prétet - 2022 - theses.hal.science
Music enhances moving images and allows to efficiently communicate emotion or narrative
tension, thanks to cultural codes common to the filmmakers and viewers. A successful …

[PDF] Improving Audio Onset Detection for String Instruments by Incorporating Visual Modality

G Bastas, A Gkiokas, V Katsouros, P Maragos - MML 2020, 2020 - researchgate.net
This paper presents a method for enhancing music audio onset detection in the context of
live music performance recordings. As a structural element of our method we utilize a …

[BOOK] Multi-modal analysis for music performances

B Li - 2020 - search.proquest.com
When we talk about music, we tend to view it as an art of sound. However, it is much broader
than that. We watch music performances, read musical scores, and memorize lyrics of …

[PDF] Interactive Music Systems

Z Duan - hajim.rochester.edu
Interactive Music Systems Page 1 Interactive Music Systems Zhiyao Duan Associate
Professor of ECE and CS University of Rochester Presentation at WiSSAP 2023, IIT Kanpur …

[PDF] Convolutional Networks for Visual Onset Detection in the Context of Bowed String Instrument Performances

G Bastas, A Gkiokas, V Katsouros, P Maragos - 2021 - researchgate.net
In this work, we employ deep learning methods for visual onset detection. We focus on live
music performances involving bowed string instruments. In this context, we take as a source …