SSVMR: Saliency-based self-training for video-music retrieval

X Cheng, Z Zhu, H Li, Y Li, Y Zou - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
With the rise of short videos, the demand for selecting appropriate background music (BGM)
for a video has increased significantly, video-music retrieval (VMR) task gradually draws …

Multimodal music information processing and retrieval: Survey and future challenges

F Simonetta, S Ntalampiras… - … workshop on multilayer …, 2019 - ieeexplore.ieee.org
Towards improving the performance in various music information processing tasks, recent
studies exploit different modalities able to capture diverse aspects of music. Such modalities …

Music recommendation system and recommendation model based on convolutional neural network

Y Zhang - Mobile Information Systems, 2022 - Wiley Online Library
In today's era of big data with excess information, music is common and everyday, which
shows the huge amount of music data. How to obtain one's favorite music from the massive …

Video-to-music recommendation using temporal alignment of segments

L Prétet, G Richard, C Souchier… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We study cross-modal recommendation of music tracks to be used as soundtracks for videos.
This problem is known as the music supervision task. We build on a self-supervised system …

Multimodal music datasets? Challenges and future goals in music processing

AM Christodoulou, O Lartillot, AR Jensenius - International Journal of …, 2024 - Springer
The term “multimodal music dataset” is often used to describe music-related datasets that
represent music as a multimedia art form and multimodal experience. However, the term …

Cross-modal music-video recommendation: A study of design choices

L Prétet, G Richard, G Peeters - 2021 International Joint …, 2021 - ieeexplore.ieee.org
In this work, we study music/video cross-modal recommendation, i.e., recommending a music
track for a video or vice versa. We rely on a self-supervised learning paradigm to learn from …

An Audiovisual Correlation Matching Method Based on Fine-Grained Emotion and Feature Fusion

Z Su, Y Feng, J Liu, J Peng, W Jiang, J Liu - Sensors, 2024 - mdpi.com
Most existing intelligent editing tools for music and video rely on the cross-modal matching
technology of the affective consistency or the similarity of feature representations. However …

Content-based multimedia recommendation systems: definition and application domains

Y Deldjoo, M Schedl, P Cremonesi… - Italian Information …, 2018 - re.public.polimi.it
The goal of this work is to formally provide a general definition of a multimedia
recommendation system (MMRS), in particular a content-based MMRS (CB-MMRS), and to …

Content-based music-image retrieval using self-and cross-modal feature embedding memory

T Nakatsuka, M Hamasaki… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper describes a method based on deep metric learning for content-based cross-modal
retrieval of a piece of music and its representative image (i.e., a music audio signal …

Is there a "language of music-video clips"? A qualitative and quantitative study

L Prétet, G Richard, G Peeters - arXiv preprint arXiv:2108.00970, 2021 - arxiv.org
Recommending automatically a video given a music or a music given a video has become
an important asset for the audiovisual industry, with user-generated or professional content …