Content-driven music recommendation: Evolution, state of the art, and challenges
The music domain is among the most important ones for adopting recommender systems
technology. In contrast to most other recommendation domains, which predominantly rely on …
technology. In contrast to most other recommendation domains, which predominantly rely on …
Is rlhf more difficult than standard rl? a theoretical perspective
Abstract Reinforcement learning from Human Feedback (RLHF) learns from preference
signals, while standard Reinforcement Learning (RL) directly learns from reward signals …
signals, while standard Reinforcement Learning (RL) directly learns from reward signals …
Positive, negative and neutral: Modeling implicit feedback in session-based news recommendation
News recommendation for anonymous readers is a useful but challenging task for many
news portals, where interactions between readers and articles are limited within a temporary …
news portals, where interactions between readers and articles are limited within a temporary …
Preference-based online learning with dueling bandits: A survey
In machine learning, the notion of multi-armed bandits refers to a class of online learning
problems, in which an agent is supposed to simultaneously explore and exploit a given set …
problems, in which an agent is supposed to simultaneously explore and exploit a given set …
Arithmetic control of llms for diverse user preferences: Directional preference alignment with multi-objective rewards
Fine-grained control over large language models (LLMs) remains a significant challenge,
hindering their adaptability to diverse user needs. While Reinforcement Learning from …
hindering their adaptability to diverse user needs. While Reinforcement Learning from …
Carousel personalization in music streaming apps with contextual bandits
Media services providers, such as music streaming platforms, frequently leverage swipeable
carousels to recommend personalized content to their users. However, selecting the most …
carousels to recommend personalized content to their users. However, selecting the most …
Counteracting user attention bias in music streaming recommendation via reward modification
In streaming media applications, like music Apps, songs are recommended in a continuous
way in users' daily life. The recommended songs are played automatically although users …
way in users' daily life. The recommended songs are played automatically although users …
Discover: Disentangled music representation learning for cover song identification
In the field of music information retrieval (MIR), cover song identification (CSI) is a
challenging task that aims to identify cover versions of a query song from a massive …
challenging task that aims to identify cover versions of a query song from a massive …
Building cross-sectional systematic strategies by learning to rank
The success of a cross-sectional systematic strategy depends critically on accurately ranking
assets prior to portfolio construction. Contemporary techniques perform this ranking step …
assets prior to portfolio construction. Contemporary techniques perform this ranking step …