Towards a fair marketplace: Counterfactual evaluation of the trade-off between relevance, fairness & satisfaction in recommendation systems
Two-sided marketplaces are platforms that have customers not only on the demand side (eg
users), but also on the supply side (eg retailer, artists). While traditional recommender …
users), but also on the supply side (eg retailer, artists). While traditional recommender …
Woulda, coulda, shoulda: Counterfactually-guided policy search
Learning policies on data synthesized by models can in principle quench the thirst of
reinforcement learning algorithms for large amounts of real experience, which is often costly …
reinforcement learning algorithms for large amounts of real experience, which is often costly …
Counterfactual evaluation of slate recommendations with sequential reward interactions
Users of music streaming, video streaming, news recommendation, and e-commerce
services often engage with content in a sequential manner. Providing and evaluating good …
services often engage with content in a sequential manner. Providing and evaluating good …
Deriving user-and content-specific rewards for contextual bandits
Bandit algorithms have gained increased attention in recommender systems, as they
provide effective and scalable recommendations. These algorithms use reward functions …
provide effective and scalable recommendations. These algorithms use reward functions …
Exploration Trade-offs in Web Recommender Systems
R Baeza-Yates, G Delnevo - … Conference on Big Data (Big Data …, 2022 - ieeexplore.ieee.org
One of the main problems of web recommender systems is exposure bias, due to the fact
that the web system itself is partly generating its own future, as users can only click on items …
that the web system itself is partly generating its own future, as users can only click on items …
A critical analysis of offline evaluation decisions against online results: a real-time recommendations case study
P Nogueira, D Gonçalves, VQ Marinho… - … Systems in Fashion and …, 2022 - Springer
Offline evaluation has a widespread use in the development of recommender systems. In
order to perform offline evaluation, an Information Retrieval practitioner has to make several …
order to perform offline evaluation, an Information Retrieval practitioner has to make several …
Improving evolutionary strategies with generative neural networks
Evolutionary Strategies (ES) are a popular family of black-box zeroth-order optimization
algorithms which rely on search distributions to efficiently optimize a large variety of …
algorithms which rely on search distributions to efficiently optimize a large variety of …
Task selection policies for multitask learning
One of the questions that arises when designing models that learn to solve multiple tasks
simultaneously is how much of the available training budget should be devoted to each …
simultaneously is how much of the available training budget should be devoted to each …
Efficient methods in counterfactual policy learning and sequential decision making
H Zenati - 2023 - theses.hal.science
Because logged data has become ubiquitous in wide-range applications and since
onlineexploration may be sensitive, counterfactual methods have gained significant …
onlineexploration may be sensitive, counterfactual methods have gained significant …
Counterfactual learning of stochastic policies with continuous actions: from models to offline evaluation
Counterfactual reasoning from logged data has become increasingly important for many
applications such as web advertising or healthcare. In this paper, we address the problem of …
applications such as web advertising or healthcare. In this paper, we address the problem of …