Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

MF Taufiq, A Doucet, R Cornish… - Advances in Neural …, 2024 - proceedings.neurips.cc
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new
policies using existing data without costly experimentation. However, current OPE methods …

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

MF Taufiq, A Doucet, R Cornish, JF Ton - arXiv preprint arXiv:2312.01457, 2023 - arxiv.org
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using
existing data without costly experimentation. However, current OPE methods, such as …

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

MF Taufiq, A Doucet, R Cornish, JF Ton - Proceedings of the 37th …, 2023 - dl.acm.org
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using
existing data without costly experimentation. However, current OPE methods, such as …

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

MF Taufiq, A Doucet, R Cornish, JF Ton - Thirty-seventh Conference on … - openreview.net
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using
existing data without costly experimentation. However, current OPE methods, such as …