Marginal density ratio for off-policy evaluation in contextual bandits
Abstract: Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new
policies using existing data without costly experimentation. However, current OPE methods …
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits
MF Taufiq, A Doucet, R Cornish, JF Ton - arXiv preprint arXiv:2312.01457, 2023 - arxiv.org
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using
existing data without costly experimentation. However, current OPE methods, such as …
Marginal density ratio for off-policy evaluation in contextual bandits
MF Taufiq, A Doucet, R Cornish, JF Ton - Proceedings of the 37th …, 2023 - dl.acm.org
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using
existing data without costly experimentation. However, current OPE methods, such as …
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits
MF Taufiq, A Doucet, R Cornish, JF Ton - Thirty-seventh Conference on … - openreview.net
Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using
existing data without costly experimentation. However, current OPE methods, such as …