Uncertainty-aware instance reweighting for off-policy learning

X Zhang, J Chen, H Wang, H Xie… - Advances in Neural …, 2023 - proceedings.neurips.cc
Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various important real-world applications …

Uncertainty-Aware Instance Reweighting for Off-Policy Learning

X Zhang, J Chen, H Wang, H Xie, Y Liu, J Lui… - arXiv e …, 2023 - ui.adsabs.harvard.edu
Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various real-world applications, such as …

Uncertainty-aware instance reweighting for off-policy learning

X Zhang, J Chen, H Wang, H Xie, Y Liu… - Proceedings of the 37th …, 2023 - dl.acm.org
Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various real-world applications, such as …

Uncertainty-Aware Instance Reweighting for Off-Policy Learning

X Zhang, J Chen, H Wang, H Xie, Y Liu, J Lui… - arXiv preprint arXiv …, 2023 - arxiv.org
Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various real-world applications, such as …

Uncertainty-Aware Instance Reweighting for Off-Policy Learning

X Zhang, J Chen, H Wang, H Xie, Y Liu… - … -seventh Conference on … - openreview.net
Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various important real-world applications …