All versions - Academic resource search

Uncertainty-aware instance reweighting for off-policy learning

X Zhang, J Chen, H Wang, H Xie… - Advances in Neural …, 2023 - proceedings.neurips.cc

Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various important real-world applications …

Cited by: 2

Uncertainty-Aware Instance Reweighting for Off-Policy Learning

X Zhang, J Chen, H Wang, H Xie, Y Liu, J Lui… - arXiv e …, 2023 - ui.adsabs.harvard.edu

Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various real-world applications, such as …

Uncertainty-aware instance reweighting for off-policy learning

X Zhang, J Chen, H Wang, H Xie, Y Liu… - Proceedings of the 37th …, 2023 - dl.acm.org

Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various real-world applications, such as …

Uncertainty-Aware Instance Reweighting for Off-Policy Learning

X Zhang, J Chen, H Wang, H Xie, Y Liu, J Lui… - arXiv preprint arXiv …, 2023 - arxiv.org

Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various real-world applications, such as …

Uncertainty-Aware Instance Reweighting for Off-Policy Learning

X Zhang, J Chen, H Wang, H Xie, Y Liu… - … -seventh Conference on … - openreview.net

Off-policy learning, referring to the procedure of policy optimization with access only to
logged feedback data, has shown importance in various important real-world applications …