关注
jiajun fan
jiajun fan
在 mails.tsinghua.edu.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Optimal transport for treatment effect estimation
H Wang, J Fan, Z Chen, H Li, W Liu, T Liu, Q Dai, Y Wang, Z Dong, ...
Advances in Neural Information Processing Systems 36, 2024
242024
A review for deep reinforcement learning in atari: Benchmarks, challenges, and solutions
J Fan
arXiv preprint arXiv:2112.04145, 2021
172021
Learnable behavior control: Breaking atari human world records via sample-efficient behavior selection
J Fan, Y Zhuang, Y Liu, J Hao, B Wang, J Zhu, H Wang, ST Xia
The Eleventh International Conference on Learning Representations, 2023
162023
Generalized data distribution iteration
J Fan, C Xiao
The Thirty-ninth International Conference on Machine Learning, 2022
132022
Gdi: Rethinking what makes reinforcement learning different from supervised learning
J Fan, C Xiao, Y Huang
arXiv preprint arXiv:2106.06232, 2021
102021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
C Xiao, H Shi, J Fan, S Deng
arXiv preprint arXiv:2106.00707, 2021
52021
CASA: A bridge between gradient of policy improvement and policy evaluation
C Xiao, H Shi, J Fan, S Deng
CoRR, abs/2105.03923, 2021a. URL https://arxiv. org/abs/2105.03923, 2021
42021
Critic PI2: Master continuous planning via policy improvement with path integrals and deep actor-critic reinforcement learning
J Fan, H Ba, X Guo, J Hao
arXiv preprint arXiv:2011.06752, 2020
42020
Entire space counterfactual learning: Tuning, analytical properties and industrial applications
H Wang, Z Chen, J Fan, Y Huang, W Liu, X Liu
arXiv preprint arXiv:2210.11039, 2022
32022
Convformer: Revisiting transformer for sequential user modeling
H Wang, J Lian, M Wu, H Li, J Fan, W Xu, C Li, X Xie
arXiv preprint arXiv:2308.02925, 2023
22023
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference
Y Li, C Tang, Y Meng, J Fan, Z Chai, X Ma, Z Wang, W Zhu
arXiv preprint arXiv:2407.05010, 2024
2024
Proximity Matters: Local Proximity Preserved Balancing for Treatment Effect Estimation
H Wang, Z Chen, Y Shen, J Fan, Z Liu, D Yang, X Liu, H Li
arXiv preprint arXiv:2407.01111, 2024
2024
Sinkhorn Discrepancy for Counterfactual Generalization
H Wang, Q Dai, J Fan, W Liu, Z Chen, T Liu, Y Wang, Z Dong, R Tang
系统目前无法执行此操作,请稍后再试。
文章 1–13