关注
Guoxi Zhang
Guoxi Zhang
Beijing Institute for General Artificial Intelligence
在 ml.ist.i.kyoto-u.ac.jp 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Machine learning in materials chemistry: An invitation
D Packwood, LTH Nguyen, P Cesana, G Zhang, A Staykov, Y Fukumoto, ...
Machine Learning with Applications 8, 100265, 2022
402022
Learning state importance for preference-based reinforcement learning
G Zhang, H Kashima
Machine Learning 113 (4), 1885-1901, 2024
62024
Batch reinforcement learning from crowds
G Zhang, H Kashima
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2022
62022
Estimating treatment effects under heterogeneous interference
X Lin, G Zhang, X Lu, H Bao, K Takeuchi, H Kashima
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2023
42023
Improving pairwise rank aggregation via querying for rank difference
G Zhang, J Li, H Kashima
2022 IEEE 9th International Conference on Data Science and Advanced …, 2022
42022
Robust multi-view topic modeling by incorporating detecting anomalies
G Zhang, T Iwata, H Kashima
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017
42017
INSIGHT: End-to-End Neuro-Symbolic Visual Reinforcement Learning with Language Explanations
L Luo, G Zhang, H Xu, Y Yang, C Fang, Q Li
arXiv preprint arXiv:2403.12451, 2024
32024
Behavior estimation from multi-source data for offline reinforcement learning
G Zhang, H Kashima
Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 11201 …, 2023
22023
On Modeling Long-Term User Engagement from Stochastic Feedback
G Zhang, X Yao, X Xiao
Companion Proceedings of the ACM Web Conference 2023 (WWW '23 Companion), 2023
12023
On reducing dimensionality of labeled data efficiently
G Zhang, T Iwata, H Kashima
Advances in Knowledge Discovery and Data Mining: 22nd Pacific-Asia …, 2018
12018
VickreyFeedback: Cost-efficient Data Construction for Reinforcement Learning from Human Feedback
G Zhang, J Duan
arXiv preprint arXiv:2409.18417, 2024
2024
SYNERGAI: Perception Alignment for Human-Robot Collaboration
Y Chen, G Zhang, Y Zhang, H Xu, P Zhi, Q Li, S Huang
arXiv preprint arXiv:2409.15684, 2024
2024
Treatment Effect Estimation Under Unknown Interference
X Lin, G Zhang, X Lu, H Kashima
Pacific-Asia Conference on Knowledge Discovery and Data Mining, 28-42, 2024
2024
Online Policy Learning from Offline Preferences
G Zhang, H Bao, H Kashima
arXiv preprint arXiv:2403.10160, 2024
2024
Offline Reinforcement Learning from Imperfect Human Guidance
G Zhang
Kyoto University, 2023
2023
Deploying exploration in proximity indices for link collection problem
G Zhang
人工知能学会全国大会論文集 第 31 回 (2017), 4Q12in2-4Q12in2, 2017
2017
系统目前无法执行此操作,请稍后再试。
文章 1–16