Paul Weng
Duke Kunshan University
Verified email at duke.edu
Title
Cited by
Year
Dual graph attention networks for deep latent representation of multifaceted social effects in recommender systems
Q Wu, H Zhang, X Gao, P He, P Weng, H Gao, G Chen
The world wide web conference, 2091-2102, 2019
Cited by: 335, Year: 2019
Top-k selection based on adaptive sampling of noisy preferences
R Busa-Fekete, B Szorenyi, P Weng, W Cheng, E Hüllermeier
International Conference on Machine Learning, 1094-1102, 2013
Cited by: 93, Year: 2013
Analytics and machine learning in vehicle routing research
R Bai, X Chen, ZL Chen, T Cui, S Gong, W He, X Jiang, H Jin, J Jin, ...
International Journal of Production Research 61 (1), 4-30, 2023
Cited by: 92, Year: 2023
Learning fair policies in multi-objective (deep) reinforcement learning with average and discounted rewards
U Siddique, P Weng, M Zimmer
International Conference on Machine Learning, 8905-8915, 2020
Cited by: 85, Year: 2020
Teacher-student framework: a reinforcement learning approach
M Zimmer, P Viappiani, P Weng
AAMAS Workshop autonomous robots and multirobot systems, 2014
Cited by: 79, Year: 2014
Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
R Busa-Fekete, B Szörényi, P Weng, W Cheng, E Hüllermeier
Machine learning 97, 327-351, 2014
Cited by: 69, Year: 2014
A survey on interpretable reinforcement learning
C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu
Machine Learning, 1-44, 2024
Cited by: 64, Year: 2024
On finding compromise solutions in multiobjective Markov decision processes
P Perny, P Weng
ECAI 2010, 969-970, 2010
Cited by: 64, Year: 2010
Dual sequential prediction models linking sequential recommendation and information dissemination
Q Wu, Y Gao, X Gao, P Weng, G Chen
Proceedings of the 25th ACM SIGKDD international conference on knowledge …, 2019
Cited by: 61, Year: 2019
Optimization of probabilistic argumentation with Markov decision models
E Hadoux, A Beynier, N Maudet, P Weng, A Hunter
International Joint Conference on Artificial Intelligence, 2015
Cited by: 55, Year: 2015
Qualitative multi-armed bandits: A quantile-based approach
B Szorenyi, R Busa-Fekete, P Weng, E Hüllermeier
International Conference on Machine Learning, 1660-1668, 2015
Cited by: 55, Year: 2015
Learning fair policies in decentralized cooperative multi-agent reinforcement learning
M Zimmer, C Glanois, U Siddique, P Weng
International Conference on Machine Learning, 12967-12978, 2021
Cited by: 47, Year: 2021
Multi-objective bandits: Optimizing the generalized Gini index
R Busa-Fekete, B Szörényi, P Weng, S Mannor
International Conference on Machine Learning, 625-634, 2017
Cited by: 44, Year: 2017
Decomposition methods for distributed optimal power flow: panorama and case studies of the DC model
MH Amini, S Bahrami, F Kamyab, S Mishra, R Jaddivada, K Boroojeni, ...
Classical and recent aspects of power system optimization, 137-155, 2018
Cited by: 43, Year: 2018
Interactive value iteration for Markov decision processes with unknown rewards
P Weng, B Zanuttini
IJCAI'13-Twenty-Third international joint conference on Artificial …, 2013
Cited by: 42, Year: 2013
Algebraic Markov decision processes
P Perny, O Spanjaard, P Weng
19th International Joint Conference on Artificial Intelligence, 1372-1377, 2005
Cited by: 42, Year: 2005
Invariant transform experience replay: Data augmentation for deep reinforcement learning
Y Lin, J Huang, M Zimmer, Y Guan, J Rojas, P Weng
IEEE Robotics and Automation Letters 5 (4), 6615-6622, 2020
Cited by: 40, Year: 2020
Sequential decision-making under non-stationary environments via sequential change-point detection
E Hadoux, A Beynier, P Weng
Learning over multiple contexts (LMCE), 2014
Cited by: 40, Year: 2014
Hierarchical electric vehicle charging aggregator strategy using Dantzig-Wolfe decomposition
MH Amini, P McNamara, P Weng, O Karabasoglu, Y Xu
IEEE Design & Test 35 (6), 25-36, 2017
Cited by: 36, Year: 2017
A survey of reinforcement learning from human feedback
T Kaufmann, P Weng, V Bengs, E Hüllermeier
arXiv preprint arXiv:2312.14925, 2023
Cited by: 35, Year: 2023