关注
Pengqian Yu
Pengqian Yu
IBM Research
在 ibm.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Model-based deep reinforcement learning for financial portfolio optimization
P Yu, JS Lee, I Kulyatin, Z Shi, S Dasgupta
Real-world Sequential Decision Making Workshop, ICML, 2019, 2019
112*2019
Distributionally robust counterpart in Markov decision processes
P Yu, H Xu
IEEE Transactions on Automatic Control 61 (9), 2538-2543, 2015
892015
Distributionally robust optimization for sequential decision-making
Z Chen, P Yu, WB Haskell
Optimization 68 (12), 2397-2426, 2019
412019
Fed+: A unified approach to robust personalized federated learning
P Yu, A Kundu, L Wynter, SH Lim
arXiv preprint arXiv:2009.06303, 2020
29*2020
Mutual information maximization in graph neural networks
X Di, P Yu, R Bu, M Sun
2020 International Joint Conference on Neural Networks (IJCNN), 1-7, 2020
292020
A universal empirical dynamic programming algorithm for continuous state MDPs
WB Haskell, R Jain, H Sharma, P Yu
IEEE Transactions on Automatic Control 65 (1), 115-129, 2019
29*2019
Robust path planning for flexible needle insertion using Markov decision processes
X Tan, P Yu, KB Lim, CK Chui
International Journal of Computer Assisted Radiology and Surgery 13, 1439-1451, 2018
292018
Approximate value iteration for risk-aware Markov decision processes
P Yu, WB Haskell, H Xu
IEEE Transactions on Automatic Control 63 (9), 3135-3142, 2018
272018
Robustness and personalization in federated learning: A unified approach via regularization
A Kundu, P Yu, L Wynter, SH Lim
2022 IEEE International Conference on Edge Computing and Communications …, 2022
162022
A study of enterprise performance management system based on KPI+ BSC
L Guanying, Y Kaichao, W Congcong, Y Pengqian
2010 3rd International Conference on Information Management, Innovation …, 2010
122010
LWA-HAND: Lightweight attention hand for interacting hand reconstruction
X Di, P Yu
European Conference on Computer Vision, 722-738, 2022
92022
Randomized function fitting-based empirical value iteration
WB Haskell, P Yu, H Sharma, R Jain
2017 IEEE 56th Annual Conference on Decision and Control (CDC), 2467-2472, 2017
92017
Dynamic programming for risk-aware sequential optimization
P Yu, WB Haskell, H Xu
2017 IEEE 56th Annual Conference on Decision and Control (CDC), 4934-4939, 2017
92017
Model-based deep reinforcement learning for dynamic portfolio optimization. arXiv 2019
P Yu, JS Lee, I Kulyatin, Z Shi, S Dasgupta
arXiv preprint arXiv:1901.08740, 0
9
Deep reinforcement learning for 3D furniture layout in indoor graphic scenes
X Di, P Yu
Reinforcement Learning for Real Life Workshop, ICML, 2021
8*2021
3D reconstruction of simple objects from a single view silhouette image
X Di, P Yu
arXiv preprint arXiv:1701.04752, 2017
82017
Model-based deep reinforcement learning for dynamic portfolio optimization. arXiv
P Yu, JS Lee, I Kulyatin, Z Shi, S Dasgupta
arXiv preprint arXiv:1901.08740, 2019
72019
Structural Plan of Indoor Scenes with Personalized Preferences
X Di, P Yu, H Zhu, L Cai, Q Sheng, C Sun, L Ran
Assistive Computer Vision and Robotics Workshop, ECCV, 455-468, 2020
62020
End-to-end generative floor-plan and layout with attributes and relation graph
X Di, P Yu, D Yang, H Zhu, C Sun, YD Liu
arXiv preprint arXiv:2012.08514, 2020
52020
Federated reinforcement learning for portfolio management
P Yu, L Wynter, SH Lim
Federated Learning: A Comprehensive Overview of Methods and Applications …, 2022
42022
系统目前无法执行此操作,请稍后再试。
文章 1–20