关注
Kishan Panaganti
Kishan Panaganti
其他姓名Kishan Panaganti Badrinath, Kishan Badrinath
在 caltech.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Sample complexity of robust reinforcement learning with a generative model
K Panaganti, D Kalathil
International Conference on Artificial Intelligence and Statistics, 9582-9602, 2022
722022
Robust reinforcement learning using least squares policy iteration with provable performance guarantees
K Panaganti, D Kalathil
International Conference on Machine Learning, 511-520, 2021
692021
Robust reinforcement learning using offline data
K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh
Advances in neural information processing systems 35, 32211-32224, 2022
682022
Improved sample complexity bounds for distributionally robust reinforcement learning
Z Xu, K Panaganti, D Kalathil
International Conference on Artificial Intelligence and Statistics, 9728-9754, 2023
302023
Personalized reward learning with interaction-grounded learning (IGL)
J Maghakian, P Mineiro, K Panaganti, M Rucker, A Saran, C Tan
The Eleventh International Conference on Learning Representations (ICLR), 2023
62023
Bridging distributionally robust learning and offline rl: An approach to mitigate distribution shift and partial data coverage
K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh
arXiv preprint arXiv:2310.18434, 2023
52023
Sample complexity of model-based robust reinforcement learning
K Panaganti, D Kalathil
2021 60th IEEE Conference on Decision and Control (CDC), 2240-2245, 2021
42021
Distributionally Robust Behavioral Cloning for Robust Imitation Learning
K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh
Conference on Decision and Control, 2023
22023
Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Z Zhang, K Panaganti, L Shi, Y Sui, A Wierman, Y Yue
Reinforcement Learning Journal 1 (1), 2024
12024
Model-Free Robust -Divergence Reinforcement Learning Using Both Offline and Online Data
K Panaganti, A Wierman, E Mazumdar
Forty-first International Conference on Machine Learning (ICML), 2024
12024
Bounded regret for finitely parameterized multi-armed bandits
K Panaganti, D Kalathil, P Varaiya
Stochastic Analysis, Filtering, and Stochastic Optimization: A Commemorative …, 2022
12022
Tractable Equilibrium Computation in Markov Games through Risk Aversion
E Mazumdar, K Panaganti, L Shi
arXiv preprint arXiv:2406.14156, 2024
2024
Robust Reinforcement Learning: Theory and Algorithms
K Panaganti Badrinath
2023
Interaction-Grounded Learning for Recommender Systems.
J Maghakian, K Panaganti, P Mineiro, A Saran, C Tan
ORSUM@ RecSys, 2022
2022
Off-Policy Evaluation Using Information Borrowing and Context-Based Switching
S Dasgupta, Y Niu, K Panaganti, D Kalathil, D Pati, B Mallick
arXiv preprint arXiv:2112.09865, 2021
2021
系统目前无法执行此操作,请稍后再试。
文章 1–15