Sample complexity of robust reinforcement learning with a generative model K Panaganti, D Kalathil International Conference on Artificial Intelligence and Statistics, 9582-9602, 2022 | 72 | 2022 |
Robust reinforcement learning using least squares policy iteration with provable performance guarantees K Panaganti, D Kalathil International Conference on Machine Learning, 511-520, 2021 | 69 | 2021 |
Robust reinforcement learning using offline data K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh Advances in neural information processing systems 35, 32211-32224, 2022 | 68 | 2022 |
Improved sample complexity bounds for distributionally robust reinforcement learning Z Xu, K Panaganti, D Kalathil International Conference on Artificial Intelligence and Statistics, 9728-9754, 2023 | 30 | 2023 |
Personalized reward learning with interaction-grounded learning (IGL) J Maghakian, P Mineiro, K Panaganti, M Rucker, A Saran, C Tan The Eleventh International Conference on Learning Representations (ICLR), 2023 | 6 | 2023 |
Bridging distributionally robust learning and offline RL: An approach to mitigate distribution shift and partial data coverage K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh arXiv preprint arXiv:2310.18434, 2023 | 5 | 2023 |
Sample complexity of model-based robust reinforcement learning K Panaganti, D Kalathil 2021 60th IEEE Conference on Decision and Control (CDC), 2240-2245, 2021 | 4 | 2021 |
Distributionally Robust Behavioral Cloning for Robust Imitation Learning K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh Conference on Decision and Control, 2023 | 2 | 2023 |
Distributionally Robust Constrained Reinforcement Learning under Strong Duality Z Zhang, K Panaganti, L Shi, Y Sui, A Wierman, Y Yue Reinforcement Learning Journal 1 (1), 2024 | 1 | 2024 |
Model-Free Robust φ-Divergence Reinforcement Learning Using Both Offline and Online Data K Panaganti, A Wierman, E Mazumdar Forty-first International Conference on Machine Learning (ICML), 2024 | 1 | 2024 |
Bounded regret for finitely parameterized multi-armed bandits K Panaganti, D Kalathil, P Varaiya Stochastic Analysis, Filtering, and Stochastic Optimization: A Commemorative …, 2022 | 1 | 2022 |
Tractable Equilibrium Computation in Markov Games through Risk Aversion E Mazumdar, K Panaganti, L Shi arXiv preprint arXiv:2406.14156, 2024 | | 2024 |
Robust Reinforcement Learning: Theory and Algorithms K Panaganti Badrinath | | 2023 |
Interaction-Grounded Learning for Recommender Systems J Maghakian, K Panaganti, P Mineiro, A Saran, C Tan ORSUM@RecSys, 2022 | | 2022 |
Off-Policy Evaluation Using Information Borrowing and Context-Based Switching S Dasgupta, Y Niu, K Panaganti, D Kalathil, D Pati, B Mallick arXiv preprint arXiv:2112.09865, 2021 | | 2021 |