Safe option-critic: learning safety in the option-critic architecture A Jain, K Khetarpal, D Precup The Knowledge Engineering Review 36, e4, 2021 | 36 | 2021 |
Variance Penalized On-Policy and Off-Policy Actor-Critic A Jain, G Patil, A Jain, K Khetarpal, D Precup Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), 2021, 2021 | 12 | 2021 |
Towards painless policy optimization for constrained mdps A Jain, S Vaswani, R Babanezhad, C Szepesvari, D Precup Uncertainty in Artificial Intelligence, 895-905, 2022 | 8 | 2022 |
Adaptive Exploration for Data-Efficient General Value Function Evaluations A Jain, JP Hanna, D Precup arXiv preprint arXiv:2405.07838, 2024 | 1 | 2024 |
Safety using constraint variance in policy-gradient methods A Jain McGill University, 2020 | | 2020 |
Safe Policy Learning with Constrained Return Variance A Jain Advances in Artificial Intelligence: 32nd Canadian Conference on Artificial …, 2019 | | 2019 |
Towards Painless Policy Optimization for Constrained MDPs: Supplementary material A Jain, S Vaswani, R Babanezhad, C Szepesvári, D Precup | | |
Robust Constrained MDPs A Jain, S Vaswani, R Babanezhad, D Precup, C Szepesvari | | |
Safe Actor-Critic A Jain, A Jain, D Precup | | |
Safe Hierarchical Policy Optimization using Constrained Return Variance in Options A Jain, D Precup | | |
Learning Options using Constrained Return Variance A Jain, D Precup | | |
Safe Option-Critic A JAIN, K KHETARPAL, D PRECUP | | |