Convergence and dynamical behavior of the Adam algorithm for non-convex stochastic optimization A Barakat, P Bianchi SIAM Journal on Optimization 31 (1), 244-274, 2020 | 119 | 2020 |
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies I Fatkhullin, A Barakat, A Kireeva, N He ICML 2023 - Proceedings of the 40th International Conference on Machine Learning, 2023 | 37 | 2023 |
Convergence Rates of a Momentum Algorithm with Bounded Adaptive Step Size for Nonconvex Optimization A Barakat, P Bianchi ACML 2020 - Proceedings of the 12th Asian Conference on Machine Learning, 2020 | 35* | 2020 |
Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance A Barakat, P Bianchi, W Hachem, S Schechtman Electronic Journal of Statistics 15 (2), 3892-3947, 2021 | 18 | 2021 |
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space A Barakat, I Fatkhullin, N He ICML 2023 - Proceedings of the 40th International Conference on Machine Learning, 2023 | 15 | 2023 |
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation A Barakat, P Bianchi, J Lehmann AISTATS 2022 - 25th International Conference on Artificial Intelligence and …, 2022 | 15 | 2022 |
Independent Learning in Constrained Markov Potential Games P Jordan, A Barakat, N He AISTATS 2024 - 27th International Conference on Artificial Intelligence and …, 2024 | 2 | 2024 |
Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity J Wu, A Barakat, I Fatkhullin, N He CDC 2023 - 62nd IEEE Conference on Decision and Control, 2023 | 1 | 2023 |
Policy Mirror Descent with Lookahead K Protopapas, A Barakat NeurIPS 2024 - Proceedings of the 38th Annual Conference on Neural …, 2024 | | 2024 |
On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning A Barakat, S Chakraborty, P Yu, P Tokekar, AS Bedi arXiv preprint, 2024 | | 2024 |
Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning O Lepel, A Barakat arXiv preprint, 2024 | | 2024 |
Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players P Alatur, A Barakat, N He CDC 2024 - 63rd IEEE Conference on Decision and Control, 2024 | | 2024 |
Contributions to non-convex stochastic optimization and reinforcement learning A Barakat PhD Thesis, Institut Polytechnique de Paris, 2021 | | 2021 |