关注
Anas Barakat
标题
引用次数
引用次数
年份
Convergence and dynamical behavior of the Adam algorithm for non-convex stochastic optimization
A Barakat, P Bianchi
SIAM Journal on Optimization 31 (1), 244-274, 2020
1192020
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
I Fatkhullin, A Barakat, A Kireeva, N He
ICML 2023 - Proceedings of the 40th International Conference on Machine Learning, 2023
372023
Convergence Rates of a Momentum Algorithm with Bounded Adaptive Step Size for Nonconvex Optimization
A Barakat, P Bianchi
ACML 2020 - Proceedings of the 12th Asian Conference on Machine Learning, 2020
35*2020
Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance
A Barakat, P Bianchi, W Hachem, S Schechtman
Electronic Journal of Statistics 15 (2), 3892-3947, 2021
182021
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
A Barakat, I Fatkhullin, N He
ICML 2023 - Proceedings of the 40th International Conference on Machine Learning, 2023
152023
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
A Barakat, P Bianchi, J Lehmann
AISTATS 2022 - 25th International Conference on Artificial Intelligence and …, 2022
152022
Independent Learning in Constrained Markov Potential Games
P Jordan, A Barakat, N He
AISTATS 2024 - 27th International Conference on Artificial Intelligence and …, 2024
22024
Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity
J Wu, A Barakat, I Fatkhullin, N He
CDC 2023 - 62nd IEEE Conference on Decision and Control, 2023
12023
Policy Mirror Descent with Lookahead
K Protopapas, A Barakat
NeurIPS 2024 - Proceedings of the 38th Annual Conference on Neural …, 2024
2024
On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
A Barakat, S Chakraborty, P Yu, P Tokekar, AS Bedi
arXiv preprint, 2024
2024
Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
O Lepel, A Barakat
arXiv preprint, 2024
2024
Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players
P Alatur, A Barakat, N He
CDC 2024 - 63rd IEEE Conference on Decision and Control, 2024
2024
Contributions to non-convex stochastic optimization and reinforcement learning
A Barakat
PhD Thesis, Institut Polytechnique de Paris, 2021
2021
系统目前无法执行此操作,请稍后再试。
文章 1–13