关注
Mehdi Jafarnia Jahromi
Mehdi Jafarnia Jahromi
DeepMind
在 google.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
CY Wei, M Jafarnia-Jahromi, H Luo, H Sharma, R Jain
International Conference On Machine Learning (ICML), 2020
1022020
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
CY Wei, M Jafarnia-Jahromi, H Luo, R Jain
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
592021
Online Learning for Unknown Partially Observable MDPs
M Jafarnia-Jahromi, R Jain, A Nayyar
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
28*2022
Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
L Chen, M Jafarnia-Jahromi, R Jain, H Luo
Neural Information Processing Systems (NeurIPS), 2021
252021
Online Learning for Stochastic Shortest Path Model via Posterior Sampling
M Jafarnia-Jahromi, L Chen, R Jain, H Luo
arXiv preprint arXiv:2106.05335, 2021
202021
Approximate Relative Value Learning for Average-reward Continuous State MDPs
H Sharma, M Jafarnia-Jahromi, R Jain
Uncertainty in Artificial Intelligence (UAI), 2019
162019
Learning Zero-sum Stochastic Games with Posterior Sampling
M Jafarnia-Jahromi, R Jain, A Nayyar
arXiv preprint arXiv:2109.03396, 2021
82021
Non-indexability of the Stochastic Appointment Scheduling Problem
M Jafarnia-Jahromi, R Jain
Automatica 118, 109016, 2020
82020
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
M Jafarnia-Jahromi, CY Wei, R Jain, H Luo
arXiv preprint arXiv:2006.04354, 2020
82020
Online learning for cooperative multi-player multi-armed bandits
W Chang, M Jafarnia-Jahromi, R Jain
2022 IEEE 61st Conference on Decision and Control (CDC), 7248-7253, 2022
62022
PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning
M Jafarnia-Jahromi, T Chowdhury, HT Wu, S Mukherjee
18th IEEE International Conference On Machine Learning And Applications (ICMLA), 2019
42019
Posterior sampling-based online learning for the stochastic shortest path model
M Jafarnia-Jahromi, L Chen, R Jain, H Luo
Uncertainty in Artificial Intelligence, 922-931, 2023
12023
A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with an Arbitrary Opponent
MJ Jahromi, RA Jain, A Nayyar
International Conference on Artificial Intelligence and Statistics, 3880-3888, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–13