关注
Pratik Gajane
Pratik Gajane
未知所在单位机构
在 tue.nl 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
On formalizing fairness in prediction with machine learning
P Gajane, M Pechenizkiy
the 5th Workshop on Fairness, Accountability, and Transparency in Machine …, 2018
2822018
Adaptively tracking the best bandit arm with an unknown number of distribution changes
P Auer, P Gajane, R Ortner
Conference on Learning Theory, 138-158, 2019
1322019
Variational regret bounds for reinforcement learning
R Ortner, P Gajane, P Auer
Uncertainty in Artificial Intelligence, 81-90, 2020
692020
A sliding-window algorithm for markov decision processes with arbitrarily changing rewards and transitions
P Gajane, R Ortner, P Auer
Lifelong Learning: A Reinforcement Learning Approach Workshop at FAIM, 2018
512018
A relative exponential weighing algorithm for adversarial utility-based dueling bandits
P Gajane, T Urvoy, F Clérot
International Conference on Machine Learning, 218-227, 2015
472015
Corrupt bandits for preserving local privacy
P Gajane, T Urvoy, E Kaufmann
Algorithmic Learning Theory, 387-412, 2018
402018
Achieving optimal dynamic regret for non-stationary bandits without prior information
P Auer, Y Chen, P Gajane, CW Lee, H Luo, R Ortner, CY Wei
Conference on Learning Theory, 159-163, 2019
332019
Adaptively tracking the best arm with an unknown number of distribution changes
P Auer, P Gajane, R Ortner
European Workshop on Reinforcement Learning 14, 375, 2018
312018
Corrupt bandits
P Gajane, T Urvoy, E Kaufmann
EWRL, 2016
162016
Survey on fair reinforcement learning: Theory and practice
P Gajane, A Saxena, M Tavakol, G Fletcher, M Pechenizkiy
arXiv preprint arXiv:2205.10032, 2022
132022
Utility-based dueling bandits as a partial monitoring game
P Gajane, T Urvoy
In the 12th European Workshop on Reinforcement Learning (EWRL), 2015, 2015
72015
Lemon: Alternative sampling for more faithful explanation through local surrogate models
D Collaris, P Gajane, J Jorritsma, JJ van Wijk, M Pechenizkiy
International Symposium on Intelligent Data Analysis, 77-90, 2023
62023
The impact of batch learning in stochastic linear bandits
D Provodin, P Gajane, M Pechenizkiy, M Kaptein
2022 IEEE International Conference on Data Mining (ICDM), 1149-1154, 2022
42022
Gambler bandits and the regret of being ruined
FS Perotto, S Vakili, P Gajane, Y Faghan, M Bourgais
20th International Conference on Autonomous Agents and Multiagent Systems …, 2021
42021
Autonomous exploration for navigating in non-stationary CMPs
P Gajane, R Ortner, P Auer, C Szepesvari
arXiv preprint arXiv:1910.08446, 2019
42019
Counterfactual learning for machine translation: Degeneracies and solutions
C Lawrence, P Gajane, S Riezler
arXiv preprint arXiv:1711.08621, 2017
42017
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
J Li, P Gajane
16th European Workshop on Reinforcement Learning (EWRL), 2023
32023
Corrupt bandits for privacy preserving input
P Gajane, T Urvoy, E Kaufmann
arXiv preprint arXiv:1708.05033, 2017
32017
The impact of batch learning in stochastic bandits
D Provodin, P Gajane, M Pechenizkiy, M Kaptein
Workshop on Ecological Theory of Reinforcement Learning, 2021
22021
A Sliding-Window Approach for Reinforcement Learning in MDPs with Arbitrarily Changing Rewards and Transitions.
P Gajane, R Ortner, P Auer
22018
系统目前无法执行此操作,请稍后再试。
文章 1–20