Pratik Gajane 个人学术档案

引用次数

	总计	2019 年至今
引用	757	728
h 指数	10	10
i10 指数	10	10

180

135

2016201720182019202020212022202320243 7 15 61 104 156 129 164 114

开放获取的出版物数量

查看全部

7 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Mykola PechenizkiyEindhoven University of Technology在 tue.nl 的电子邮件经过验证
Ronald OrtnerMontanuniversität Leoben在 unileoben.ac.at 的电子邮件经过验证
Peter AuerMontanuniversität Leoben在 unileoben.ac.at 的电子邮件经过验证
Tanguy UrvoyOrange在 orange.com 的电子邮件经过验证
Emilie KaufmannCNRS & Univ. Lille (CRIStAL)在 inria.fr 的电子邮件经过验证
fabrice clérotOrange Labs在 orange.com 的电子邮件经过验证
Chung-Wei LeeUniversity of Southern California在 usc.edu 的电子邮件经过验证
Haipeng LuoAssociate Professor, University of Southern California在 usc.edu 的电子邮件经过验证
Chen-Yu WeiAssistant Professor, University of Virginia在 virginia.edu 的电子邮件经过验证

关注

Pratik Gajane

未知所在单位机构

在 tue.nl 的电子邮件经过验证 - 首页

Sequential decision making Reinforcement learning Fairness in machine learning Privacy in machine learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
On formalizing fairness in prediction with machine learning P Gajane, M Pechenizkiy the 5th Workshop on Fairness, Accountability, and Transparency in Machine …, 2018	282	2018
Adaptively tracking the best bandit arm with an unknown number of distribution changes P Auer, P Gajane, R Ortner Conference on Learning Theory, 138-158, 2019	132	2019
Variational regret bounds for reinforcement learning R Ortner, P Gajane, P Auer Uncertainty in Artificial Intelligence, 81-90, 2020	69	2020
A sliding-window algorithm for markov decision processes with arbitrarily changing rewards and transitions P Gajane, R Ortner, P Auer Lifelong Learning: A Reinforcement Learning Approach Workshop at FAIM, 2018	51	2018
A relative exponential weighing algorithm for adversarial utility-based dueling bandits P Gajane, T Urvoy, F Clérot International Conference on Machine Learning, 218-227, 2015	47	2015
Corrupt bandits for preserving local privacy P Gajane, T Urvoy, E Kaufmann Algorithmic Learning Theory, 387-412, 2018	40	2018
Achieving optimal dynamic regret for non-stationary bandits without prior information P Auer, Y Chen, P Gajane, CW Lee, H Luo, R Ortner, CY Wei Conference on Learning Theory, 159-163, 2019	33	2019
Adaptively tracking the best arm with an unknown number of distribution changes P Auer, P Gajane, R Ortner European Workshop on Reinforcement Learning 14, 375, 2018	31	2018
Corrupt bandits P Gajane, T Urvoy, E Kaufmann EWRL, 2016	16	2016
Survey on fair reinforcement learning: Theory and practice P Gajane, A Saxena, M Tavakol, G Fletcher, M Pechenizkiy arXiv preprint arXiv:2205.10032, 2022	13	2022
Utility-based dueling bandits as a partial monitoring game P Gajane, T Urvoy In the 12th European Workshop on Reinforcement Learning (EWRL), 2015, 2015	7	2015
Lemon: Alternative sampling for more faithful explanation through local surrogate models D Collaris, P Gajane, J Jorritsma, JJ van Wijk, M Pechenizkiy International Symposium on Intelligent Data Analysis, 77-90, 2023	6	2023
The impact of batch learning in stochastic linear bandits D Provodin, P Gajane, M Pechenizkiy, M Kaptein 2022 IEEE International Conference on Data Mining (ICDM), 1149-1154, 2022	4	2022
Gambler bandits and the regret of being ruined FS Perotto, S Vakili, P Gajane, Y Faghan, M Bourgais 20th International Conference on Autonomous Agents and Multiagent Systems …, 2021	4	2021
Autonomous exploration for navigating in non-stationary CMPs P Gajane, R Ortner, P Auer, C Szepesvari arXiv preprint arXiv:1910.08446, 2019	4	2019
Counterfactual learning for machine translation: Degeneracies and solutions C Lawrence, P Gajane, S Riezler arXiv preprint arXiv:1711.08621, 2017	4	2017
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning J Li, P Gajane 16th European Workshop on Reinforcement Learning (EWRL), 2023	3	2023
Corrupt bandits for privacy preserving input P Gajane, T Urvoy, E Kaufmann arXiv preprint arXiv:1708.05033, 2017	3	2017
The impact of batch learning in stochastic bandits D Provodin, P Gajane, M Pechenizkiy, M Kaptein Workshop on Ecological Theory of Reinforcement Learning, 2021	2	2021
A Sliding-Window Approach for Reinforcement Learning in MDPs with Arbitrarily Changing Rewards and Transitions. P Gajane, R Ortner, P Auer	2	2018

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用