GHEORGHE COMANICI: Academic Profile

Cited by

	All	Since 2019
Citations	379	309
h-index	10	9
i10-index	10	8

Citations per year

Year	Citations
2011	7
2012	7
2013	11
2014	10
2015	9
2016	7
2017	11
2018	6
2019	9
2020	28
2021	47
2022	91
2023	79
2024	55

Public access

1 article available

0 articles not available

Based on funding mandates

GHEORGHE COMANICI

Research Scientist, DeepMind

Verified email at deepmind.com

Reinforcement Learning, Hierarchical Behavior, Bisimulation metrics, Spectral Learning


Title	Cited by	Year
The option keyboard: Combining skills in reinforcement learning. A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ... Advances in Neural Information Processing Systems 32, 2019	97	2019
What can I do here? A theory of affordances in reinforcement learning. K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup. International Conference on Machine Learning, 5243-5253, 2020	67	2020
Optimal policy switching algorithms for reinforcement learning. G Comanici, D Precup. Proceedings of the 9th International Conference on Autonomous Agents and …, 2010	47	2010
AndroidEnv: A reinforcement learning platform for Android. D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021	46	2021
On-the-fly algorithms for bisimulation metrics. G Comanici, P Panangaden, D Precup. 2012 Ninth International Conference on Quantitative Evaluation of Systems …, 2012	24	2012
Basis function discovery using spectral clustering and bisimulation metrics. G Comanici, D Precup. International Workshop on Adaptive and Learning Agents, 85-99, 2011	21	2011
Representation discovery for MDPs using bisimulation metrics. S Ruan, G Comanici, P Panangaden, D Precup. Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	17	2015
Temporally abstract partial models. K Khetarpal, Z Ahmed, G Comanici, D Precup. Advances in Neural Information Processing Systems 34, 1979-1991, 2021	11	2021
Knowledge representation for reinforcement learning using general value functions. G Comanici, D Precup, A Barreto, DK Toyama, E Aygün, P Hamel, ...	10	2018
An empirical analysis of off-policy learning in discrete MDPs. C Păduraru, D Precup, J Pineau, G Comănici. European Workshop on Reinforcement Learning, 89-102, 2013	10	2013
Basis refinement strategies for linear value function approximation in MDPs. G Comanici, D Precup, P Panangaden. Advances in Neural Information Processing Systems 28, 2015	9	2015
AndroidEnv: A Reinforcement Learning Platform for Android. D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint cs.LG/2105.13231, 2021	5	2021
What can I do here? A theory of affordances in reinforcement learning. K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup. arXiv [cs.LG], 2020	5	2020
A study of off-policy learning in computational sustainability. C Paduraru, D Precup, J Pineau, G Comanici. European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012	4	2012
Vision-Language Models as a Source of Rewards. K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ... arXiv preprint arXiv:2312.09187, 2023	3	2023
Finding increasingly large extremal graphs with AlphaZero and tabu search. A Mehrabian, A Anand, H Kim, N Sonnerat, M Balog, G Comanici, ... arXiv preprint arXiv:2311.03583, 2023	3	2023
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning. G Comanici, A Glaese, A Gergely, D Toyama, Z Ahmed, T Jackson, ... arXiv preprint arXiv:2204.10374, 2022		2022
Representation discovery for Markov decision processes using behavioural similarity. G Comanici. McGill University (Canada), 2016		2016
Optimal Time Scales for Reinforcement Learning Behaviour Strategies. G Comanici, D Precup. McGill University, 2010		2010

Articles 1–19
