Markov decision processes in artificial intelligence John Wiley & Sons, 2010 | 373* | 2010 |
Path integral policy improvement with covariance matrix adaptation F Stulp, O Sigaud arXiv preprint arXiv:1206.4621, 2012 | 258 | 2012 |
Many regression algorithms, one unified model: A review F Stulp, O Sigaud Neural Networks 69, 60-79, 2015 | 247 | 2015 |
CURIOUS: Intrinsically motivated multi-task multi-goal reinforcement learning C Colas, P Fournier, O Sigaud, PY Oudeyer | 217* | 2018 |
Anticipatory behavior in adaptive learning systems: From brains to individual and social behavior MV Butz, O Sigaud, G Pezzulo, G Baldassarre Lecture Notes In Artificial Intelligence, Springer, 2007 | 200* | 2007 |
Robot skill learning: From reinforcement learning to evolution strategies F Stulp, O Sigaud Paladyn, Journal of Behavioral Robotics 4 (1), 49-61, 2013 | 173 | 2013 |
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms C Colas, O Sigaud, PY Oudeyer International Conference in Machine Learning (ICML), 2018 | 171 | 2018 |
On-line regression algorithms for learning mechanical models of robots: a survey O Sigaud, C Salaün, V Padois Robotics and Autonomous Systems 59 (12), 1115-1129, 2011 | 171 | 2011 |
CEM-RL: Combining evolutionary and gradient-based methods for policy search A Pourchot, O Sigaud International Conference on Learning Representations (ICLR), 2018 | 170 | 2018 |
Learning the structure of factored markov decision processes in reinforcement learning problems T Degris, O Sigaud, PH Wuillemin Proceedings of the 23rd international conference on Machine learning, 257-264, 2006 | 158 | 2006 |
Learning classifier systems: a survey O Sigaud, SW Wilson Soft Computing 11, 1065-1078, 2007 | 148 | 2007 |
Anticipatory behavior in adaptive learning systems: Foundations, Theories and Systems MV Butz, O Sigaud, P Gérard Lecture Notes in Artificial Intelligence,, 2003 | 128 | 2003 |
Anticipatory behavior: Exploiting knowledge about the future to improve current behavior MV Butz, O Sigaud, P Gérard Anticipatory behavior in adaptive learning systems: Foundations, theories …, 2003 | 125 | 2003 |
Internal models and anticipations in adaptive learning systems MV Butz, O Sigaud, P Gerard Anticipatory behavior in adaptive learning systems: Foundations, theories …, 2003 | 123 | 2003 |
The problem with DDPG: understanding failures in deterministic environments with sparse rewards G Matheron, N Perrin, O Sigaud arXiv preprint arXiv:1911.11679, 2019 | 120* | 2019 |
Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey C Colas, T Karch, O Sigaud, PY Oudeyer Journal of Artificial Intelligence Research 74, 1159-1199, 2022 | 111 | 2022 |
Modelling individual differences in the form of Pavlovian conditioned approach responses: a dual learning systems approach with factored representations F Lesaint, O Sigaud, SB Flagel, TE Robinson, M Khamassi PLoS computational biology 10 (2), e1003466, 2014 | 107 | 2014 |
How many random seeds? statistical power analysis in deep reinforcement learning experiments C Colas, O Sigaud, PY Oudeyer arXiv preprint arXiv:1806.08295, 2018 | 104 | 2018 |
Unsupervised learning of goal spaces for intrinsically motivated goal exploration A Péré, S Forestier, O Sigaud, PY Oudeyer arXiv preprint arXiv:1803.00781, 2018 | 104 | 2018 |
Policy Search in Continuous Action Domains: an Overview O Sigaud, F Stulp Neural Networks, https://doi.org/10.1016/j.neunet.2019.01, 2018 | 102 | 2018 |