Motion planning diffusion: Learning and planning of robot motions with diffusion models J Carvalho, AT Le, M Baierl, D Koert, J Peters 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023 | 27 | 2023 |
A nonparametric off-policy policy gradient S Tosatto, J Carvalho, H Abdulsamad, J Peters International Conference on Artificial Intelligence and Statistics, 167-177, 2020 | 14 | 2020 |
Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation J Carvalho, M Baeirl, J Urain, J Peters NeurIPS 2022 Workshop on Score-Based Methods, 2022 | 7 | 2022 |
Adapting Object-Centric Probabilistic Movement Primitives with Residual Reinforcement Learning J Carvalho, D Koert, M Daniv, J Peters 2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids), 2022 | 6* | 2022 |
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients J Carvalho, D Tateo, F Muratore, J Peters International Joint Conference on Neural Networks, 2021 | 6 | 2021 |
Batch reinforcement learning with a nonparametric off-policy policy gradient S Tosatto, J Carvalho, J Peters IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (10), 5996 …, 2021 | 5 | 2021 |
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning D Palenicek, M Lutter, J Carvalho, J Peters International Conference on Learning Representations, 2023 | 1 | 2023 |
A Hierarchical Approach to Active Pose Estimation J Hellwig, M Baierl, J Carvalho, J Urain, J Peters arXiv preprint arXiv:2203.03919, 2022 | 1 | 2022 |
Integrated Bi-Manual Motion Generation and Control shaped for Probabilistic Movement Primitives J Vorndamme, J Carvalho, R Laha, D Koert, L Figueredo, J Peters, ... 2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids), 2022 | 1 | 2022 |
An Analysis of Measure-Valued Derivatives for Policy Gradients J Carvalho, J Peters arXiv preprint arXiv:2203.03917, 2022 | | 2022 |