关注
João Carvalho
João Carvalho
在 ias.informatik.tu-darmstadt.de 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Motion planning diffusion: Learning and planning of robot motions with diffusion models
J Carvalho, AT Le, M Baierl, D Koert, J Peters
2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023
272023
A nonparametric off-policy policy gradient
S Tosatto, J Carvalho, H Abdulsamad, J Peters
International Conference on Artificial Intelligence and Statistics, 167-177, 2020
142020
Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation
J Carvalho, M Baeirl, J Urain, J Peters
NeurIPS 2022 Workshop on Score-Based Methods, 2022
72022
Adapting Object-Centric Probabilistic Movement Primitives with Residual Reinforcement Learning
J Carvalho, D Koert, M Daniv, J Peters
2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids), 2022
6*2022
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
J Carvalho, D Tateo, F Muratore, J Peters
International Joint Conference on Neural Networks, 2021
62021
Batch reinforcement learning with a nonparametric off-policy policy gradient
S Tosatto, J Carvalho, J Peters
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (10), 5996 …, 2021
52021
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
D Palenicek, M Lutter, J Carvalho, J Peters
International Conference on Learning Representations, 2023
12023
A Hierarchical Approach to Active Pose Estimation
J Hellwig, M Baierl, J Carvalho, J Urain, J Peters
arXiv preprint arXiv:2203.03919, 2022
12022
Integrated Bi-Manual Motion Generation and Control shaped for Probabilistic Movement Primitives
J Vorndamme, J Carvalho, R Laha, D Koert, L Figueredo, J Peters, ...
2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids), 2022
12022
An Analysis of Measure-Valued Derivatives for Policy Gradients
J Carvalho, J Peters
arXiv preprint arXiv:2203.03917, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–10