Samuele Tosatto: Academic Profile

Citations

	All	Since 2019
Citations	197	188
h-index	5	5
i10-index	4	4

Citations per year

Year	2018	2019	2020	2021	2022	2023	2024
Citations	8	17	23	29	46	42	31

Public access

4 articles available, 0 articles not available (based on funding mandates)

Co-authors

Jan Peters: Professor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKI (verified email at ias.tu-darmstadt.de)
A. Rupam Mahmood: University of Alberta, Amii (verified email at ualberta.ca)
Marcello Restelli: Associate Professor, Politecnico di Milano (verified email at polimi.it)
Carlo D'Eramo: Professor of Reinforcement Learning @ University of Würzburg | Group leader @ TU Darmstadt (verified email at uni-wuerzburg.de)
Univ.-Prof. Dr. Elmar Rueckert: Chair of Cyber-Physical-Systems at Montanuniversität Leoben (verified email at ai-lab.science)
Matteo Pirotta: Research Scientist, Meta (FAIR) (verified email at fb.com)
Martin Jagersand: University of Alberta (verified email at cs.ualberta.ca)
Georgia Chalvatzaki: Professor for Interactive Robot Perception and Learning, Technische Universität Darmstadt (verified email at tu-darmstadt.de)
João Carvalho: Technische Universität Darmstadt (verified email at ias.informatik.tu-darmstadt.de)
Hany Abdulsamad: Postdoc, Aalto University (verified email at aalto.fi)
Joni Pajarinen: Associate Professor at Aalto University (verified email at aalto.fi)
Riad Akrour: Inria Scool (verified email at inria.fr)
Andrew Patterson: University of Alberta (verified email at ualberta.ca)
Martha White: University of Alberta (verified email at ualberta.ca)

Samuele Tosatto

Assistant Professor @ Universität Innsbruck

Verified email at uibk.ac.at

Robot Learning, Reinforcement Learning, Machine Learning


Title	Cited by	Year
Learning inverse dynamics models in O(n) time with LSTM networks. E Rueckert, M Nakatenus, S Tosatto, J Peters. 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids …, 2017	79	2017
Boosted Fitted Q-Iteration. S Tosatto, C D'Eramo, M Pirotta, M Restelli. International Conference on Machine Learning, 2017	47	2017
Contextual latent-movements off-policy optimization for robotic manipulation skills. S Tosatto, G Chalvatzaki, J Peters. 2021 IEEE International Conference on Robotics and Automation (ICRA), 10815 …, 2021	17	2021
A Nonparametric Off-Policy Policy Gradient. S Tosatto, J Carvalho, H Abdulsamad, J Peters. International Conference on Artificial Intelligence and Statistics (AISTATS), 2020	14	2020
Model-free Policy Learning with Reward Gradients. Q Lan, S Tosatto, H Farrahi, A Mahmood. arXiv preprint arXiv:2103.05147, 2021	9	2021
Dynamic Decision Frequency with Continuous Options. A Karimi, J Jin, J Luo, AR Mahmood, M Jagersand, S Tosatto. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023	5	2023
An alternate policy gradient estimator for softmax policies. S Garg, S Tosatto, Y Pan, M White, AR Mahmood. arXiv preprint arXiv:2112.11622, 2021	5	2021
Batch reinforcement learning with a nonparametric off-policy policy gradient. S Tosatto, J Carvalho, J Peters. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (10), 5996 …, 2021	5	2021
An upper bound of the bias of Nadaraya-Watson kernel regression under Lipschitz assumptions. S Tosatto, R Akrour, J Peters. Stats 4 (1), 1-17, 2020	5	2020
Exploration Driven By an Optimistic Bellman Equation. S Tosatto, C D'Eramo, J Pajarinen, M Restelli, J Peters. International Joint Conference on Neural Networks, 2019	5	2019
Deep probabilistic movement primitives with a Bayesian aggregator. M Przystupa, F Haghverd, M Jagersand, S Tosatto. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023	3	2023
A temporal-difference approach to policy gradient estimation. S Tosatto, A Patterson, M White, R Mahmood. International Conference on Machine Learning, 21609-21632, 2022	2	2022
A Gradient Critic for Policy Gradient Estimation. S Tosatto, A Patterson, M White, AR Mahmood. Sixteenth European Workshop on Reinforcement Learning, 2023	1	2023
Variable-Decision Frequency Option Critic. A Karimi, J Jin, J Luo, AR Mahmood, M Jägersand, S Tosatto. CoRR, 2022		2022
Off-Policy Reinforcement Learning for Robotics. S Tosatto. Technische Universität Darmstadt, 2021		2021
Dimensionality Reduction of Movement Primitives in Parameter Space. S Tosatto, J Stadtmüller, J Peters. arXiv preprint arXiv:2003.02634, 2020		2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions. S Tosatto, R Akrour, J Peters. Stats 2021, 4, 1–17		2020
Technical Report: “Exploration Driven by an Optimistic Bellman Equation”. S Tosatto, C D’Eramo, J Pajarinen, M Restelli, J Peters		2018
Pink Noise LQR: How does Colored Noise affect the Optimal Policy in RL? J Hollenstein, M Zaric, S Tosatto, J Piater. ICML 2024 Workshop: Foundations of Reinforcement Learning and Control …
Making Policy Gradient Estimators for Softmax Policies More Robust to Non-stationarities. S Garg, S Tosatto, Y Pan, M White, AR Mahmood
