Thiago D. Simão 个人学术档案

引用次数

	总计	2019 年至今
引用	374	370
h 指数	9	9
i10 指数	9	9

180

135

2016201720182019202020212022202320242 1 7 8 29 67 162 96

开放获取的出版物数量

查看全部

20 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Matthijs T. J. SpaanDelft University of Technology在 tudelft.nl 的电子邮件经过验证
Nils JansenProfessor of Artificial Intelligence and Formal Methods, Ruhr-University Bochum在 rub.de 的电子邮件经过验证
Qisong YangDelft University of Technology在 tudelft.nl 的电子邮件经过验证
Simon TindemansTU Delft在 tudelft.nl 的电子邮件经过验证
Marnix SuilenPhD Candidate, Radboud University在 science.ru.nl 的电子邮件经过验证
Remi Tachet des Combes在 alpacaml.com 的电子邮件经过验证
Romain LarocheMicrosoft Research在 polytechnique.org 的电子邮件经过验证
David ParkerProfessor of Computer Science, University of Oxford在 cs.ox.ac.uk 的电子邮件经过验证
Danial KamranInstitute for Measurement and Control Systems, Karlsruhe Institute of Technology在 kit.edu 的电子邮件经过验证
Canmanie Teresa PonnambalamTNO在 tno.nl 的电子邮件经过验证
Alessandro FarinelliFull professor of Computer Science, University of Verona在 univr.it 的电子邮件经过验证
Alberto CastelliniUniversità degli studi di Verona在 univr.it 的电子邮件经过验证
Edoardo ZorziUniversità di Verona在 univr.it 的电子邮件经过验证
Federico BianchiUniversity of Verona在 univr.it 的电子邮件经过验证
Merlijn KralePhD, Radboud University Nijmegen在 ru.nl 的电子邮件经过验证
Thom BadingsPhD Candidate, Radboud University在 ru.nl 的电子邮件经过验证
Martin LauerKarlsruhe Institute of Technology在 kit.edu 的电子邮件经过验证
Johannes FischerKarlsruhe Institute of Technology (KIT)在 kit.edu 的电子邮件经过验证
Tal KachmanRadboud University在 donders.ru.nl 的电子邮件经过验证
Sebastian JungesAssistant Professor, Radboud University, Nijmegen在 ru.nl 的电子邮件经过验证

关注

Thiago D. Simão

Assistant Professor at Eindhoven University of Technology

在 tue.nl 的电子邮件经过验证 - 首页

decision making under uncertainty safe reinforcement learning offline reinforcement learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning Q Yang, TD Simão, SH Tindemans, MTJ Spaan AAAI, 10639-10646, 2021	114	2021
AlwaysSafe: Reinforcement learning without safety constraint violations during training TD Simão, N Jansen, MTJ Spaan AAMAS, 1226-1235, 2021	48	2021
Safety-constrained reinforcement learning with a distributional safety critic Q Yang, TD Simão, SH Tindemans, MTJ Spaan Machine Learning 112 (3), 859-887, 2023	36	2023
Safe Policy Improvement with an Estimated Baseline Policy TD Simão, R Laroche, R Tachet des Combes AAMAS, 1269-1277, 2020	33*	2020
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments TD Simão, MTJ Spaan AAAI, 4967-4974, 2019	32	2019
Robust anytime learning of Markov decision processes M Suilen, TD Simão, D Parker, N Jansen NeurIPS, 28790-28802, 2022	24	2022
Decision-making under uncertainty: beyond probabilities: Challenges and perspectives T Badings, TD Simão, M Suilen, N Jansen International Journal on Software Tools for Technology Transfer 25 (3), 375-391, 2023	12	2023
Safe policy improvement for POMDPs via finite-state controllers TD Simão, M Suilen, N Jansen AAAI, 15109-15117, 2023	12	2023
Structure Learning for Safe Policy Improvement TD Simão, MTJ Spaan IJCAI, 3453-3459, 2019	11	2019
Reinforcement Learning by Guided Safe Exploration Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan ECAI, 2858-2865, 2023	9*	2023
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation Y Hogewind, TD Simão, T Kachman, N Jansen ICLR, 2023	9	2023
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ... 2022 IEEE 25th International Conference on Intelligent Transportation …, 2022	9	2022
Scalable Safe Policy Improvement via Monte Carlo Tree Search A Castellini, F Bianchi, E Zorzi, TD Simão, A Farinelli, MTJ Spaan ICML, 3732-3756, 2023	5	2023
More for Less: Safe Policy Improvement With Stronger Performance Guarantees P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen IJCAI, 4406-4415, 2023	5	2023
Act-then-measure: reinforcement learning for partially observable environments with active measuring M Krale, TD Simão, N Jansen ICAPS, 212-220, 2023	5	2023
Recursive small-step multi-agent A* for dec-POMDPs W Koops, N Jansen, S Junges, TD Simão IJCAI, 5402-5410, 2023	2	2023
Planejamento probabilístico com becos sem saída TD Simão Universidade de São Paulo, 2017	2	2017
Utilização de algoritmos genéticos para otimização de soluções para o timetabling escolar TD SIMÃO Tese apresentada ao Departamento de Ciência da Computação da Universidade …, 2013	2	2013
Risk-aware curriculum generation for heavy-tailed task distributions C Koprulu, TD Simão, N Jansen, U Topcu UAI, 1132-1142, 2023	1	2023
Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments. TD Simão IJCAI, 6460-6461, 2019	1	2019

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用