Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees M Hasanbeig, Y Kantaros, A Abate, D Kroening, GJ Pappas, I Lee IEEE Conference on Decision and Control (CDC), 2019 | 140 | 2019 |
Logically-Constrained Reinforcement Learning M Hasanbeig, A Abate, D Kroening arXiv preprint arXiv:1801.08099, 2018 | 116 | 2018 |
Cautious Reinforcement Learning with Logical Constraints M Hasanbeig, A Abate, D Kroening AAMAS, 483-491, 2020 | 99 | 2020 |
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic M Cai, M Hasanbeig, S Xiao, A Abate, Z Kan IEEE Robotics and Automation Letters and IROS, 2021 | 82 | 2021 |
Certified reinforcement learning with logic guidance H Hasanbeig, D Kroening, A Abate Artificial Intelligence 322, 103949, 2023 | 64 | 2023 |
Deep Reinforcement Learning with Temporal Logics M Hasanbeig, D Kroening, A Abate International Conference on Formal Modeling and Analysis of Timed Systems, 1-22, 2020 | 63 | 2020 |
DeepSynth: Program Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening AAAI Conference on Artificial Intelligence (AAAI-21), 2021 | 52* | 2021 |
Logically-Constrained Neural Fitted Q-iteration M Hasanbeig, A Abate, D Kroening AAMAS, 2012-2014, 2019 | 51 | 2019 |
Modular Deep Reinforcement Learning with Temporal Logic Specifications LZ Yuan, M Hasanbeig, A Abate, D Kroening arXiv preprint arXiv:1909.11591, 2019 | 49 | 2019 |
Evaluating cognitive maps in large language models with CogEval: No emergent planning I Momennejad, H Hasanbeig, FV Frujeri, H Sharma, RO Ness, N Jojic, ... Advances in neural information processing systems 36, 2023 | 32* | 2023 |
Towards Verifiable and Safe Model-free Reinforcement Learning M Hasanbeig, D Kroening, A Abate Workshop on Artificial Intelligence and Formal Verification, Logics …, 2020 | 28* | 2020 |
Shielding Atari Games with Bounded Prescience M Giacobbe, M Hasanbeig, D Kroening, H Wijk International Conference on Autonomous Agents and Multiagent Systems, 2021 | 25 | 2021 |
DeepSynth: Program synthesis for automatic task segmentation in deep reinforcement learning M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening arXiv preprint arXiv:1911.10244, 2019 | 19 | 2019 |
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning M Hasanbeig, D Kroening, A Abate International Conference on Quantitative Evaluation of Systems, 217-231, 2022 | 15 | 2022 |
On Synchronous Binary Log-Linear Learning and Second Order Q-learning M Hasanbeig, L Pavel IFAC World Congress 50 (1), 8987-8992, 2017 | 12 | 2017 |
Distributed Coverage Control by Robot Networks in Unknown Environments using a Modified EM Algorithm M Hasanbeig, L Pavel International Journal of Computer and Information Engineering 11 (7), 815-823, 2017 | 8 | 2017 |
From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning M Hasanbeig, L Pavel arXiv preprint arXiv:1802.02277, 2018 | 7 | 2018 |
ALLURE: A systematic protocol for auditing and improving LLM-based evaluation of text using iterative in-context-learning H Hasanbeig, H Sharma, L Betthauser, FV Frujeri, I Momennejad arXiv preprint arXiv:2309.13701, 2023 | 5 | 2023 |
Jump operator planning: Goal-conditioned policy ensembles and zero-shot transfer TJ Ringstrom, M Hasanbeig, A Abate arXiv preprint arXiv:2007.02527, 2020 | 5 | 2020 |
Logically-correct reinforcement learning. CoRR abs/1801.08099 M Hasanbeig, A Abate, D Kroening | 5* | 2017 |