Open problems and fundamental limitations of reinforcement learning from human feedback S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ... Transactions on Machine Learning Research, 2023 | 427 | 2023 |
PRIMAL: Pathfinding Via Reinforcement and Imitation Multi-Agent Learning - Lifelong M Damani, Z Luo, E Wenzel, G Sartoretti IEEE Robotics and Automation Letters 6 (2), 2666-2673, 2021 | 153 | 2021 |
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World F Laurent, M Schneider, C Scheller, J Watson, J Li, Z Chen, Y Zheng, ... NeurIPS 2020 Competition and Demonstration Track, 275-301, 2021 | 32 | 2021 |
Distributed Reinforcement Learning for Robot Teams: A Review Y Wang, M Damani, P Wang, Y Cao, G Sartoretti Current Robotics Reports, 2022 | 24 | 2022 |
SocialLight: Distributed Cooperation Learning towards Network-Wide Traffic Signal Control H Goel, Y Zhang, M Damani, G Sartoretti AAMAS 2023, 1551–1559, 2023 | 9 | 2023 |
Mitigating generative agent social dilemmas JR Yocum Massachusetts Institute of Technology, 2024 | 4 | 2024 |
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation M Damani, I Shenfeld, A Peng, A Bobu, J Andreas arXiv preprint arXiv:2410.04707, 2024 | 3 | 2024 |
Formal contracts mitigate social dilemmas in multi-agent reinforcement learning A Haupt, P Christoffersen, M Damani, D Hadfield-Menell Autonomous Agents and Multi-Agent Systems 38 (2), 1-38, 2024 | 2 | 2024 |
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning E Akyürek, M Damani, L Qiu, H Guo, Y Kim, J Andreas arXiv preprint arXiv:2411.07279, 2024 | 1 | 2024 |
Multi-agent traffic signal control via distributed RL with spatial and temporal feature extraction Y Zhang, M Damani, G Sartoretti International Conference on Autonomous Agents and Multiagent Systems, 106-113, 2022 | 1 | 2022 |