On the Planning Abilities of Large Language Models--A Critical Investigation K Valmeekam, M Marquez, S Sreedharan, S Kambhampati arXiv preprint arXiv:2305.15771, 2023 | 130 | 2023 |
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems K Stechly, M Marquez, S Kambhampati arXiv preprint arXiv:2310.12397, 2023 | 56 | 2023 |
On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark) K Valmeekam, S Sreedharan, M Marquez, A Olmo, S Kambhampati arXiv preprint arXiv:2302.06706, 2023 | 56 | 2023 |
Can Large Language Models Really Improve by Self-critiquing Their Own Plans? K Valmeekam, M Marquez, S Kambhampati arXiv preprint arXiv:2310.08118, 2023 | 47 | 2023 |
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion U Soni, S Sreedharan, M Verma, L Guan, M Marquez, S Kambhampati arXiv preprint arXiv:2210.15096, 2022 | 5 | 2022 |