Valmeekam, Karthik
13 publications
TMLR
2025
A Systematic Evaluation of the Planning and Scheduling Abilities of the Reasoning Model o1
ICLR
2025
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
NeurIPS
2023
Leveraging Pre-Trained Large Language Models to Construct and Utilize World Models for Model-Based Task Planning
NeurIPS
2023
PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning About Change