Aghajohari, Milad

8 publications

TMLR 2026. DeepSeek-R1 Thoughtology: Let’s Think About LLM Reasoning. Sara Vera Marjanovic, Arkil Patel, Vaibhav Adlakha, Milad Aghajohari, Parishad BehnamGhader, Mehar Bhatia, Aditi Khandelwal, Austin Kraft, Benno Krojer, Xing Han Lù, Nicholas Meade, Dongchan Shin, Amirhossein Kazemnejad, Gaurav Kamath, Marius Mosbach, Karolina Stanczak, Siva Reddy
ICLR 2025. Advantage Alignment Algorithms. Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, Razvan Ciuca, Tianyu Zhang, Gauthier Gidel, Aaron Courville
ICML 2025. VinePPO: Refining Credit Assignment in RL Training of LLMs. Amirhossein Kazemnejad, Milad Aghajohari, Eva Portelance, Alessandro Sordoni, Siva Reddy, Aaron Courville, Nicolas Le Roux
ICMLW 2024. Advantage Alignment Algorithms. Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, Tianyu Zhang, Aaron Courville
ICLR 2024. LOQA: Learning with Opponent Q-Learning Awareness. Milad Aghajohari, Juan Agustin Duque, Tim Cooijmans, Aaron Courville
NeurIPSW 2024. VinePPO: Accurate Credit Assignment in RL for LLM Mathematical Reasoning. Amirhossein Kazemnejad, Milad Aghajohari, Eva Portelance, Alessandro Sordoni, Siva Reddy, Aaron Courville, Nicolas Le Roux
ICMLW 2023. Learning with Learning Awareness Using Meta-Values. Tim Cooijmans, Milad Aghajohari, Aaron Courville
NeurIPS 2022. Riemannian Diffusion Models. Chin-Wei Huang, Milad Aghajohari, Joey Bose, Prakash Panangaden, Aaron C. Courville