Miller, Alexander H

8 publications

NeurIPS 2025 AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-Bench Edan Toledo, Karen Hambardzumyan, Martin Josifoski, Rishi Hazra, Nicolas Baldwin, Alexis Audran-Reiss, Michael Kuchnik, Despoina Magka, Minqi Jiang, Alisia Maria Lupidi, Andrei Lupu, Roberta Raileanu, Tatiana Shavrina, Kelvin Niu, Jean-Christophe Gagnon-Audet, Michael Shvartsman, Shagun Sodhani, Alexander H Miller, Abhishek Charnalia, Derek Dunfield, Carole-Jean Wu, Pontus Stenetorp, Nicola Cancedda, Jakob Nicolaus Foerster, Yoram Bachrach
TMLR 2025 Scaling and Distilling Transformer Models for sEMG Nick Mehlman, Jean-Christophe Gagnon-Audet, Michael Shvartsman, Kelvin Niu, Alexander H Miller, Shagun Sodhani
NeurIPS 2025 The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements Bingchen Zhao, Despoina Magka, Minqi Jiang, Xian Li, Roberta Raileanu, Tatiana Shavrina, Jean-Christophe Gagnon-Audet, Kelvin Niu, Shagun Sodhani, Michael Shvartsman, Andrei Lupu, Alisia Maria Lupidi, Karen Hambardzumyan, Martin Josifoski, Edan Toledo, Thomas Foster, Lucia Cipolina-Kun, Derek Dunfield, Abhishek Charnalia, Alexander H Miller, Oisin Mac Aodha, Jakob Nicolaus Foerster, Yoram Bachrach
ICLR 2023 Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning Anton Bakhtin, David J Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H Miller, Noam Brown
NeurIPSW 2022 Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning Anton Bakhtin, David J Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H Miller, Noam Brown
ICLR 2017 Dialogue Learning with Human-in-the-Loop Jiwei Li, Alexander H. Miller, Sumit Chopra, Marc'Aurelio Ranzato, Jason Weston
ICLR 2017 Learning Through Dialogue Interactions by Asking Questions Jiwei Li, Alexander H. Miller, Sumit Chopra, Marc'Aurelio Ranzato, Jason Weston
ICLR 2016 Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems Jesse Dodge, Andreea Gane, Xiang Zhang, Antoine Bordes, Sumit Chopra, Alexander H. Miller, Arthur Szlam, Jason Weston