ML Anthology
Authors
Search
About
Bakhtin, Anton
11 publications
ICLR
2023
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Anton Bakhtin
,
David J Wu
,
Adam Lerer
,
Jonathan Gray
,
Athul Paul Jacob
,
Gabriele Farina
,
Alexander H Miller
,
Noam Brown
NeurIPSW
2022
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Anton Bakhtin
,
David J Wu
,
Adam Lerer
,
Jonathan Gray
,
Athul Paul Jacob
,
Gabriele Farina
,
Alexander H Miller
,
Noam Brown
ICML
2022
Modeling Strong and Human-like Gameplay with KL-Regularized Search
Athul Paul Jacob
,
David J Wu
,
Gabriele Farina
,
Adam Lerer
,
Hengyuan Hu
,
Anton Bakhtin
,
Jacob Andreas
,
Noam Brown
ICLRW
2022
Modeling Strong and Human-like Gameplay with KL-Regularized Search
Athul Paul Jacob
,
David J Wu
,
Gabriele Farina
,
Adam Lerer
,
Hengyuan Hu
,
Anton Bakhtin
,
Jacob Andreas
,
Noam Brown
NeurIPS
2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
,
Samuel Sokota
,
David Wu
,
Anton Bakhtin
,
Andrei Lupu
,
Brandon Cui
,
Jakob Foerster
ICLR
2021
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Jonathan Gray
,
Adam Lerer
,
Anton Bakhtin
,
Noam Brown
NeurIPS
2021
No-Press Diplomacy from Scratch
Anton Bakhtin
,
David Wu
,
Adam Lerer
,
Noam Brown
JMLR
2021
Residual Energy-Based Models for Text
Anton Bakhtin
,
Yuntian Deng
,
Sam Gross
,
Myle Ott
,
Marc'Aurelio Ranzato
,
Arthur Szlam
NeurIPS
2020
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Noam Brown
,
Anton Bakhtin
,
Adam Lerer
,
Qucheng Gong
ICLR
2020
Residual Energy-Based Models for Text Generation
Yuntian Deng
,
Anton Bakhtin
,
Myle Ott
,
Arthur Szlam
,
Marc'Aurelio Ranzato
NeurIPS
2019
PHYRE: A New Benchmark for Physical Reasoning
Anton Bakhtin
,
Laurens van der Maaten
,
Justin Johnson
,
Laura Gustafson
,
Ross Girshick