Zhao, Stephen

4 publications

NeurIPS 2025 Reducing the Probability of Undesirable Outputs in Language Models Using Probabilistic Inference Stephen Zhao, Aidan Li, Rob Brekelmans, Roger Baker Grosse
ICML 2024 Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo Stephen Zhao, Rob Brekelmans, Alireza Makhzani, Roger Baker Grosse
NeurIPS 2022 Proximal Learning with Opponent-Learning Awareness Stephen Zhao, Chris Lu, Roger B Grosse, Jakob Foerster
ICML 2020 Maximum Entropy Gain Exploration for Long Horizon Multi-Goal Reinforcement Learning Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie, Jimmy Ba