Zhang, Hugh

8 publications

ICLR 2025. Planning in Natural Language Improves LLM Search for Code Generation. Evan Z Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, William Song, Vaskar Nath, Ziwen Han, Sean M. Hendryx, Summer Yue, Hugh Zhang
NeurIPS 2024. A Careful Examination of Large Language Model Performance on Grade School Arithmetic. Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Charlotte Zhuang, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue
NeurIPSW 2024. LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet. Nathaniel Li, Ziwen Han, Ian Steneker, Willow E. Primack, Riley Goodside, Hugh Zhang, Zifan Wang, Cristina Menghini, Summer Yue
NeurIPS 2024. Learning Goal-Conditioned Representations for Language Reward Models. Vaskar Nath, Dylan Slack, Jeff Da, Yuntao Ma, Hugh Zhang, Spencer Whitehead, Sean Hendryx
NeurIPSW 2024. Planning in Natural Language Improves LLM Search for Code Generation. Evan Z Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, William Song, Vaskar Nath, Ziwen Han, Sean M. Hendryx, Summer Yue, Hugh Zhang
ICML 2024. Q-Probe: A Lightweight Approach to Reward Maximization for Language Models. Kenneth Li, Samy Jelassi, Hugh Zhang, Sham M. Kakade, Martin Wattenberg, David Brandfonbrener
NeurIPSW 2023. Chain-of-Thought Reasoning Is a Policy Improvement Operator. Hugh Zhang, David Parkes
AAAI 2022. Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization. Hugh Zhang, Adam Lerer, Noam Brown