Slack, Dylan

5 publications

NeurIPS 2024. A Careful Examination of Large Language Model Performance on Grade School Arithmetic. Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Charlotte Zhuang, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue.
NeurIPS 2024. Learning Goal-Conditioned Representations for Language Reward Models. Vaskar Nath, Dylan Slack, Jeff Da, Yuntao Ma, Hugh Zhang, Spencer Whitehead, Sean Hendryx.
NeurIPS 2023. Post Hoc Explanations of Language Models Can Improve Language Models. Satyapriya Krishna, Jiaqi Ma, Dylan Slack, Asma Ghandeharioun, Sameer Singh, Himabindu Lakkaraju.
NeurIPS 2021. Counterfactual Explanations Can Be Manipulated. Dylan Slack, Anna Hilgard, Himabindu Lakkaraju, Sameer Singh.
NeurIPS 2021. Reliable Post Hoc Explanations: Modeling Uncertainty in Explainability. Dylan Slack, Anna Hilgard, Sameer Singh, Himabindu Lakkaraju.