Bankes, William

5 publications

NeurIPS 2025 Detecting High-Stakes Interactions with Activation Probes Alex McKenzie, Urja Pawar, Phil Blandfort, William Bankes, David Krueger, Ekdeep Singh Lubana, Dmitrii Krasheninnikov
ICML 2025 Right Now, Wrong Then: Non-Stationary Direct Preference Optimization Under Preference Drift Seongho Son, William Bankes, Sayak Ray Chowdhury, Brooks Paige, Ilija Bogunovic
NeurIPSW 2024 Group Robust Best-of-K Decoding of Language Models for Pluralistic Alignment Sangwoong Yoon, William Bankes, Seongho Son, Anja Petrovic, Shyam Sundhar Ramesh, Xiaohang Tang, Ilija Bogunovic
NeurIPS 2024 REDUCR: Robust Data Downsampling Using Class Priority Reweighting William Bankes, George Hughes, Ilija Bogunovic, Zi Wang
NeurIPSW 2023 REDUCR: Robust Data Downsampling Using Class Priority Reweighting William Bankes, George Hughes, Ilija Bogunovic, Zi Wang