Richter, Leo

2 publications

ICLR 2025. An Auditing Test to Detect Behavioral Shift in Language Models. Leo Richter, Xuanli He, Pasquale Minervini, Matt Kusner
ICMLW 2024. An Auditing Test to Detect Behavioral Shift in Language Models. Leo Richter, Nitin Agrawal, Xuanli He, Pasquale Minervini, Matt Kusner