ML Anthology
Authors
Search
About
Richter, Leo
2 publications
ICLR
2025
An Auditing Test to Detect Behavioral Shift in Language Models
Leo Richter
,
Xuanli He
,
Pasquale Minervini
,
Matt Kusner
ICMLW
2024
An Auditing Test to Detect Behavioral Shift in Language Models
Leo Richter
,
Nitin Agrawal
,
Xuanli He
,
Pasquale Minervini
,
Matt Kusner