ML Anthology
Authors
Search
About
Fish, Kyle
1 publications
ICLR
2026
LitmusValues: Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
Yu Ying Chiu
,
Zhilin Wang
,
Sharan Maiya
,
Yejin Choi
,
Kyle Fish
,
Sydney Levine
,
Evan J Hubinger