ML Anthology
Authors
Search
About
Binder, Felix Jedidja
2 publications
ICLR
2025
Looking Inward: Language Models Can Learn About Themselves by Introspection
Felix Jedidja Binder
,
James Chua
,
Tomek Korbak
,
Henry Sleight
,
John Hughes
,
Robert Long
,
Ethan Perez
,
Miles Turpin
,
Owain Evans
ICLRW
2024
Lessons Learned in the Study of Representational Alignment in Physical Reasoning
Felix Jedidja Binder
,
Rahul Mysore Venkatesh
,
Daniel LK Yamins
,
Judith E Fan