ML Anthology
Authors
Search
About
Neo, Clement
4 publications
NeurIPS
2025
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
Cameron Tice
,
Philipp Alexander Kreer
,
Nathan Helm-Burger
,
Prithviraj Singh Shahani
,
Fedor Ryzhenkov
,
Fabien Roger
,
Clement Neo
,
Jacob Haimes
,
Felix Hofstätter
,
Teun van der Weij
ICLR
2025
Towards Interpreting Visual Information Processing in Vision-Language Models
Clement Neo
,
Luke Ong
,
Philip Torr
,
Mor Geva
,
David Krueger
,
Fazl Barez
ICLR
2025
Turning up the Heat: Min-P Sampling for Creative and Coherent LLM Outputs
Nguyen Nhat Minh
,
Andrew Baker
,
Clement Neo
,
Allen G Roush
,
Andreas Kirsch
,
Ravid Shwartz-Ziv
NeurIPS
2024
Interpreting Learned Feedback Patterns in Large Language Models
Luke Marks
,
Amir Abdullah
,
Clement Neo
,
Rauno Arike
,
David Krueger
,
Philip Torr
,
Fazl Barez