Corlouer, Guillaume

2 publications

ICMLW 2024 An Information-Theoretic Study of Lying in LLMs Ann-Kathrin Dombrowski, Guillaume Corlouer
NeurIPSW 2023 Linearly Structured World Representations in Maze-Solving Transformers Michael Ivanitskiy, Alexander F Spies, Tilman Räuker, Guillaume Corlouer, Christopher Mathwin, Lucia Quirke, Can Rager, Rusheb Shah, Dan Valentine, Cecilia Diniz Behn, Katsumi Inoue, Samy Wu Fung