Troll, Rajan

1 publications

ICLR 2025 Scaling and Evaluating Sparse Autoencoders Leo Gao, Tom Dupre la Tour, Henk Tillman, Gabriel Goh, Rajan Troll, Alec Radford, Ilya Sutskever, Jan Leike, Jeffrey Wu