Nabeshima, Noa

4 publications

ICML 2025 Learning Multi-Level Features with Matryoshka Sparse Autoencoders Bart Bussmann, Noa Nabeshima, Adam Karvonen, Neel Nanda
NeurIPS 2025 Parameterized Synthetic Text Generation with SimpleStories Lennart Finke, Chandan Sreedhara, Thomas Dooms, Mat Allen, Juan Diego Rodriguez, Noa Nabeshima, Thomas Marshall, Dan Braun
ICLRW 2025 [Tiny] Parameterized Synthetic Text Generation with SimpleStories Lennart Finke, Thomas Dooms, Mat Allen, Juan Diego Rodriguez, Noa Nabeshima, Dan Braun
NeurIPS 2022 Adversarial Training for High-Stakes Reliability Daniel Ziegler, Seraphina Nix, Lawrence Chan, Tim Bauman, Peter Schmidt-Nielsen, Tao Lin, Adam Scherlis, Noa Nabeshima, Benjamin Weinstein-Raun, Daniel de Haas, Buck Shlegeris, Nate Thomas