Taylor, Jordan

5 publications

ICLR 2026 Obfuscated Activations Bypass LLM Latent-Space Defenses Luke Bailey, Alex Serrano, Abhay Sheshadri, Mikhail Seleznyov, Jordan Taylor, Erik Jenner, Jacob Hilton, Stephen Casper, Carlos Guestrin, Scott Emmons
NeurIPS 2024 Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning Dan Braun, Jordan Taylor, Nicholas Goldowsky-Dill, Lee Sharkey
ICMLW 2024 Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning Dan Braun, Jordan Taylor, Nicholas Goldowsky-Dill, Lee Sharkey
ICLRW 2024 Learning to Abstract Visuomotor Mappings Using Meta-Reinforcement Learning Carlos A. Velazquez-Vargas, Isaac Ray Christian, Jordan Taylor, Sreejan Kumar
ICMLW 2023 Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses Logan Stapleton, Jordan Taylor, Sarah Fox, Tongshuang Wu, Haiyi Zhu