ML Anthology
Authors
Search
About
Taylor, Jordan
5 publications
ICLR
2026
Obfuscated Activations Bypass LLM Latent-Space Defenses
Luke Bailey
,
Alex Serrano
,
Abhay Sheshadri
,
Mikhail Seleznyov
,
Jordan Taylor
,
Erik Jenner
,
Jacob Hilton
,
Stephen Casper
,
Carlos Guestrin
,
Scott Emmons
NeurIPS
2024
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning
Dan Braun
,
Jordan Taylor
,
Nicholas Goldowsky-Dill
,
Lee Sharkey
ICMLW
2024
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning
Dan Braun
,
Jordan Taylor
,
Nicholas Goldowsky-Dill
,
Lee Sharkey
ICLRW
2024
Learning to Abstract Visuomotor Mappings Using Meta-Reinforcement Learning
Carlos A. Velazquez-Vargas
,
Isaac Ray Christian
,
Jordan Taylor
,
Sreejan Kumar
ICMLW
2023
Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses
Logan Stapleton
,
Jordan Taylor
,
Sarah Fox
,
Tongshuang Wu
,
Haiyi Zhu