Wang, Jeffrey George

2 publications

ICLR 2026 Persona Features Control Emergent Misalignment Miles Wang, Tom Dupre la Tour, Olivia Watkins, Aleksandar Makelov, Ryan Andrew Chi, Samuel Miserendino, Jeffrey George Wang, Achyuta Rajaram, Johannes Heidecke, Tejal Patwardhan, Daniel P Mossing
ICMLW 2024 Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models Sahil Kuchlous, Marvin Li, Jeffrey George Wang