Stanley, Jason

4 publications

ICLR 2026 No, of Course I Can! Deeper Fine-Tuning Attacks That Bypass Token-Level Safety Mechanisms Joshua Kazdan, Abhay Puri, Rylan Schaeffer, Lisa Yu, Chris Cundy, Jason Stanley, Sanmi Koyejo, Krishnamurthy Dj Dvijotham
TMLR 2025 LitLLMs, LLMs for Literature Review: Are We There yet? Shubham Agarwal, Gaurav Sahu, Abhay Puri, Issam H. Laradji, Krishnamurthy Dj Dvijotham, Jason Stanley, Laurent Charlin, Christopher Pal
ICLRW 2025 Societal Alignment Frameworks Can Improve LLM Alignment Karolina Stanczak, Nicholas Meade, Mehar Bhatia, Hattie Zhou, Konstantin Böttinger, Jeremy Barnes, Jason Stanley, Jessica Montgomery, Richard Zemel, Nicolas Papernot, Nicolas Chapados, Denis Therien, Timothy P Lillicrap, Ana Marasovic, Sylvie Delacroix, Gillian K Hadfield, Siva Reddy
UAI 2011 Active Diagnosis via AUC Maximization: An Efficient Approach for Multiple Fault Identification in Large Scale, Noisy Networks Gowtham Bellala, Jason Stanley, Clayton Scott, Suresh K. Bhavnani