Bai, Xiaoyan

3 publications

NeurIPS 2025 Concept Incongruence: An Exploration of Time and Death in Role Playing Xiaoyan Bai, Ike Peng, Aditya Singh, Chenhao Tan

ICML 2024 A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity Andrew Lee, Xiaoyan Bai, Itamar Pres, Martin Wattenberg, Jonathan K. Kummerfeld, Rada Mihalcea

NeurIPS 2024 Learn to Be Efficient: Build Structured Sparsity in Large Language Models Haizhong Zheng, Xiaoyan Bai, Xueshen Liu, Z. Morley Mao, Beidi Chen, Fan Lai, Atul Prakash