Shi, Weiyan

8 publications

TMLR 2026. SocialFusion: Addressing Social Degradation in Pre-Trained Vision-Language Models. Hamza Tahboub, Weiyan Shi, Gang Hua, Huaizu Jiang
NeurIPS 2025. LLMs Encode Harmfulness and Refusal Separately. Jiachen Zhao, Jing Huang, Zhengxuan Wu, David Bau, Weiyan Shi
AAAI 2025. Persuasion for Social Good: How to Build and Break AI. Weiyan Shi
NeurIPSW 2024. AI-Generated Content and Public Persuasion: The Limited Effect of AI Authorship Labels. Isabel O. Gallegos, Chen Shani, Weiyan Shi, Federico Bianchi, Robb Willer, Dan Jurafsky
ICML 2024. Position: A Safe Harbor for AI Evaluation and Red Teaming. Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Alex Pentland, Arvind Narayanan, Percy Liang, Peter Henderson
NeurIPS 2024. PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action. Yijia Shao, Tianshi Li, Weiyan Shi, Yanchen Liu, Diyi Yang
CVPR 2024. The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes. Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming Jin, Zhou Yu, Ruoxi Jia
AAAI 2020. End-to-End Trainable Non-Collaborative Dialog System. Yu Li, Kun Qian, Weiyan Shi, Zhou Yu