Shi, Weiyan

11 publications

ICLR 2026 GEM: A Gym for Generalist LLMs Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Simon Yu, Xiangxin Zhou, Haotian Xu, Shaopan Xiong, Bo Liu, Chenmien Tan, Weixun Wang, Hao Zhu, Weiyan Shi, Diyi Yang, Michael Qizhe Shieh, Yee Whye Teh, Wee Sun Lee, Min Lin
ICLR 2026 PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction for Continual Learning Simon Yu, Gang Li, Weiyan Shi, Peng Qi
ICLR 2026 SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning Bo Liu, Simon Yu, Zichen Liu, Leon Guertler, Penghui Qi, Daniel Balcells, Mickel Liu, Cheston Tan, Weiyan Shi, Min Lin, Wee Sun Lee, Natasha Jaques
TMLR 2026 SocialFusion: Addressing Social Degradation in Pre-Trained Vision-Language Models Hamza Tahboub, Weiyan Shi, Gang Hua, Huaizu Jiang
NeurIPS 2025 LLMs Encode Harmfulness and Refusal Separately Jiachen Zhao, Jing Huang, Zhengxuan Wu, David Bau, Weiyan Shi
AAAI 2025 Persuasion for Social Good: How to Build and Break AI Weiyan Shi
NeurIPSW 2024 AI-Generated Content and Public Persuasion: The Limited Effect of AI Authorship Labels Isabel O. Gallegos, Chen Shani, Weiyan Shi, Federico Bianchi, Robb Willer, Dan Jurafsky
ICML 2024 Position: A Safe Harbor for AI Evaluation and Red Teaming Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Alex Pentland, Arvind Narayanan, Percy Liang, Peter Henderson
NeurIPS 2024 PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action Yijia Shao, Tianshi Li, Weiyan Shi, Yanchen Liu, Diyi Yang
CVPR 2024 The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming Jin, Zhou Yu, Ruoxi Jia
AAAI 2020 End-to-End Trainable Non-Collaborative Dialog System Yu Li, Kun Qian, Weiyan Shi, Zhou Yu