Wu, Tongshuang

8 publications

NeurIPS 2025 Checklists Are Better than Reward Models for Aligning Language Models Vijay Viswanathan, Yanchao Sun, Xiang Kong, Meng Cao, Graham Neubig, Tongshuang Wu
AAAI 2025 Evaluating Mathematical Reasoning Beyond Accuracy Shijie Xia, Xuefeng Li, Yixin Liu, Tongshuang Wu, Pengfei Liu
IJCAI 2025 How to Teach Programming in the AI Era? Using LLMs as a Teachable Agent for Debugging (Extended Abstract) Qianou Ma, Hua Shen, Ken Koedinger, Tongshuang Wu
NeurIPSW 2024 HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation Zirui Wang, Xinran Zhao, Simon Stepputtis, Woojun Kim, Tongshuang Wu, Katia P. Sycara, Yaqi Xie
ICMLW 2024 WebCanvas: Benchmarking Web Agents in Online Environments Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu
ICMLW 2023 Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses Logan Stapleton, Jordan Taylor, Sarah Fox, Tongshuang Wu, Haiyi Zhu
IJCAI 2021 Beyond Accuracy: Behavioral Testing of NLP Models with Checklist (Extended Abstract) Marco Túlio Ribeiro, Tongshuang Wu, Carlos Guestrin, Sameer Singh