Xu, Baixuan

2 publications

ICLR 2026 NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents Tianshi Zheng, Kelvin Kiu Wai Tam, Newt Nguyen Kim Hue Nam, Baixuan Xu, Zhaowei Wang, Cheng Jiayang, Hong Ting Tsang, Weiqi Wang, Jiaxin Bai, Tianqing Fang, Yangqiu Song, Ginny Wong, Simon See
TMLR 2025 The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning Tianshi Zheng, Yixiang Chen, Chengxi Li, Chunyang Li, Qing Zong, Haochen Shi, Baixuan Xu, Yangqiu Song, Ginny Wong, Simon See