Jiayang, Cheng

4 publications

ICLR 2026 AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations Cheng Jiayang, Dongyu Ru, Lin Qiu, Yiyang Li, Xuezhi Cao, Yangqiu Song, Xunliang Cai
ICLR 2026 NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents Tianshi Zheng, Kelvin Kiu Wai Tam, Newt Nguyen Kim Hue Nam, Baixuan Xu, Zhaowei Wang, Cheng Jiayang, Hong Ting Tsang, Weiqi Wang, Jiaxin Bai, Tianqing Fang, Yangqiu Song, Ginny Wong, Simon See
NeurIPS 2024 Can Language Models Learn to Skip Steps? Tengxiao Liu, Qipeng Guo, Xiangkun Hu, Cheng Jiayang, Yue Zhang, Xipeng Qiu, Zheng Zhang
NeurIPS 2024 RAGChecker: A Fine-Grained Framework for Diagnosing Retrieval-Augmented Generation Dongyu Ru, Lin Qiu, Xiangkun Hu, Tianhang Zhang, Peng Shi, Shuaichen Chang, Cheng Jiayang, Cunxiang Wang, Shichao Sun, Huanyu Li, Zizhao Zhang, Binjie Wang, Jiarong Jiang, Tong He, Zhiguo Wang, Pengfei Liu, Yue Zhang, Zheng Zhang