Jiao, Jianpeng
4 publications
ICLR
2026
DiscoX: Benchmarking Discourse-Level Translation in Expert Domains
Xiying Zhao, Zhoufutu Wen, Zhixuan Chen, Jingzhe Ding, Jianpeng Jiao, Shuai Li, Xi Li, Danni Liang, Shengda Long, Qianqian Liu, Xianbo Wu, Hongwan Gao, Xiang Gao, Liang Hu, Jiashuo Liu, Liumengyun, Weiran Shi, Chenghao Yang, Qianyu Yang, Xuanliang Zhang, Ge Zhang, Wenhao Huang ICLR
2026
FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
Liang Hu, Jianpeng Jiao, Jiashuo Liu, Dongyuan Mutu, Yanle Ren, Zhoufutu Wen, Kaiyuan Zhang, Xuanliang Zhang, Xiang Gao, Tianci He, Fei Hu, Yali Liao, Zaiyuan Wang, Jingkai Liu, Sun Daibin, Ziqing Zeng, Zhiyuan Zeng, Chenghao Yang, Qianyu Yang, Mingren Yin, Ge Zhang, Xinyi Zhang, Xiying Zhao, Zhu Zhenwei, Hongseok Namkoong, Wenhao Huang ICLR
2026
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
Zhiyuan Zeng, Jiashuo Liu, Siyuan Chen, Tianci He, Yali Liao, Yixiao Tian, Wangjinpeng.Levi, Zaiyuan Wang, YangYang, Lingyue Yin, Mingren Yin, Zhu Zhenwei, Tianle Cai, Xinjie Chen, Zehui Chen, Jiecao Chen, Yantao Du, Xiang Gao, Jiacheng Guo, Liang Hu, Jianpeng Jiao, Xiangsheng Li, Jingkai Liu, Nishuang, Zhoufutu Wen, Ge Zhang, Kaiyuan Zhang, 周欣, Jose Blanchet, Xipeng Qiu, Mengdi Wang, Wenhao Huang ICLR
2026
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?
Qinyan Zhang, Xinping Lei, Ruijie Miao, Fu Yu, Haojie Fan, Le Chang, Jiafan Hou, Dingling Zhang, Zhongfei Hou, ZiqiangYang, Puchangxin, Fei Hu, Jingkai Liu, Jiaheng Liu, Tong Yang, Zaiyuan Wang, Ge Zhang, Xinjie Chen, Jianpeng Jiao, Wenhao Huang