Xu, Yin

3 publications

ICLR 2026 ChemEval: A Multi-Level and Fine-Grained Chemical Capability Evaluation for Large Language Models Yuqing Huang, Rongyang Zhang, Xuesong He, Xuyang Zhi, Hao Wang, Nuo Chen, Zongbo Liu, Xin Li, Feiyang Xu, Deguang Liu, Huadong Liang, YiLi, Jian Cui, Yin Xu, Shijin Wang, Qi Liu, Defu Lian, Guiquan Liu, Enhong Chen
NeurIPS 2025 DrVD-Bench: Do Vision-Language Models Reason like Human Doctors in Medical Image Diagnosis? Tianhong Zhou, Yin Xu, Yingtao Zhu, Chuxi Xiao, Haiyang Bian, Lei Wei, Xuegong Zhang
NeurIPS 2025 RAG-IGBench: Innovative Evaluation for RAG-Based Interleaved Generation in Open-Domain Question Answering Rongyang Zhang, Yuqing Huang, Chengqiang Lu, Qimeng Wang, Yan Gao, Yiwu, Yao Hu, Yin Xu, Wei Wang, Hao Wang, Enhong Chen