Gui, Tao

23 publications

AAAI 2025 Alleviating Shifted Distribution in Human Preference Alignment Through Meta-Learning Shihan Dou, Yan Liu, Enyu Zhou, Songyang Gao, Tianlong Li, Limao Xiong, Xin Zhao, Haoxiang Jia, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang
NeurIPS 2025 BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset Zhiheng Xi, Guanyu Li, YuTao Fan, Honglin Guo, Yufang Liu, Xiaoran Fan, Jiaqi Liu, Dingjinchao, Wangmeng Zuo, Zhenfei Yin, Lei Bai, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang
NeurIPS 2025 EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving Shihan Dou, Ming Zhang, Chenhao Huang, Jiayi Chen, Feng Chen, Shichun Liu, Yan Liu, Chenxiao Liu, Cheng Zhong, Zongzhang Zhang, Tao Gui, Chao Xin, Wei Chengzhi, Lin Yan, Qi Zhang, Xuanjing Huang
ICLR 2025 Have the VLMs Lost Confidence? a Study of Sycophancy in VLMs Shuo Li, Tao Ji, Xiaoran Fan, Linsheng Lu, Leyi Yang, Yuming Yang, Zhiheng Xi, Rui Zheng, Yuran Wang, Xh.Zhao, Tao Gui, Qi Zhang, Xuanjing Huang
NeurIPS 2025 INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning Wujian Peng, Lingchen Meng, Yitong Chen, Yiweng Xie, Yang Liu, Tao Gui, Hang Xu, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang
NeurIPS 2025 Pre-Trained Policy Discriminators Are General Reward Models Shihan Dou, Shichun Liu, Yuming Yang, Yicheng Zou, Yunhua Zhou, Shuhao Xing, Chenhao Huang, Qiming Ge, Haijun Lv, Demin Song, Songyang Gao, Chengqi Lyu, Enyu Zhou, Honglin Guo, Zhiheng Xi, Qipeng Guo, Wenwei Zhang, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Kai Chen
ICLR 2025 RMB: Comprehensively Benchmarking Reward Models in LLM Alignment Enyu Zhou, Guodong Zheng, Binghai Wang, Zhiheng Xi, Shihan Dou, Rong Bao, Wei Shen, Limao Xiong, Jessica Fan, Yurong Mou, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang
CVPR 2025 SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models Yongting Zhang, Lu Chen, Guodong Zheng, Yifeng Gao, Rui Zheng, Jinlan Fu, Zhenfei Yin, Senjie Jin, Yu Qiao, Xuanjing Huang, Feng Zhao, Tao Gui, Jing Shao
NeurIPS 2025 Understanding Parametric and Contextual Knowledge Reconciliation Within Large Language Models Jun Zhao, Yongzhuo Yang, Xiang Hu, Jingqi Tong, Yi Lu, Wei Wu, Tao Gui, Qi Zhang, Xuanjing Huang
ICLR 2024 Improving Generalization of Alignment with Human Preferences Through Group Invariant Learning Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang
AAAI 2024 LLMEval: A Preliminary Study on How to Evaluate Large Language Models Yue Zhang, Ming Zhang, Haipeng Yuan, Shichun Liu, Yongyao Shi, Tao Gui, Qi Zhang, Xuanjing Huang
ICML 2024 Training Large Language Models for Reasoning Through Reverse Curriculum Reinforcement Learning Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang
CVPR 2023 Correspondence Transformers with Asymmetric Feature Learning and Matching Flow Super-Resolution Yixuan Sun, Dongyang Zhao, Zhangyue Yin, Yiwen Huang, Tao Gui, Wenqiang Zhang, Weifeng Ge
NeurIPSW 2023 Delve into PPO: Implementation Matters for Stable RLHF Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang
IJCAI 2022 Searching for Optimal Subword Tokenization in Cross-Domain NER Ruotian Ma, Yiding Tan, Xin Zhou, Xuanting Chen, Di Liang, Sirui Wang, Wei Wu, Tao Gui
AAAI 2020 Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification with K-Means Features Tao Gui, Lizhi Qing, Qi Zhang, Jiacheng Ye, Hang Yan, Zichu Fei, Xuanjing Huang
IJCAI 2020 Leveraging Document-Level Label Consistency for Named Entity Recognition Tao Gui, Jiacheng Ye, Qi Zhang, Yaqian Zhou, Yeyun Gong, Xuanjing Huang
IJCAI 2019 CNN-Based Chinese NER with Lexicon Rethinking Tao Gui, Ruotian Ma, Qi Zhang, Lujun Zhao, Yu-Gang Jiang, Xuanjing Huang
AAAI 2019 Cooperative Multimodal Approach to Depression Detection in Twitter Tao Gui, Liang Zhu, Qi Zhang, Minlong Peng, Xu Zhou, Keyu Ding, Zhigang Chen
IJCAI 2019 Learning Task-Specific Representation for Novel Words in Sequence Labeling Minlong Peng, Qi Zhang, Xiaoyu Xing, Tao Gui, Jinlan Fu, Xuanjing Huang
AAAI 2019 Long Short-Term Memory with Dynamic Skip Connections Tao Gui, Qi Zhang, Lujun Zhao, Yaosong Lin, Minlong Peng, Jingjing Gong, Xuanjing Huang
AAAI 2019 Switch-LSTMs for Multi-Criteria Chinese Word Segmentation Jingjing Gong, Xinchi Chen, Tao Gui, Xipeng Qiu
AAAI 2019 Trainable Undersampling for Class-Imbalance Learning Minlong Peng, Qi Zhang, Xiaoyu Xing, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Keyu Ding, Zhigang Chen