Sun, Maosong

93 publications

TMLR 2026 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu, Shengding Hu, Yuxuan Song, Yufei Huang, Mingxuan Wang, Hao Zhou, Zhiyuan Liu, Wei-Ying Ma, Maosong Sun
ICLR 2025 A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules Kairong Luo, Haodong Wen, Shengding Hu, Zhenbo Sun, Zhiyuan Liu, Maosong Sun, Kaifeng Lyu, Wenguang Chen
NeurIPS 2025 A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings Xiaoang Xu, Shuo Wang, Xu Han, Zhenghao Liu, Huijia Wu, Pei Pei Li, Zhiyuan Liu, Maosong Sun, Zhaofeng He
ICLRW 2025 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu, Shengding Hu, Yuxuan Song, Yufei Huang, Mingxuan Wang, Hao Zhou, Zhiyuan Liu, Wei-Ying Ma, Maosong Sun
CVPR 2025 AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization Yiyang Du, Xiaochen Wang, Chi Chen, Jiabo Ye, Yiru Wang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Zhifang Sui, Maosong Sun, Yang Liu
ICLR 2025 Advancing LLM Reasoning Generalists with Preference Trees Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Boji Shan, Zeyuan Liu, Jia Deng, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun
NeurIPS 2025 DCAD-2000: A Multilingual Dataset Across 2000+ Languages with Data Cleaning as Anomaly Detection Wen Lai, Yingli Shen, Shuo Wang, Xueren Zhang, Kangyang Luo, Alexander Fraser, Maosong Sun
ICLR 2025 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun
NeurIPS 2025 Multi-Agent Collaboration via Evolving Orchestration Yufan Dang, Chen Qian, Xueheng Luo, Jingru Fan, Zihao Xie, Ruijie Shi, Weize Chen, Cheng Yang, Xiaoyin Che, Ye Tian, Xuantang Xiong, Lei Han, Zhiyuan Liu, Maosong Sun
IJCAI 2025 NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms Yashan Wang, Shangda Wu, Jianhuai Hu, Xingjian Du, Yueqi Peng, Yongxin Huang, Shuai Fan, Xiaobing Li, Feng Yu, Maosong Sun
NeurIPS 2025 ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation Pengcheng Huang, Zhenghao Liu, Yukun Yan, Haiyan Zhao, Xiaoyuan Yi, Hao Chen, Zhiyuan Liu, Maosong Sun, Tong Xiao, Ge Yu, Chenyan Xiong
ICLR 2025 Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance Yaxi Lu, Shenzhi Yang, Cheng Qian, Guirong Chen, Qinyu Luo, Yesai Wu, Huadong Wang, Xin Cong, Zhong Zhang, Yankai Lin, Weiwen Liu, Yasheng Wang, Zhiyuan Liu, Fangming Liu, Maosong Sun
ICLR 2025 RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards Xinze Li, Sen Mei, Zhenghao Liu, Yukun Yan, Shuo Wang, Shi Yu, Zheni Zeng, Hao Chen, Ge Yu, Zhiyuan Liu, Maosong Sun, Chenyan Xiong
CVPR 2025 RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness Tianyu Yu, Haoye Zhang, Qiming Li, Qixin Xu, Yuan Yao, Da Chen, Xiaoman Lu, Ganqu Cui, Yunkai Dang, Taiwen He, Xiaocheng Feng, Jun Song, Bo Zheng, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun
ICLR 2025 Rational Decision-Making Agent with Learning Internal Utility Judgment Yining Ye, Xin Cong, Shizuo Tian, Yujia Qin, Chong Liu, Yankai Lin, Zhiyuan Liu, Maosong Sun
ICLR 2025 Scaling Large Language Model-Based Multi-Agent Collaboration Chen Qian, Zihao Xie, YiFei Wang, Wei Liu, Kunlun Zhu, Hanchen Xia, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun
ICML 2025 Sparsing Law: Towards Large Language Models with Greater Activation Sparsity Yuqi Luo, Chenyang Song, Xu Han, Yingfa Chen, Chaojun Xiao, Xiaojun Meng, Liqun Deng, Jiansheng Wei, Zhiyuan Liu, Maosong Sun
NeurIPS 2025 The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training Weize Chen, Jiarui Yuan, Tailin Jin, Ning Ding, Huimin Chen, Zhiyuan Liu, Maosong Sun
ICLR 2025 VisRAG: Vision-Based Retrieval-Augmented Generation on Multi-Modality Documents Shi Yu, Chaoyue Tang, Bokai Xu, Junbo Cui, Junhao Ran, Yukun Yan, Zhenghao Liu, Shuo Wang, Xu Han, Zhiyuan Liu, Maosong Sun
ICLR 2025 WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models Shengda Fan, Xin Cong, Yuepeng Fu, Zhong Zhang, Shuyan Zhang, Yuanwei Liu, Yesai Wu, Yankai Lin, Zhiyuan Liu, Maosong Sun
CVPR 2025 XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery? Fengxiang Wang, Hongzhen Wang, Zonghao Guo, Di Wang, Yulin Wang, Mingshuo Chen, Qiang Ma, Long Lan, Wenjing Yang, Jing Zhang, Zhiyuan Liu, Maosong Sun
NeurIPSW 2024 A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules Kairong Luo, Haodong Wen, Shengding Hu, Zhenbo Sun, Zhiyuan Liu, Maosong Sun, Kaifeng Lyu, Wenguang Chen
ICMLW 2024 Advancing LLM Reasoning Generalists with Preference Trees Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun
ICLR 2024 AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou
NeurIPS 2024 Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models Xin Li, Weize Chen, Qizhi Chu, Haopeng Li, Zhaojun Sun, Ran Li, Chen Qian, Yiwei Wei, Zhiyuan Liu, Chuan Shi, Maosong Sun, Cheng Yang
NeurIPS 2024 Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models Bowen Ping, Shuo Wang, Hanqing Wang, Xu Han, Yuzhuang Xu, Yukun Yan, Yun Chen, Baobao Chang, Zhiyuan Liu, Maosong Sun
TMLR 2024 Exploring Format Consistency for Instruction Tuning Shihao Liang, Runchu Tian, Kunlun Zhu, Yujia Qin, Huadong Wang, Xin Cong, Zhiyuan Liu, Xiaojiang Liu, Maosong Sun
ICML 2024 Exploring the Benefit of Activation Sparsity in Pre-Training Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai Lin, Zhiyuan Zeng, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie Zhou
NeurIPS 2024 InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory Chaojun Xiao, Pengle Zhang, Xu Han, Guangxuan Xiao, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu, Maosong Sun
ICMLW 2024 InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory Chaojun Xiao, Pengle Zhang, Xu Han, Guangxuan Xiao, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu, Maosong Sun
ICMLW 2024 LEGENT: Open Platform for Embodied Agents Zhili Cheng, Jinyi Hu, Zhitong Wang, Yuge Tu, Shengding Hu, An Liu, Pengkai Li, Lei Shi, Zhiyuan Liu, Maosong Sun
ICLR 2024 Large Multilingual Models Pivot Zero-Shot Multimodal Learning Across Languages Jinyi Hu, Yuan Yao, Chongyi Wang, Shan Wang, Yinxu Pan, Qianyu Chen, Tianyu Yu, Hanghao Wu, Yue Zhao, Haoye Zhang, Xu Han, Yankai Lin, Jiao Xue, Dahai Li, Zhiyuan Liu, Maosong Sun
IJCAI 2024 On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie
ICLR 2024 Predicting Emergent Abilities with Infinite Resolution Evaluation Shengding Hu, Xin Liu, Xu Han, Xinrong Zhang, Chaoqun He, Weilin Zhao, Yankai Lin, Ning Ding, Zebin Ou, Guoyang Zeng, Zhiyuan Liu, Maosong Sun
CVPR 2024 RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-Tao Zheng, Maosong Sun, Tat-Seng Chua
ICLR 2024 ToolLLM: Facilitating Large Language Models to Master 16000+ Real-World APIs Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Lauren Hong, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun
ICML 2024 ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback Ganqu Cui, Lifan Yuan, Ning Ding, Guanming Yao, Bingxiang He, Wei Zhu, Yuan Ni, Guotong Xie, Ruobing Xie, Yankai Lin, Zhiyuan Liu, Maosong Sun
NeurIPS 2023 C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models Yuzhen Huang, Yuzhuo Bai, Zhihao Zhu, Junlei Zhang, Jinghan Zhang, Tangjun Su, Junteng Liu, Chuancheng Lv, Yikai Zhang, Jiayi Lei, Yao Fu, Maosong Sun, Junxian He
NeurIPS 2023 H3T: Efficient Integration of Memory Optimization and Parallelism for Large-Scale Transformer Training Yuzhong Wang, Xu Han, Weilin Zhao, Guoyang Zeng, Zhiyuan Liu, Maosong Sun
NeurIPS 2023 Revisiting Out-of-Distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations Lifan Yuan, Yangyi Chen, Ganqu Cui, Hongcheng Gao, FangYuan Zou, Xingyi Cheng, Heng Ji, Zhiyuan Liu, Maosong Sun
AAAI 2023 Visually Grounded Commonsense Knowledge Acquisition Yuan Yao, Tianyu Yu, Ao Zhang, Mengdi Li, Ruobing Xie, Cornelius Weber, Zhiyuan Liu, Hai-Tao Zheng, Stefan Wermter, Tat-Seng Chua, Maosong Sun
NeurIPS 2022 A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks Ganqu Cui, Lifan Yuan, Bingxiang He, Yangyi Chen, Zhiyuan Liu, Maosong Sun
ECCV 2022 Fine-Grained Scene Graph Generation with Data Transfer Ao Zhang, Yuan Yao, Qianyu Chen, Wei Ji, Zhiyuan Liu, Maosong Sun, Tat-Seng Chua
NeurIPS 2022 Moderate-Fitting as a Natural Backdoor Defender for Pre-Trained Language Models Biru Zhu, Yujia Qin, Ganqu Cui, Yangyi Chen, Weilin Zhao, Chong Fu, Yangdong Deng, Zhiyuan Liu, Jingang Wang, Wei Wu, Maosong Sun, Ming Gu
NeurIPS 2022 Sparse Structure Search for Delta Tuning Shengding Hu, Zhen Zhang, Ning Ding, Yadao Wang, Yasheng Wang, Zhiyuan Liu, Maosong Sun
AAAI 2021 Adversarial Language Games for Advanced Natural Language Intelligence Yuan Yao, Haoxi Zhong, Zhengyan Zhang, Xu Han, Xiaozhi Wang, Kai Zhang, Chaojun Xiao, Guoyang Zeng, Zhiyuan Liu, Maosong Sun
AAAI 2021 Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework Huimin Chen, Yankai Lin, Fanchao Qi, Jinyi Hu, Peng Li, Jie Zhou, Maosong Sun
ICMLW 2021 Red Alarm for Pre-Trained Models: Universal Vulnerability to Neuron-Level Backdoor Attacks Zhengyan Zhang, Guangxuan Xiao, Yongwei Li, Tian Lv, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Xin Jiang, Maosong Sun
ICCV 2021 Visual Distant Supervision for Scene Graph Generation Yuan Yao, Ao Zhang, Xu Han, Mengdi Li, Cornelius Weber, Zhiyuan Liu, Stefan Wermter, Maosong Sun
AAAI 2020 Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction Haoxi Zhong, Yuzhong Wang, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu, Maosong Sun
AAAI 2020 JEC-QA: A Legal-Domain Question Answering Dataset Haoxi Zhong, Chaojun Xiao, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu, Maosong Sun
AAAI 2020 MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun
IJCAI 2020 Modeling Voting for System Combination in Machine Translation Xuancheng Huang, Jiacheng Zhang, Zhixing Tan, Derek F. Wong, Huanbo Luan, Jingfang Xu, Maosong Sun, Yang Liu
AAAI 2020 Multi-Channel Reverse Dictionary Model Lei Zheng, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun
AAAI 2020 Neural Snowball for Few-Shot Relation Learning Tianyu Gao, Xu Han, Ruobing Xie, Zhiyuan Liu, Fen Lin, Leyu Lin, Maosong Sun
IJCAI 2020 Text Style Transfer via Learning Style Instance Supported Latent Space Xiaoyuan Yi, Zhenghao Liu, Wenhao Li, Maosong Sun
AAAI 2020 Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets Fanchao Qi, Liang Chang, Maosong Sun, Sicong Ouyang, Zhiyuan Liu
NeurIPS 2020 Towards Interpretable Natural Language Understanding with Explanations as Latent Variables Wangchunshu Zhou, Jinyi Hu, Hanlin Zhang, Xiaodan Liang, Maosong Sun, Chenyan Xiong, Jian Tang
IJCAI 2019 Enhancing Stock Movement Prediction with Adversarial Training Fuli Feng, Huimin Chen, Xiangnan He, Ji Ding, Maosong Sun, Tat-Seng Chua
AAAI 2019 Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification Tianyu Gao, Xu Han, Zhiyuan Liu, Maosong Sun
IJCAI 2019 Multi-Scale Information Diffusion Prediction with Reinforced Recurrent Networks Cheng Yang, Jian Tang, Maosong Sun, Ganqu Cui, Zhiyuan Liu
IJCAI 2019 Sentiment-Controllable Chinese Poetry Generation Huimin Chen, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, Zhipeng Guo
NeurIPS 2018 Bandit Learning with Implicit Feedback Yi Qi, Qingyun Wu, Hongning Wang, Jie Tang, Maosong Sun
AAAI 2018 Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention Xiangkai Zeng, Cheng Yang, Cunchao Tu, Zhiyuan Liu, Maosong Sun
IJCAI 2018 Chinese Poetry Generation with a Working Memory Model Xiaoyuan Yi, Maosong Sun, Ruoyu Li, Zonghan Yang
AAAI 2018 Improving Neural Fine-Grained Entity Typing with Knowledge Attention Ji Xin, Yankai Lin, Zhiyuan Liu, Maosong Sun
AAAI 2018 Neural Knowledge Acquisition via Mutual Attention Between Knowledge Graph and Text Xu Han, Zhiyuan Liu, Maosong Sun
AAAI 2017 Bilingual Lexicon Induction from Non-Parallel Data with Minimal Supervision Meng Zhang, Haoruo Peng, Yang Liu, Huan-Bo Luan, Maosong Sun
IJCAI 2017 Fast Network Embedding Enhancement via High Order Proximity Approximation Cheng Yang, Maosong Sun, Zhiyuan Liu, Cunchao Tu
IJCAI 2017 Image-Embodied Knowledge Representation Learning Ruobing Xie, Zhiyuan Liu, Huanbo Luan, Maosong Sun
IJCAI 2017 Iterative Entity Alignment via Joint Knowledge Embeddings Hao Zhu, Ruobing Xie, Zhiyuan Liu, Maosong Sun
IJCAI 2017 Joint Training for Pivot-Based Neural Machine Translation Yong Cheng, Qian Yang, Yang Liu, Maosong Sun, Wei Xu
IJCAI 2017 Lexical Sememe Prediction via Word Embeddings and Matrix Factorization Ruobing Xie, Xingchi Yuan, Zhiyuan Liu, Maosong Sun
IJCAI 2017 TransNet: Translation-Based Network Representation Learning for Social Relation Extraction Cunchao Tu, Zhengyan Zhang, Zhiyuan Liu, Maosong Sun
IJCAI 2016 Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation Yong Cheng, Shiqi Shen, Zhongjun He, Wei He, Hua Wu, Maosong Sun, Yang Liu
AAAI 2016 Building Earth Mover's Distance on Bilingual Word Embeddings for Machine Translation Meng Zhang, Yang Liu, Huan-Bo Luan, Maosong Sun, Tatsuya Izuha, Jie Hao
IJCAI 2016 Knowledge Representation Learning with Entities, Attributes and Relations Yankai Lin, Zhiyuan Liu, Maosong Sun
IJCAI 2016 Max-Margin DeepWalk: Discriminative Learning of Network Representation Cunchao Tu, Weicheng Zhang, Zhiyuan Liu, Maosong Sun
AAAI 2016 Representation Learning of Knowledge Graphs with Entity Descriptions Ruobing Xie, Zhiyuan Liu, Jia Jia, Huanbo Luan, Maosong Sun
IJCAI 2016 Representation Learning of Knowledge Graphs with Hierarchical Types Ruobing Xie, Zhiyuan Liu, Maosong Sun
AAAI 2015 Contrastive Unsupervised Word Alignment with Non-Local Features Yang Liu, Maosong Sun
IJCAI 2015 Iterative Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora Meiping Dong, Yang Liu, Huan-Bo Luan, Maosong Sun, Tatsuya Izuha, Dakun Zhang
IJCAI 2015 Joint Learning of Character and Word Embeddings Xinxiong Chen, Lei Xu, Zhiyuan Liu, Maosong Sun, Huan-Bo Luan
AAAI 2015 Learning Entity and Relation Embeddings for Knowledge Graph Completion Yankai Lin, Zhiyuan Liu, Maosong Sun, Yang Liu, Xuan Zhu
IJCAI 2015 Network Representation Learning with Rich Text Information Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, Edward Y. Chang
AAAI 2015 Phrase Type Sensitive Tensor Indexing Model for Semantic Composition Yu Zhao, Zhiyuan Liu, Maosong Sun
IJCAI 2015 Representation Learning for Measuring Entity Relatedness with Rich Information Yu Zhao, Zhiyuan Liu, Maosong Sun
AAAI 2015 Topical Word Embeddings Yang Liu, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun
AAAI 2013 An Extended GHKM Algorithm for Inducing Lambda-SCFG Peng Li, Yang Liu, Maosong Sun
NeurIPS 2012 Monte Carlo Methods for Maximum Margin Supervised Topic Models Qixia Jiang, Jun Zhu, Maosong Sun, Eric P. Xing
IJCAI 2011 CHIME: An Efficient Error-Tolerant Chinese Pinyin Input Method Yabin Zheng, Chen Li, Maosong Sun
AAAI 2011 Fast Query Recommendation by Search Qixia Jiang, Maosong Sun
IJCAI 2009 Incorporating User Behaviors in New Word Detection Yabin Zheng, Zhiyuan Liu, Maosong Sun, Liyun Ru, Yang Zhang