Lou, Xingzhou

5 publications

AAAI 2025. Sequential Preference Optimization: Multi-Dimensional Preference Alignment with Implicit Reward Modeling. Xingzhou Lou, Junge Zhang, Jian Xie, Lifeng Liu, Dong Yan, Kaiqi Huang
NeurIPS 2025. Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-Grained Rewards. Honghao Chen, Xingzhou Lou, Xiaokun Feng, Kaiqi Huang, Xinlong Wang
ICML 2024. Position: Foundation Agents as the Paradigm Shift for Decision Making. Xiaoqian Liu, Xingzhou Lou, Jianbin Jiao, Junge Zhang
AAAI 2024. TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient. Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du
NeurIPS 2023. An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination. Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du