ML Anthology
Authors
Search
About
Bao, Xiaoyi
10 publications
ICLR
2025
Aligned Better, Listen Better for Audio-Visual Large Language Models
Yuxin Guo
,
Shuailei Ma
,
Shijie Ma
,
Xiaoyi Bao
,
Chen-Wei Xie
,
Kecheng Zheng
,
Tingyu Weng
,
Siyang Sun
,
Yun Zheng
,
Wei Zou
AAAI
2025
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
,
Xiaofeng Wang
,
Zheng Zhu
,
Xinze Chen
,
Guan Huang
,
Xiaoyi Bao
,
Xingang Wang
ICCV
2025
DynImg: Key Frames with Visual Prompts Are Good Representation for Multi-Modal Video Understanding
Xiaoyi Bao
,
Chenwei Xie
,
Hao Tang
,
Tingyu Weng
,
Xiaofeng Wang
,
Yun Zheng
,
Xingang Wang
NeurIPS
2025
EgoVid-5m: A Large-Scale Video-Action Dataset for Egocentric Videos Generation
Xiaofeng Wang
,
Kang Zhao
,
Feng Liu
,
Jiayu Wang
,
Guosheng Zhao
,
Xiaoyi Bao
,
Zheng Zhu
,
Yingya Zhang
NeurIPS
2025
UFO: A Unified Approach to Fine-Grained Visual Perception via Open-Ended Language Interface
Hao Tang
,
Chen-Wei Xie
,
Haiyang Wang
,
Xiaoyi Bao
,
Tingyu Weng
,
Pandeng Li
,
Yun Zheng
,
Liwei Wang
ECCV
2024
CoReS: Orchestrating the Dance of Reasoning and Segmentation
Xiaoyi Bao
,
Siyang Sun
,
Shuailei Ma
,
Kecheng Zheng
,
Yuxin Guo
,
Guosheng Zhao
,
Yun Zheng
,
Xingang Wang
CVPR
2024
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
Yuxin Guo
,
Siyang Sun
,
Shuailei Ma
,
Kecheng Zheng
,
Xiaoyi Bao
,
Shijie Ma
,
Wei Zou
,
Yun Zheng
AAAI
2024
Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation
Xiaoyi Bao
,
Jie Qin
,
Siyang Sun
,
Xingang Wang
,
Yun Zheng
IJCAI
2022
Aspect-Based Sentiment Analysis with Opinion Tree Generation
Xiaoyi Bao
,
Zhongqing Wang
,
Xiaotong Jiang
,
Rong Xiao
,
Shoushan Li
AAAI
2021
Building Interpretable Interaction Trees for Deep NLP Models
Die Zhang
,
Hao Zhang
,
Huilin Zhou
,
Xiaoyi Bao
,
Da Huo
,
Ruizhao Chen
,
Xu Cheng
,
Mengyue Wu
,
Quanshi Zhang