Sun, Siyang

8 publications

ICLR 2025 Aligned Better, Listen Better for Audio-Visual Large Language Models Yuxin Guo, Shuailei Ma, Shijie Ma, Xiaoyi Bao, Chen-Wei Xie, Kecheng Zheng, Tingyu Weng, Siyang Sun, Yun Zheng, Wei Zou
ECCV 2024 CoReS: Orchestrating the Dance of Reasoning and Segmentation Xiaoyi Bao, Siyang Sun, Shuailei Ma, Kecheng Zheng, Yuxin Guo, Guosheng Zhao, Yun Zheng, Xingang Wang
CVPR 2024 CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training Yuxin Guo, Siyang Sun, Shuailei Ma, Kecheng Zheng, Xiaoyi Bao, Shijie Ma, Wei Zou, Yun Zheng
ECCV 2024 FuseTeacher: Modality-Fused Encoders Are Strong Vision Supervisors Chen-Wei Xie, Siyang Sun, Liming Zhao, Pandeng Li, Shuailei Ma, Yun Zheng
AAAI 2024 Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation Xiaoyi Bao, Jie Qin, Siyang Sun, Xingang Wang, Yun Zheng
NeurIPS 2023 Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng
CVPR 2023 RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training Chen-Wei Xie, Siyang Sun, Xiong Xiong, Yun Zheng, Deli Zhao, Jingren Zhou
AAAI 2021 Fashion Focus: Multi-Modal Retrieval System for Video Commodity Localization in E-Commerce Yanhao Zhang, Qiang Wang, Pan Pan, Yun Zheng, Cheng Da, Siyang Sun, Yinghui Xu