Jin, Sheng

41 publications

AAAI 2025 AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks Zekang Yang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu
CVPR 2025 F-LMM: Grounding Frozen Large Multimodal Models Size Wu, Sheng Jin, Wenwei Zhang, Lumin Xu, Wentao Liu, Wei Li, Chen Change Loy
ICLR 2025 Frame-Voyager: Learning to Query Frames for Video Large Language Models Sicheng Yu, Chengkai Jin, Huanyu Wang, Zhenghao Chen, Sheng Jin, Zhongrong Zuo, Xu Xiaolei, Zhenbang Sun, Bingni Zhang, Jiawei Wu, Hao Zhang, Qianru Sun
ICCV 2025 Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li, Chen Change Loy
NeurIPS 2025 JavisGPT: A Unified Multi-Modal LLM for Sounding-Video Comprehension and Generation Kai Liu, Jungang Li, Yuchong Sun, Shengqiong Wu, Jianzhang Gao, Daoan Zhang, Wei Zhang, Sheng Jin, Sicheng Yu, Geng Zhan, Jiayi Ji, Fan Zhou, Liang Zheng, Shuicheng Yan, Hao Fei, Tat-Seng Chua
CVPR 2025 NADER: Neural Architecture Design via Multi-Agent Collaboration Zekang Yang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu
AAAI 2025 Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer Haopeng Sun, Yingwei Zhang, Lumin Xu, Sheng Jin, Yiqiang Chen
CVPR 2025 Unsupervised Continual Domain Shift Learning with Multi-Prototype Modeling Haopeng Sun, Yingwei Zhang, Lumin Xu, Sheng Jin, Ping Luo, Chen Qian, Wentao Liu, Yiqiang Chen
AAAI 2024 CLIM: Contrastive Language-Image Mosaic for Region Representation Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu, Chen Change Loy
ICLR 2024 CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy
ECCV 2024 GKGNet: Group K-Nearest Neighbor Based Graph Convolutional Network for Multi-Label Image Recognition Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu
NeurIPS 2024 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang, Wang Zeng, Sheng Jin, Lumin Xu, Wentao Liu, Chen Qian, Ruimao Zhang
ICLR 2024 LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-Grained Descriptors Sheng Jin, Xueying Jiang, Jiaxing Huang, Lewei Lu, Shijian Lu
NeurIPS 2024 MonoMAE: Enhancing Monocular 3D Detection Through Depth-Aware Masked Autoencoders Xueying Jiang, Sheng Jin, Xiaoqin Zhang, Ling Shao, Shijian Lu
ICLR 2024 PROGRAM: PROtotype GRAph Model Based Pseudo-Label Learning for Test-Time Adaptation Haopeng Sun, Lumin Xu, Sheng Jin, Ping Luo, Chen Qian, Wentao Liu
NeurIPS 2024 Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution Kai Liu, Zhihang Fu, Sheng Jin, Chao Chen, Ze Chen, Rongxin Jiang, Fan Zhou, Yaowu Chen, Jieping Ye
ECCV 2024 UniFS: Universal Few-Shot Instance Perception with Point Representations Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo
CVPR 2024 Weakly Supervised Monocular 3D Detection with a Single-View Image Xueying Jiang, Sheng Jin, Lewei Lu, Xiaoqin Zhang, Shijian Lu
ECCV 2024 When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset Yi Zhang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu
ECCV 2024 You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception Sheng Jin, Shuhuai Li, Tong Li, Wentao Liu, Chen Qian, Ping Luo
CVPR 2023 Aligning Bag of Regions for Open-Vocabulary Object Detection Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy
NeurIPS 2023 Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions Kai Liu, Zhihang Fu, Chao Chen, Sheng Jin, Ze Chen, Mingyuan Tao, Rongxin Jiang, Jieping Ye
ICCV 2023 Domain Generalization via Balancing Training Difficulty and Model Capability Xueying Jiang, Jiaxing Huang, Sheng Jin, Shijian Lu
ICCV 2023 Uncertainty-Aware Unsupervised Multi-Object Tracking Kai Liu, Sheng Jin, Zhihang Fu, Ze Chen, Rongxin Jiang, Jieping Ye
ECCV 2022 3D Interacting Hand Pose Estimation by Hand De-Occlusion and Removal Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo
CVPR 2022 Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer Wang Zeng, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang
ECCV 2022 Pose for Everything: Towards Category-Agnostic Pose Estimation Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang
ECCV 2022 PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu
ICLR 2022 Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization Can Wang, Sheng Jin, Yingda Guan, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang
AAAI 2022 Temporal Action Proposal Generation with Background Constraint Haosen Yang, Wenhao Wu, Lining Wang, Sheng Jin, Boyang Xia, Hongxun Yao, Hujie Huang
AAAI 2021 Asynchronous Teacher Guided Bit-Wise Hard Mining for Online Hashing Sheng Jin, Qin Zhou, Hongxun Yao, Yao Liu, Xian-Sheng Hua
ICCV 2021 Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang
CVPR 2021 ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search Lumin Xu, Yingda Guan, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang
CVPR 2021 When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo
ECCV 2020 Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo
AAAI 2020 HoMM: Higher-Order Moment Matching for Unsupervised Domain Adaptation Chao Chen, Zhihang Fu, Zhihong Chen, Sheng Jin, Zhaowei Cheng, Xinyu Jin, Xian-Sheng Hua
AAAI 2020 RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning Nan Jiang, Sheng Jin, Zhiyao Duan, Changshui Zhang
AAAI 2020 SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xian-Sheng Hua
NeurIPS 2020 When Counterpoint Meets Chinese Folk Melodies Nan Jiang, Sheng Jin, Zhiyao Duan, Changshui Zhang
ECCV 2020 Whole-Body Human Pose Estimation in the Wild Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo
NeurIPS 2018 Connectionist Temporal Classification with Maximum Entropy Regularization Hu Liu, Sheng Jin, Changshui Zhang