Huang, Di

82 publications

AAAI 2025 3d²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling Zichen Tang, Hongyu Yang, Hanchen Zhang, Jiaxin Chen, Di Huang
CVPR 2025 APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers Zhuguanyu Wu, Jiayi Zhang, Jiaxin Chen, Jinyang Guo, Di Huang, Yunhong Wang
CVPR 2025 CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization Junhao Xu, Yanan Zhang, Zhi Cai, Di Huang
CVPR 2025 ComfyBench: Benchmarking LLM-Based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems Xiangyuan Xue, Zeyu Lu, Di Huang, Zidong Wang, Wanli Ouyang, Lei Bai
ICCV 2025 Constraint-Aware Feature Learning for Parametric Point Cloud Xi Cheng, Ruiqi Lei, Di Huang, Zhichao Liao, Fengyuan Piao, Yan Chen, Pingfa Feng, Long Zeng
ICLR 2025 Depth Any Video with Scalable Synthetic Data Honghui Yang, Di Huang, Wei Yin, Chunhua Shen, Haifeng Liu, Xiaofei He, Binbin Lin, Wanli Ouyang, Tong He
CVPR 2025 Diffusion-4k: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models Jinjin Zhang, Qiuyu Huang, Junjie Liu, Xiefan Guo, Di Huang
TMLR 2025 EMMA: End-to-End Multimodal Model for Autonomous Driving Jyh-Jing Hwang, Runsheng Xu, Hubert Lin, Wei-Chih Hung, Jingwei Ji, Kristy Choi, Di Huang, Tong He, Paul Covington, Benjamin Sapp, Yin Zhou, James Guo, Dragomir Anguelov, Mingxing Tan
AAAI 2025 GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction Junyi Chen, Weicai Ye, Yifan Wang, Danpeng Chen, Di Huang, Wanli Ouyang, Guofeng Zhang, Yu Qiao, Tong He
NeurIPS 2025 Implicit Modeling for Transferability Estimation of Vision Foundation Models Yaoyan Zheng, Huiqun Wang, Nan Zhou, Di Huang
AAAI 2025 InverseCoder: Self-Improving Instruction-Tuned Code LLMs with Inverse-Instruct Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Yewen Pu, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Dawei Yin, Xing Hu, Yunji Chen
ICLR 2025 MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers Yiwen Chen, Tong He, Di Huang, Weicai Ye, Sijin Chen, Jiaxiang Tang, Zhongang Cai, Lei Yang, Gang Yu, Guosheng Lin, Chi Zhang
AAAI 2025 Micro-Macro Wavelet-Based Gaussian Splatting for 3D Reconstruction from Unconstrained Images Yihui Li, Chengxin Lv, Hongyu Yang, Di Huang
NeurIPS 2025 MigGPT: Harnessing Large Language Models for Automated Migration of Out-of-Tree Linux Kernel Patches Across Versions Pucheng Dang, Di Huang, Dong Li, Kang Chen, Yuanbo Wen, Qi Guo, Xing Hu
ICLR 2025 ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction Ziyu Tang, Weicai Ye, Yifan Wang, Di Huang, Hujun Bao, Tong He, Guofeng Zhang
ICLR 2025 Progressive Parameter Efficient Transfer Learning for Semantic Segmentation Nan Zhou, Huiqun Wang, Yaoyan Zheng, Di Huang
NeurIPS 2025 QiMeng-CodeV-R1: Reasoning-Enhanced Verilog Generation Yaoyu Zhu, Di Huang, Hanqi Lyu, Xiaoyun Zhang, Chongxiao Li, Wenxuan Shi, Yutong Wu, Jianan Mu, Jinghua Wang, Yang Zhao, Pengwei Jin, Shuyao Cheng, Shengwen Liang, Xishan Zhang, Rui Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen
NeurIPS 2025 QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code Hainan Fang, Yuanbo Wen, Jun Bi, Yihan Wang, Tonghui He, Yanlin Tang, Di Huang, Jiaming Guo, Rui Zhang, Qi Guo, Yunji Chen
NeurIPS 2025 QiMeng-SALV: Signal-Aware Learning for Verilog Code Generation Yang Zhang, Rui Zhang, Jiaming Guo, Huang Lei, Di Huang, Yunpu Zhao, Shuyao Cheng, Pengwei Jin, Chongxiao Li, Zidong Du, Xing Hu, Qi Guo, Yunji Chen
ICCV 2025 ShortFT: Diffusion Model Alignment via Shortcut-Based Fine-Tuning Xiefan Guo, Miaomiao Cui, Liefeng Bo, Di Huang
NeurIPS 2025 Test-Time Adaptive Object Detection with Foundation Model Yingjie Gao, Yanan Zhang, Zhi Cai, Di Huang
CVPR 2025 Towards Training-Free Anomaly Detection with Vision and Language Foundation Models Jinjin Zhang, Guodong Wang, Yizhou Jin, Di Huang
AAAI 2025 Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation Yajie Liu, Guodong Wang, Jinjin Zhang, Qingjie Liu, Di Huang
ICLR 2025 Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction Junyi Chen, Di Huang, Weicai Ye, Wanli Ouyang, Tong He
NeurIPS 2024 Active Perception for Grasp Detection via Neural Graspness Field Haoxiang Ma, Modi Shi, Boyang Gao, Di Huang
ECCV 2024 AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong, Di Huang, Yunhong Wang
ECCV 2024 Agent3D-Zero: An Agent for Zero-Shot 3D Understanding Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli Ouyang, Tong He, Yanyong Zhang
WACV 2024 BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jacobs
ECCV 2024 Crowd-SAM:SAM as a Smart Annotator for Object Detection in Crowded Scenes Zhi Cai, Yingjie Gao, Yaoyan Zheng, Nan Zhou, Di Huang
AAAI 2024 Emergent Communication for Numerical Concepts Generalization Enshuai Zhou, Yifan Hao, Rui Zhang, Yuxuan Guo, Zidong Du, Xishan Zhang, Xinkai Song, Chao Wang, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen
ICML 2024 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu, Xihui Liu, Wanli Ouyang, Lei Bai
ECCV 2024 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He, Junyi Chen, Sida Peng, Di Huang, Yangguang Li, Xiaoshui Huang, Chun Yuan, Wanli Ouyang, Tong He
CVPR 2024 Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge Haoxiang Ma, Modi Shi, Boyang Gao, Di Huang
AAAI 2024 Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li
CVPR 2024 InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, Di Huang
AAAI 2024 MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, Nenghai Yu, Wanli Ouyang
ECCV 2024 Multi-Modal Relation Distillation for Unified 3D Representation Learning Huiqun Wang, Yiping Bao, Panwang Pan, Zeming Li, Xiao Liu, Ruijie Yang, Di Huang
NeurIPS 2024 NeuRodin: A Two-Stage Framework for High-Fidelity Neural Surface Reconstruction Yifan Wang, Di Huang, Weicai Ye, Guofeng Zhang, Wanli Ouyang, Tong He
NeurIPS 2024 Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning Haoyi Zhu, Yating Wang, Di Huang, Weicai Ye, Wanli Ouyang, Tong He
ECCV 2024 PredBench: Benchmarking Spatio-Temporal Prediction Across Diverse Disciplines ZiDong Wang, Zeyu Lu, Di Huang, Tong He, Xihui Liu, Wanli Ouyang, Lei Bai
ICLR 2024 Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-Class Classification Guodong Wang, Yunhong Wang, Xiuguo Bao, Di Huang
NeurIPS 2024 Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learner Hanwen Zhong, Jiaxin Chen, Yutong Zhang, Di Huang, Yunhong Wang
CVPR 2024 UniPAD: A Universal Pre-Training Paradigm for Autonomous Driving Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang
NeurIPS 2023 ANPL: Towards Natural Programming with Interactive Decomposition Di Huang, Ziyuan Nan, Xing Hu, Pengwei Jin, Shaohui Peng, Yuanbo Wen, Rui Zhang, Zidong Du, Qi Guo, Yewen Pu, Yunji Chen
CVPR 2023 Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images Bowei Du, Yecheng Huang, Jiaxin Chen, Di Huang
NeurIPS 2023 Compressed Video Prompt Tuning Bing Li, Jiaxin Chen, Xiuguo Bao, Di Huang
ICCV 2023 DR-Tune: Improving Fine-Tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration Nan Zhou, Jiaxin Chen, Di Huang
ICCV 2023 Denoising Diffusion Autoencoders Are Unified Self-Supervised Learners Weilai Xiang, Hongyu Yang, Di Huang, Yunhong Wang
NeurIPS 2023 Emergent Communication for Rules Reasoning Yuxuan Guo, Yifan Hao, Rui Zhang, Enshuai Zhou, Zidong Du, Xishan Zhang, Xinkai Song, Yuanbo Wen, Yongwei Zhao, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen
AAAI 2023 Learning Polysemantic Spoof Trace: A Multi-Modal Disentanglement Network for Face Anti-Spoofing Kaicheng Li, Hongyu Yang, Binghui Chen, Pengyu Li, Biao Wang, Di Huang
CVPR 2023 NeuFace: Realistic 3D Neural Face Rendering from Multi-View Images Mingwu Zheng, Haiyu Zhang, Hongyu Yang, Di Huang
CVPR 2023 OcTr: Octree-Based Transformer for 3D Object Detection Chao Zhou, Yanan Zhang, Jiaxin Chen, Di Huang
AAAI 2023 Online Symbolic Regression with Informative Query Pengwei Jin, Di Huang, Rui Zhang, Xing Hu, Ziyuan Nan, Zidong Du, Qi Guo, Yunji Chen
ICCV 2023 Ponder: Point Cloud Pre-Training via Neural Rendering Di Huang, Sida Peng, Tong He, Honghui Yang, Xiaowei Zhou, Wanli Ouyang
NeurIPS 2023 Seeing Is Not Always Believing: Benchmarking Human and Model Perception of AI-Generated Images Zeyu Lu, Di Huang, Lei Bai, Jingjing Qu, Chengyue Wu, Xihui Liu, Wanli Ouyang
ICCV 2023 Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang
CVPR 2022 ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo Biwen Lei, Xiefan Guo, Hongyu Yang, Miaomiao Cui, Xuansong Xie, Di Huang
AAAI 2022 ACGNet: Action Complement Graph Network for Weakly-Supervised Temporal Action Localization Zichen Yang, Jie Qin, Di Huang
CVPR 2022 CAT-Det: Contrastively Augmented Transformer for Multi-Modal 3D Object Detection Yanan Zhang, Jiaxin Chen, Di Huang
CVPR 2022 Entropy-Based Active Learning for Object Detection with Progressive Diversity Constraint Jiaxi Wu, Jiaxin Chen, Di Huang
CVPR 2022 ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations Mingwu Zheng, Hongyu Yang, Di Huang, Liming Chen
ECCV 2022 Motion Sensitive Contrastive Learning for Self-Supervised Video Representation Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang
ICLR 2022 Neural Program Synthesis with Query Di Huang, Rui Zhang, Xing Hu, Xishan Zhang, Pengwei Jin, Nan Li, Zidong Du, Qi Guo, Yunji Chen
NeurIPS 2022 OnePose++: Keypoint-Free One-Shot Object Pose Estimation Without CAD Models Xingyi He, Jiaming Sun, Yuang Wang, Di Huang, Hujun Bao, Xiaowei Zhou
IJCAI 2022 Representation Learning for Compressed Video Action Recognition via Attentive Cross-Modal Interaction with Motion Enhancement Bing Li, Jiaxin Chen, Dongming Zhang, Xiuguo Bao, Di Huang
CVPR 2022 Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection Jiaxi Wu, Jiaxin Chen, Mengzhe He, Yiru Wang, Bo Li, Bingqi Ma, Weihao Gan, Wei Wu, Yali Wang, Di Huang
CoRL 2022 Towards Scale Balanced 6-DoF Grasp Detection in Cluttered Scenes Haoxiang Ma, Di Huang
AAAI 2022 UFPMP-Det: Toward Accurate and Efficient Object Detection on Drone Imagery Yecheng Huang, Jiaxin Chen, Di Huang
ECCV 2022 Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang
ICCV 2021 Image Inpainting via Conditional Texture and Structure Dual Generation Xiefan Guo, Hongyu Yang, Di Huang
AAAI 2021 PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection Yanan Zhang, Di Huang, Yunhong Wang
ICCV 2021 PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6d Pose Estimation Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang
ECCV 2020 Beyond 3DMM Space: Towards Fine-Grained 3D Face Reconstruction Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li
ICML 2020 Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels Lu Jiang, Di Huang, Mason Liu, Weilong Yang
AAAI 2020 DWM: A Decomposable Winograd Method for Convolution Acceleration Di Huang, Xishan Zhang, Rui Zhang, Tian Zhi, Deyuan He, Jiaming Guo, Chang Liu, Qi Guo, Zidong Du, Shaoli Liu, Tianshi Chen, Yunji Chen
AAAI 2020 Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism Mingda Wu, Di Huang, Yuanfang Guo, Yunhong Wang
ECCV 2020 Improving Object Detection with Selective Self-Supervised Self-Training Yandong Li, Di Huang, Danfeng Qin, Liqiang Wang, Boqing Gong
ECCV 2020 Multi-Scale Positive Sample Refinement for Few-Shot Object Detection Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang
CVPRW 2019 2D-3D Heterogeneous Face Recognition Based on Deep Coupled Spectral Regression Yangtao Zheng, Di Huang, Weixin Li, Shupeng Wang, Yunhong Wang
ECCV 2018 Receptive Field Block Net for Accurate and Fast Object Detection Songtao Liu, Di Huang, andYunhong Wang
ICCVW 2017 Detecting Smiles of Young Children via Deep Transfer Learning Yu Xia, Di Huang, Yunhong Wang
CVPRW 2011 Textured 3D Face Recognition Using Biological Vision-Based Facial Representation and Optimized Weighted Sum Fusion Di Huang, Wael Ben Soltana, Mohsen Ardabilian, Yunhong Wang, Liming Chen