ML Anthology
Authors
Search
About
Wang, Pichao
32 publications
ICLR
2025
Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach
Zechen Bai
,
Tianjun Xiao
,
Tong He
,
Pichao Wang
,
Zheng Zhang
,
Thomas Brox
,
Mike Zheng Shou
NeurIPS
2025
SparseDiT: Token Sparsification for Efficient Diffusion Transformer
Shuning Chang
,
Pichao Wang
,
Jiasheng Tang
,
Fan Wang
,
Yi Yang
ICCV
2025
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
Yufei Wang
,
Lanqing Guo
,
Zhihao Li
,
Jiaxing Huang
,
Pichao Wang
,
Bihan Wen
,
Jian Wang
NeurIPS
2024
Diffusion-Inspired Truncated Sampler for Text-Video Retrieval
Jiamian Wang
,
Pichao Wang
,
Dongfang Liu
,
Qiang Guan
,
Sohail Dianat
,
Majid Rabbani
,
Raghuveer Rao
,
Zhiqiang Tao
NeurIPS
2024
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
Penghui Ruan
,
Pichao Wang
,
Divya Saxena
,
Jiannong Cao
,
Yuhui Shi
CVPR
2024
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
Wenhao Li
,
Mengyuan Liu
,
Hong Liu
,
Pichao Wang
,
Jialun Cai
,
Nicu Sebe
NeurIPS
2024
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Zechen Bai
,
Tong He
,
Haiyang Mei
,
Pichao Wang
,
Ziteng Gao
,
Joya Chen
,
Lei Liu
,
Zheng Zhang
,
Mike Zheng Shou
CVPR
2024
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
Jiamian Wang
,
Guohao Sun
,
Pichao Wang
,
Dongfang Liu
,
Sohail Dianat
,
Majid Rabbani
,
Raghuveer Rao
,
Zhiqiang Tao
ICCV
2023
Audio-Enhanced Text-to-Video Retrieval Using Text-Conditioned Feature Alignment
Sarah Ibrahimi
,
Xiaohang Sun
,
Pichao Wang
,
Amanmeet Garg
,
Ashutosh Sanan
,
Mohamed Omar
CVPRW
2023
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
,
Pichao Wang
,
Fan Wang
,
Jiashi Feng
,
Mike Zheng Shou
AAAI
2023
Frequency Domain Disentanglement for Arbitrary Neural Style Transfer
Dongyang Li
,
Hao Luo
,
Pichao Wang
,
Zhibin Wang
,
Shang Liu
,
Fan Wang
AAAI
2023
Head-Free Lightweight Semantic Segmentation with Linear Transformer
Bo Dong
,
Pichao Wang
,
Fan Wang
CVPR
2023
Making Vision Transformers Efficient from a Token Sparsification View
Shuning Chang
,
Pichao Wang
,
Ming Lin
,
Fan Wang
,
David Junhao Zhang
,
Rong Jin
,
Mike Zheng Shou
CVPR
2023
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation
Qitao Zhao
,
Ce Zheng
,
Mengyuan Liu
,
Pichao Wang
,
Chen Chen
ICCV
2023
Revisiting Vision Transformer from the View of Path Ensemble
Shuning Chang
,
Pichao Wang
,
Hao Luo
,
Fan Wang
,
Mike Zheng Shou
CVPR
2023
Selective Structured State-Spaces for Long-Form Video Understanding
Jue Wang
,
Wentao Zhu
,
Pichao Wang
,
Xiang Yu
,
Linda Liu
,
Mohamed Omar
,
Raffay Hamid
ICLR
2022
CDTrans: Cross-Domain Transformer for Unsupervised Domain Adaptation
Tongkun Xu
,
Weihua Chen
,
Pichao Wang
,
Fan Wang
,
Hao Li
,
Rong Jin
CVPR
2022
Decoupling and Recoupling Spatiotemporal Representation for RGB-D-Based Motion Recognition
Benjia Zhou
,
Pichao Wang
,
Jun Wan
,
Yanyan Liang
,
Fan Wang
,
Du Zhang
,
Zhen Lei
,
Hao Li
,
Rong Jin
CVPR
2022
EPro-PnP: Generalized End-to-End Probabilistic Perspective-N-Points for Monocular Object Pose Estimation
Hansheng Chen
,
Pichao Wang
,
Fan Wang
,
Wei Tian
,
Lu Xiong
,
Hao Li
ECCV
2022
KVT: k-NN Attention for Boosting Vision Transformers
Pichao Wang
,
Xue Wang
,
Fan Wang
,
Ming Lin
,
Shuning Chang
,
Hao Li
,
Rong Jin
CVPR
2022
MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
Wenhao Li
,
Hong Liu
,
Hao Tang
,
Pichao Wang
,
Luc Van Gool
AAAI
2022
Scaled ReLU Matters for Training Vision Transformers
Pichao Wang
,
Xue Wang
,
Hao Luo
,
Jingkai Zhou
,
Zhipeng Zhou
,
Fan Wang
,
Hao Li
,
Rong Jin
ECCV
2022
TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation
Zhaoyuan Yin
,
Pichao Wang
,
Fan Wang
,
Xianzhe Xu
,
Hanling Zhang
,
Hao Li
,
Rong Jin
NeurIPS
2022
VTC-LFC: Vision Transformer Compression with Low-Frequency Components
Zhenyu Wang
,
Hao Luo
,
Pichao Wang
,
Feng Ding
,
Fan Wang
,
Hao Li
ICCV
2021
TransReID: Transformer-Based Object Re-Identification
Shuting He
,
Hao Luo
,
Pichao Wang
,
Fan Wang
,
Hao Li
,
Wei Jiang
ICCV
2021
Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition
Ming Lin
,
Pichao Wang
,
Zhenhong Sun
,
Hesen Chen
,
Xiuyu Sun
,
Qi Qian
,
Hao Li
,
Rong Jin
AAAI
2020
R²MRF: Defocus Blur Detection via Recurrently Refining Multi-Scale Residual Features
Chang Tang
,
Xinwang Liu
,
Xinzhong Zhu
,
En Zhu
,
Kun Sun
,
Pichao Wang
,
Lizhe Wang
,
Albert Y. Zomaya
AAAI
2018
Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition
Pichao Wang
,
Wanqing Li
,
Jun Wan
,
Philip Ogunbona
,
Xinwang Liu
ICCVW
2017
Large-Scale Multimodal Gesture Recognition Using Heterogeneous Networks
Huogen Wang
,
Pichao Wang
,
Zhanjie Song
,
Wanqing Li
ICCVW
2017
Large-Scale Multimodal Gesture Segmentation and Recognition Based on Convolutional Neural Networks
Huogen Wang
,
Pichao Wang
,
Zhanjie Song
,
Wanqing Li
CVPR
2017
Scene Flow to Action mAP: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks
Pichao Wang
,
Wanqing Li
,
Zhimin Gao
,
Yuyao Zhang
,
Chang Tang
,
Philip Ogunbona
ICCVW
2017
Structured Images for RGB-D Action Recognition
Pichao Wang
,
Shuang Wang
,
Zhimin Gao
,
Yonghong Hou
,
Wanqing Li