CVPR 2022

2074 papers

"The Pedestrian Next to the Lamppost" Adaptive Object Graphs for Better Instantaneous Mapping Avishkar Saha, Oscar Mendez, Chris Russell, Richard Bowden
PDF
360-Attack: Distortion-Aware Perturbations from Perspective-Views Yunjian Zhang, Yanwei Liu, Jinxia Liu, Jingbo Miao, Antonios Argyriou, Liming Wang, Zhen Xu
PDF
360MonoDepth: High-Resolution 360deg Monocular Depth Estimation Manuel Rey-Area, Mingze Yuan, Christian Richardt
PDF
3D Common Corruptions and Data Augmentation Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir
PDF
3D Human Tongue Reconstruction from Single "In-the-Wild" Images Stylianos Ploumpis, Stylianos Moschoglou, Vasileios Triantafyllou, Stefanos Zafeiriou
PDF
3D Moments from Near-Duplicate Photos Qianqian Wang, Zhengqi Li, David Salesin, Noah Snavely, Brian Curless, Janne Kontkanen
PDF
3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image Fangzhou Mu, Jian Wang, Yicheng Wu, Yin Li
PDF
3D Scene Painting via Semantic Image Synthesis Jaebong Jeong, Janghun Jo, Sunghyun Cho, Jaesik Park
PDF
3D Shape Reconstruction from 2D Images with Disentangled Attribute Flow Xin Wen, Junsheng Zhou, Yu-Shen Liu, Hua Su, Zhen Dong, Zhizhong Han
PDF
3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and Faces Simone Foti, Bongjin Koo, Danail Stoyanov, Matthew J. Clarkson
PDF
3D-Aware Image Synthesis via Learning Structural and Textural Representations Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
PDF
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection Junyu Luo, Jiahui Fu, Xianghao Kong, Chen Gao, Haibing Ren, Hao Shen, Huaxia Xia, Si Liu
PDF
3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Mohammad-Ali Nikouei Mahani, Nassir Navab, Benjamin Busam, Federico Tombari
PDF
3DAC: Learning Attribute Compression for Point Clouds Guangchi Fang, Qingyong Hu, Hanyun Wang, Yiling Xu, Yulan Guo
PDF
3DeformRS: Certifying Spatial Deformations on Point Clouds Gabriel Pérez S., Juan C. Pérez, Motasem Alfarra, Silvio Giancola, Bernard Ghanem
PDF
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds Daigang Cai, Lichen Zhao, Jing Zhang, Lu Sheng, Dong Xu
PDF
3MASSIV: Multilingual, Multimodal and Multi-Aspect Dataset of Social Media Short Videos Vikram Gupta, Trisha Mittal, Puneet Mathur, Vaibhav Mishra, Mayank Maheshwari, Aniket Bera, Debdoot Mukherjee, Dinesh Manocha
PDF
3PSDF: Three-Pole Signed Distance Function for Learning Surfaces with Arbitrary Topologies Weikai Chen, Cheng Lin, Weiyang Li, Bo Yang
PDF
A Brand New Dance Partner: Music-Conditioned Pluralistic Dancing Controlled by Multiple Dance Genres Jinwoo Kim, Heeseok Oh, Seongjean Kim, Hoseok Tong, Sanghoon Lee
PDF
A Closer Look at Few-Shot Image Generation Yunqing Zhao, Henghui Ding, Houjing Huang, Ngai-Man Cheung
PDF
A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes Mazda Moayeri, Phillip Pope, Yogesh Balaji, Soheil Feizi
PDF
A Conservative Approach for Unbiased Learning on Unknown Biases Myeongho Jeon, Daekyung Kim, Woochul Lee, Myungjoo Kang, Joonseok Lee
PDF
A ConvNet for the 2020s Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie
PDF
A Deeper Dive into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis
PDF
A Differentiable Two-Stage Alignment Scheme for Burst Image Reconstruction with Large Shift Shi Guo, Xi Yang, Jianqi Ma, Gaofeng Ren, Lei Zhang
PDF
A Dual Weighting Label Assignment Scheme for Object Detection Shuai Li, Chenhang He, Ruihuang Li, Lei Zhang
PDF
A Framework for Learning Ante-Hoc Explainable Models via Concepts Anirban Sarkar, Deepak Vijaykeerthy, Anindya Sarkar, Vineeth N Balasubramanian
PDF
A Graph Matching Perspective with Transformers on Video Instance Segmentation Zheyun Qin, Xiankai Lu, Xiushan Nie, Yilong Yin, Jianbing Shen
PDF
A Hybrid Egocentric Activity Anticipation Framework via Memory-Augmented Recurrent and One-Shot Representation Forecasting Tianshan Liu, Kin-Man Lam
PDF
A Hybrid Quantum-Classical Algorithm for Robust Fitting Anh-Dzung Doan, Michele Sasdelli, David Suter, Tat-Jun Chin
PDF
A Keypoint-Based Global Association Network for Lane Detection Jinsheng Wang, Yinchao Ma, Shaofei Huang, Tianrui Hui, Fei Wang, Chen Qian, Tianzhu Zhang
PDF
A Large-Scale Comprehensive Dataset and Copy-Overlap Aware Evaluation Protocol for Segment-Level Video Copy Detection Sifeng He, Xudong Yang, Chen Jiang, Gang Liang, Wei Zhang, Tan Pan, Qing Wang, Furong Xu, Chunguang Li, JinXiong Liu, Hui Xu, Kaiming Huang, Yuan Cheng, Feng Qian, Xiaobo Zhang, Lei Yang
PDF
A Low-Cost & Real-Time Motion Capture System Anargyros Chatzitofis, Georgios Albanis, Nikolaos Zioulis, Spyridon Thermos
PDF
A Probabilistic Graphical Model Based on Neural-Symbolic Reasoning for Visual Relationship Detection Dongran Yu, Bo Yang, Qianhao Wei, Anchen Li, Shirui Pan
PDF
A Proposal-Based Paradigm for Self-Supervised Sound Source Localization in Videos Hanyu Xuan, Zhiliang Wu, Jian Yang, Yan Yan, Xavier Alameda-Pineda
PDF
A Re-Balancing Strategy for Class-Imbalanced Classification Based on Instance Difficulty Sihao Yu, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Zizhen Wang, Xueqi Cheng
PDF
A Sampling-Based Approach for Efficient Clustering in Large Datasets Georgios Exarchakis, Omar Oubari, Gregor Lenz
PDF
A Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching Paul Roetzer, Paul Swoboda, Daniel Cremers, Florian Bernard
PDF
A Self-Supervised Descriptor for Image Copy Detection Ed Pizzi, Sreya Dutta Roy, Sugosh Nagavara Ravindra, Priya Goyal, Matthijs Douze
PDF
A Simple Data Mixing Prior for Improving Self-Supervised Learning Sucheng Ren, Huiyu Wang, Zhengqi Gao, Shengfeng He, Alan Yuille, Yuyin Zhou, Cihang Xie
PDF
A Simple Episodic Linear Probe Improves Visual Recognition in the Wild Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang
PDF
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation Yutong Chen, Fangyun Wei, Xiao Sun, Zhirong Wu, Stephen Lin
PDF
A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration Ramya Hebbalaguppe, Jatin Prakash, Neelabh Madan, Chetan Arora
PDF
A Structured Dictionary Perspective on Implicit Neural Representations Gizem Yüce, Guillermo Ortiz-Jiménez, Beril Besbinar, Pascal Frossard
PDF
A Study on the Distribution of Social Biases in Self-Supervised Learning Visual Models Kirill Sirotkin, Pablo Carballeira, Marcos Escudero-Viñolo
PDF
A Style-Aware Discriminator for Controllable Image Translation Kunhee Kim, Sanghun Park, Eunyeong Jeon, Taehun Kim, Daijin Kim
PDF
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-Resolution Jianqi Ma, Zhetong Liang, Lei Zhang
PDF
A Unified Framework for Implicit Sinkhorn Differentiation Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Florian Bernard, Daniel Cremers
PDF
A Unified Model for Line Projections in Catadioptric Cameras with Rotationally Symmetric Mirrors Pedro Miraldo, José Pedro Iglesias
PDF
A Unified Query-Based Paradigm for Point Cloud Understanding Zetong Yang, Li Jiang, Yanan Sun, Bernt Schiele, Jiaya Jia
PDF
A Variational Bayesian Method for Similarity Learning in Non-Rigid Image Registration Daniel Grzech, Mohammad Farid Azampour, Ben Glocker, Julia Schnabel, Nassir Navab, Bernhard Kainz, Loïc Le Folgoc
PDF
A Versatile Multi-View Framework for LiDAR-Based 3D Object Detection with Guidance from Panoptic Segmentation Hamidreza Fazlali, Yixuan Xu, Yuan Ren, Bingbing Liu
PDF
A Voxel Graph CNN for Object Classification with Event Cameras Yongjian Deng, Hao Chen, Hai Liu, Youfu Li
PDF
A-ViT: Adaptive Tokens for Efficient Vision Transformer Hongxu Yin, Arash Vahdat, Jose M. Alvarez, Arun Mallya, Jan Kautz, Pavlo Molchanov
PDF
Abandoning the Bayer-Filter to See in the Dark Xingbo Dong, Wanyan Xu, Zhihui Miao, Lan Ma, Chao Zhang, Jiewen Yang, Zhe Jin, Andrew Beng Jin Teoh, Jiajun Shen
PDF
ABO: Dataset and Benchmarks for Real-World 3D Object Understanding Jasmine Collins, Shubham Goel, Kenan Deng, Achleshwar Luthra, Leon Xu, Erhan Gundogdu, Xi Zhang, Tomas F. Yago Vicente, Thomas Dideriksen, Himanshu Arora, Matthieu Guillaumin, Jitendra Malik
PDF
ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo Biwen Lei, Xiefan Guo, Hongyu Yang, Miaomiao Cui, Xuansong Xie, Di Huang
PDF
Accelerating DETR Convergence via Semantic-Aligned Matching Gongjie Zhang, Zhipeng Luo, Yingchen Yu, Kaiwen Cui, Shijian Lu
PDF
Accelerating Neural Network Optimization Through an Automated Control Theory Lens Jiahao Wang, Baoyuan Wu, Rui Su, Mingdeng Cao, Shuwei Shi, Wanli Ouyang, Yujiu Yang
PDF
Accelerating Video Object Segmentation with Compressed Video Kai Xu, Angela Yao
PDF
Accurate 3D Body Shape Regression Using Metric and Semantic Attributes Vasileios Choutas, Lea Müller, Chun-Hao P. Huang, Siyu Tang, Dimitrios Tzionas, Michael J. Black
PDF
ACPL: Anti-Curriculum Pseudo-Labelling for Semi-Supervised Medical Image Classification Fengbei Liu, Yu Tian, Yuanhong Chen, Yuyuan Liu, Vasileios Belagiannis, Gustavo Carneiro
PDF
Acquiring a Dynamic Light Field Through a Single-Shot Coded Image Ryoya Mizuno, Keita Takahashi, Michitaka Yoshida, Chihiro Tsutake, Toshiaki Fujii, Hajime Nagahara
PDF
Active Learning by Feature Mixing Amin Parvaneh, Ehsan Abbasnejad, Damien Teney, Gholamreza Haffari, Anton van den Hengel, Javen Qinfeng Shi
PDF
Active Learning for Open-Set Annotation Kun-Peng Ning, Xun Zhao, Yu Li, Sheng-Jun Huang
PDF
Active Teacher for Semi-Supervised Object Detection Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji
PDF
ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation Isabella Liu, Edward Yang, Jianyu Tao, Rui Chen, Xiaoshuai Zhang, Qing Ran, Zhu Liu, Hao Su
PDF
AdaFace: Quality Adaptive Margin for Face Recognition Minchul Kim, Anil K. Jain, Xiaoming Liu
PDF
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition Yulin Wang, Yang Yue, Yuanze Lin, Haojun Jiang, Zihang Lai, Victor Kulikov, Nikita Orlov, Humphrey Shi, Gao Huang
PDF
AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-Time Image Enhancement Canqian Yang, Meiguang Jin, Xu Jia, Yi Xu, Ying Chen
PDF
AdaMixer: A Fast-Converging Query-Based Object Detector Ziteng Gao, Limin Wang, Bing Han, Sheng Guo
PDF
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts Bingqian Lin, Yi Zhu, Zicong Chen, Xiwen Liang, Jianzhuang Liu, Xiaodan Liang
PDF
Adaptive Early-Learning Correction for Segmentation from Noisy Annotations Sheng Liu, Kangning Liu, Weicheng Zhu, Yiqiu Shen, Carlos Fernandez-Granda
PDF
Adaptive Gating for Single-Photon 3D Imaging Ryan Po, Adithya Pediredla, Ioannis Gkioulekas
PDF
Adaptive Hierarchical Representation Learning for Long-Tailed Object Detection Banghuai Li
PDF
Adaptive Trajectory Prediction via Transferable GNN Yi Xu, Lichen Wang, Yizhou Wang, Yun Fu
PDF
AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation Mohsen Gholami, Bastian Wandt, Helge Rhodin, Rabab Ward, Z. Jane Wang
PDF
ADAS: A Direct Adaptation Strategy for Multi-Target Domain Adaptive Semantic Segmentation Seunghun Lee, Wonhyeok Choi, Changjae Kim, Minwoo Choi, Sunghoon Im
PDF
AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks Huu Le, Rasmus Kjær Høier, Che-Tsung Lin, Christopher Zach
PDF
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim
PDF
ADeLA: Automatic Dense Labeling with Attention for Viewpoint Shift in Semantic Segmentation Hanxiang Ren, Yanchao Yang, He Wang, Bokui Shen, Qingnan Fan, Youyi Zheng, C. Karen Liu, Leonidas J. Guibas
PDF
Adiabatic Quantum Computing for Multi Object Tracking Jan-Nico Zaech, Alexander Liniger, Martin Danelljan, Dengxin Dai, Luc Van Gool
PDF
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions Hongwei Xue, Tiankai Hang, Yanhong Zeng, Yuchong Sun, Bei Liu, Huan Yang, Jianlong Fu, Baining Guo
PDF
Adversarial Eigen Attack on Black-Box Models Linjun Zhou, Peng Cui, Xingxuan Zhang, Yinan Jiang, Shiqiang Yang
PDF
Adversarial Parametric Pose Prior Andrey Davydov, Anastasia Remizova, Victor Constantin, Sina Honari, Mathieu Salzmann, Pascal Fua
PDF
Adversarial Texture for Fooling Person Detectors in the Physical World Zhanhao Hu, Siyuan Huang, Xiaopei Zhu, Fuchun Sun, Bo Zhang, Xiaolin Hu
PDF
AEGNN: Asynchronous Event-Based Graph Neural Networks Simon Schaefer, Daniel Gehrig, Davide Scaramuzza
PDF
Aesthetic Text Logo Synthesis via Content-Aware Layout Inferring Yizhi Wang, Guo Pu, Wenhan Luo, Yexin Wang, Pengfei Xiong, Hongwen Kang, Zhouhui Lian
PDF
Affine Medical Image Registration with Coarse-to-Fine Vision Transformer Tony C. W. Mok, Albert C. S. Chung
PDF
AIM: An Auto-Augmenter for Images and Meshes Vinit Veerendraveer Singh, Chandra Kambhamettu
PDF
AirObject: A Temporally Evolving Graph Embedding for Object Identification Nikhil Varma Keetha, Chen Wang, Yuheng Qiu, Kuan Xu, Sebastian Scherer
PDF
AKB-48: A Real-World Articulated Object Knowledge Base Liu Liu, Wenqiang Xu, Haoyuan Fu, Sucheng Qian, Qiaojun Yu, Yang Han, Cewu Lu
PDF
Aladdin: Joint Atlas Building and Diffeomorphic Registration Learning with Pairwise Alignment Zhipeng Ding, Marc Niethammer
PDF
Align and Prompt: Video-and-Language Pre-Training with Entity Prompts Dongxu Li, Junnan Li, Hongdong Li, Juan Carlos Niebles, Steven C.H. Hoi
PDF
Align Representations with Base: A New Approach to Self-Supervised Learning Shaofeng Zhang, Lyn Qiu, Feng Zhu, Junchi Yan, Hengrui Zhang, Rui Zhao, Hongyang Li, Xiaokang Yang
PDF
Alignment-Uniformity Aware Representation Learning for Zero-Shot Video Classification Shi Pu, Kaili Zhao, Mao Zheng
PDF
AlignMixup: Improving Representations by Interpolating Aligned Features Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis
PDF
AlignQ: Alignment Quantization with ADMM-Based Correlation Preservation Ting-An Chen, De-Nian Yang, Ming-Syan Chen
PDF
All-in-One Image Restoration for Unknown Corruption Boyun Li, Xiao Liu, Peng Hu, Zhongqin Wu, Jiancheng Lv, Xi Peng
PDF
All-Photon Polarimetric Time-of-Flight Imaging Seung-Hwan Baek, Felix Heide
PDF
Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint Jiaxian Guo, Jiachen Li, Huan Fu, Mingming Gong, Kun Zhang, Dacheng Tao
PDF
AME: Attention and Memory Enhancement in Hyper-Parameter Optimization Nuo Xu, Jianlong Chang, Xing Nie, Chunlei Huo, Shiming Xiang, Chunhong Pan
PDF
Amodal Panoptic Segmentation Rohit Mohan, Abhinav Valada
PDF
Amodal Segmentation Through Out-of-Task and Out-of-Distribution Generalization with a Bayesian Model Yihong Sun, Adam Kortylewski, Alan Yuille
PDF
An Efficient Training Approach for Very Large Scale Face Recognition Kai Wang, Shuo Wang, Panpan Zhang, Zhipeng Zhou, Zheng Zhu, Xiaobo Wang, Xiaojiang Peng, Baigui Sun, Hao Li, Yang You
PDF
An Empirical Study of End-to-End Temporal Action Detection Xiaolong Liu, Song Bai, Xiang Bai
PDF
An Empirical Study of Training End-to-End Vision-and-Language Transformers Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng
PDF
An Image Patch Is a Wave: Phase-Aware Vision MLP Yehui Tang, Kai Han, Jianyuan Guo, Chang Xu, Yanxi Li, Chao Xu, Yunhe Wang
PDF
An Iterative Quantum Approach for Transformation Estimation from Point Sets Natacha Kuete Meli, Florian Mannel, Jan Lellmann
PDF
An MIL-Derived Transformer for Weakly Supervised Point Cloud Segmentation Cheng-Kun Yang, Ji-Jia Wu, Kai-Syun Chen, Yung-Yu Chuang, Yen-Yu Lin
PDF
Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding Xun Long Ng, Kian Eng Ong, Qichen Zheng, Yun Ni, Si Yong Yeo, Jun Liu
PDF
Anomaly Detection via Reverse Distillation from One-Class Embedding Hanqiu Deng, Xingyu Li
PDF
AnyFace: Free-Style Text-to-Face Synthesis and Manipulation Jianxin Sun, Qiyao Deng, Qi Li, Muyi Sun, Min Ren, Zhenan Sun
PDF
AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network Wooseok Lee, Sanghyun Son, Kyoung Mu Lee
PDF
APES: Articulated Part Extraction from Sprite Sheets Zhan Xu, Matthew Fisher, Yang Zhou, Deepali Aneja, Rushikesh Dudhat, Li Yi, Evangelos Kalogerakis
PDF
Appearance and Structure Aware Robust Deep Visual Graph Matching: Attack, Defense and Beyond Qibing Ren, Qingquan Bao, Runzhong Wang, Junchi Yan
PDF
APRIL: Finding the Achilles' Heel on Privacy for Vision Transformers Jiahao Lu, Xi Sheryl Zhang, Tianli Zhao, Xiangyu He, Jian Cheng
PDF
AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields Takuhiro Kaneko
PDF
Arbitrary-Scale Image Synthesis Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc Van Gool
PDF
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search Minbin Huang, Zhijian Huang, Changlin Li, Xin Chen, Hang Xu, Zhenguo Li, Xiaodan Liang
PDF
ARCS: Accurate Rotation and Correspondence Search Liangzu Peng, Manolis C. Tsakiris, René Vidal
PDF
Are Multimodal Transformers Robust to Missing Modality? Mengmeng Ma, Jian Ren, Long Zhao, Davide Testuggine, Xi Peng
PDF
ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation Ruibin Wang, Yibo Yang, Dacheng Tao
PDF
ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis Lixin Yang, Kailin Li, Xinyu Zhan, Jun Lv, Wenqiang Xu, Jiefeng Li, Cewu Lu
PDF
Artistic Style Discovery with Independent Components Xin Xie, Yi Li, Huaibo Huang, Haiyan Fu, Wanwan Wang, Yanqing Guo
PDF
ASM-Loc: Action-Aware Segment Modeling for Weakly-Supervised Temporal Action Localization Bo He, Xitong Yang, Le Kang, Zhiyu Cheng, Xin Zhou, Abhinav Shrivastava
PDF
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities Fadime Sener, Dibyadip Chatterjee, Daniel Shelepov, Kun He, Dipika Singhania, Robert Wang, Angela Yao
PDF
ATPFL: Automatic Trajectory Prediction Model Design Under Federated Learning Framework Chunnan Wang, Xiang Chen, Junzhe Wang, Hongzhi Wang
PDF
Attention Concatenation Volume for Accurate and Efficient Stereo Matching Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang
PDF
Attentive Fine-Grained Structured Sparsity for Image Restoration Junghun Oh, Heewon Kim, Seungjun Nah, Cheeun Hong, Jonghyun Choi, Kyoung Mu Lee
PDF
Attributable Visual Similarity Learning Borui Zhang, Wenzhao Zheng, Jie Zhou, Jiwen Lu
PDF
Attribute Group Editing for Reliable Few-Shot Image Generation Guanqi Ding, Xinzhe Han, Shuhui Wang, Shuzhe Wu, Xin Jin, Dandan Tu, Qingming Huang
PDF
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-Shot Learning Yangji He, Weihan Liang, Dongyang Zhao, Hong-Yu Zhou, Weifeng Ge, Yizhou Yu, Wenqiang Zhang
PDF
Audio-Adaptive Activity Recognition Across Video Domains Yunhua Zhang, Hazel Doughty, Ling Shao, Cees G. M. Snoek
PDF
Audio-Driven Neural Gesture Reenactment with Video Motion Graphs Yang Zhou, Jimei Yang, Dingzeyu Li, Jun Saito, Deepali Aneja, Evangelos Kalogerakis
PDF
Audio-Visual Generalised Zero-Shot Learning with Cross-Modal Attention and Language Otniel-Bogdan Mercea, Lukas Riesch, A. Sophia Koepke, Zeynep Akata
PDF
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis Karren Yang, Dejan Marković, Steven Krenn, Vasu Agrawal, Alexander Richard
PDF
Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage Zhuohang Li, Jiaxin Zhang, Luyang Liu, Jian Liu
PDF
Aug-NeRF: Training Stronger Neural Radiance Fields with Triple-Level Physically-Grounded Augmentations Tianlong Chen, Peihao Wang, Zhiwen Fan, Zhangyang Wang
PDF
Augmented Geometric Distillation for Data-Free Incremental Person ReID Yichen Lu, Mei Wang, Weihong Deng
PDF
Autofocus for Event Cameras Shijie Lin, Yinqiang Zhang, Lei Yu, Bin Zhou, Xiaowei Luo, Jia Pan
PDF
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation Xueyi Liu, Xiaomeng Xu, Anyi Rao, Chuang Gan, Li Yi
PDF
AutoLoss-GMS: Searching Generalized Margin-Based SoftMax Loss Function for Person Re-Identification Hongyang Gu, Jianmin Li, Guangyuan Fu, Chifong Wong, Xinghao Chen, Jun Zhu
PDF
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu
PDF
Automated Progressive Learning for Efficient Training of Vision Transformers Changlin Li, Bohan Zhuang, Guangrun Wang, Xiaodan Liang, Xiaojun Chang, Yi Yang
PDF
Automatic Color Image Stitching Using Quaternion Rank-1 Alignment Jiaxue Li, Yicong Zhou
PDF
Automatic Relation-Aware Graph Network Proliferation Shaofei Cai, Liang Li, Xinzhe Han, Jiebo Luo, Zheng-Jun Zha, Qingming Huang
PDF
Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis Albert Tseng, Jennifer J. Sun, Yisong Yue
PDF
AutoMine: An Unmanned Mine Dataset Yuchen Li, Zixuan Li, Siyu Teng, Yu Zhang, Yuhang Zhou, Yuchang Zhu, Dongpu Cao, Bin Tian, Yunfeng Ai, Zhe Xuanyuan, Long Chen
PDF
Autoregressive Image Generation Using Residual Quantization Doyup Lee, Chiheon Kim, Saehoon Kim, Minsu Cho, Wook-Shin Han
PDF
AutoRF: Learning 3D Object Radiance Fields from Single View Observations Norman Müller, Andrea Simonelli, Lorenzo Porzi, Samuel Rota Bulò, Matthias Nießner, Peter Kontschieder
PDF
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation Paritosh Mittal, Yen-Chi Cheng, Maneesh Singh, Shubham Tulsiani
PDF
AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis Zhiqin Chen, Kangxue Yin, Sanja Fidler
PDF
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval Riku Togashi, Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Tetsuya Sakai
PDF
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Wenqiang Zhang, Qian Zhang, Chang Huang, Wenyu Liu
PDF
B-Cos Networks: Alignment Is All We Need for Interpretability Moritz Böhle, Mario Fritz, Bernt Schiele
PDF
B-DARTS: Beta-Decay Regularization for Differentiable Architecture Search Peng Ye, Baopu Li, Yikang Li, Tao Chen, Jiayuan Fan, Wanli Ouyang
PDF
Back to Reality: Weakly-Supervised 3D Object Detection with Shape-Guided Label Enhancement Xiuwei Xu, Yifan Wang, Yu Zheng, Yongming Rao, Jie Zhou, Jiwen Lu
PDF
Backdoor Attacks on Self-Supervised Learning Aniruddha Saha, Ajinkya Tejankar, Soroush Abbasi Koohpayegani, Hamed Pirsiavash
PDF
Background Activation Suppression for Weakly Supervised Object Localization Pingyu Wu, Wei Zhai, Yang Cao
PDF
BACON: Band-Limited Coordinate Networks for Multiscale Scene Representation David B. Lindell, Dave Van Veen, Jeong Joon Park, Gordon Wetzstein
PDF
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory Li Siyao, Weijiang Yu, Tianpei Gu, Chunze Lin, Quan Wang, Chen Qian, Chen Change Loy, Ziwei Liu
PDF
Balanced and Hierarchical Relation Learning for One-Shot Object Detection Hanqing Yang, Sijia Cai, Hualian Sheng, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Yong Tang, Yu Zhang
PDF
Balanced Contrastive Learning for Long-Tailed Visual Recognition Jianggang Zhu, Zheng Wang, Jingjing Chen, Yi-Ping Phoebe Chen, Yu-Gang Jiang
PDF
Balanced MSE for Imbalanced Visual Regression Jiawei Ren, Mingyuan Zhang, Cunjun Yu, Ziwei Liu
PDF
Balanced Multimodal Learning via On-the-Fly Gradient Modulation Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu
PDF
BaLeNAS: Differentiable Architecture Search via the Bayesian Learning Rule Miao Zhang, Shirui Pan, Xiaojun Chang, Steven Su, Jilin Hu, Gholamreza Haffari, Bin Yang
PDF
Bandits for Structure Perturbation-Based Black-Box Attacks to Graph Neural Networks with Theoretical Guarantees Binghui Wang, Youqi Li, Pan Zhou
PDF
BANMo: Building Animatable 3D Neural Models from Many Casual Videos Gengshan Yang, Minh Vo, Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo
PDF
BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information Nadine Rüegg, Silvia Zuffi, Konrad Schindler, Michael J. Black
PDF
BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment Kelvin C.K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy
PDF
BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning Zhi Hou, Baosheng Yu, Dacheng Tao
PDF
Bayesian Invariant Risk Minimization Yong Lin, Hanze Dong, Hao Wang, Tong Zhang
PDF
Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection Hitesh Sapkota, Qi Yu
PDF
BCOT: A Markerless High-Precision 3D Object Tracking Benchmark Jiachen Li, Bin Wang, Shiqiang Zhu, Xin Cao, Fan Zhong, Wenxuan Chen, Te Li, Jason Gu, Xueying Qin
PDF
BE-STI: Spatial-Temporal Integrated Network for Class-Agnostic Motion Prediction with Bidirectional Enhancement Yunlong Wang, Hongyu Pan, Jun Zhu, Yu-Huan Wu, Xin Zhan, Kun Jiang, Diange Yang
PDF
BEHAVE: Dataset and Method for Tracking Human Object Interactions Bharat Lal Bhatnagar, Xianghui Xie, Ilya A. Petrov, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll
PDF
Bending Graphs: Hierarchical Shape Matching Using Gated Optimal Transport Mahdi Saleh, Shun-Cheng Wu, Luca Cosmo, Nassir Navab, Benjamin Busam, Federico Tombari
PDF
Bending Reality: Distortion-Aware Transformers for Adapting to Panoramic Semantic Segmentation Jiaming Zhang, Kailun Yang, Chaoxiang Ma, Simon Reiß, Kunyu Peng, Rainer Stiefelhagen
PDF
Better Trigger Inversion Optimization in Backdoor Scanning Guanhong Tao, Guangyu Shen, Yingqi Liu, Shengwei An, Qiuling Xu, Shiqing Ma, Pan Li, Xiangyu Zhang
PDF
BEVT: BERT Pretraining of Video Transformers Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan
PDF
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds Chaoda Zheng, Xu Yan, Haiming Zhang, Baoyuan Wang, Shenghui Cheng, Shuguang Cui, Zhen Li
PDF
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning Chia-Wen Kuo, Zsolt Kira
PDF
Beyond Cross-View Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image Yujiao Shi, Hongdong Li
PDF
Beyond Fixation: Dynamic Window Visual Transformer Pengzhen Ren, Changlin Li, Guangrun Wang, Yun Xiao, Qing Du, Xiaodan Liang, Xiaojun Chang
PDF
Beyond Semantic to Instance Segmentation: Weakly-Supervised Instance Segmentation via Semantic Knowledge Transfer and Self-Refinement Beomyoung Kim, YoungJoon Yoo, Chae Eun Rhee, Junmo Kim
PDF
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning Matthew Gwilliam, Abhinav Shrivastava
PDF
Bi-Directional Object-Context Prioritization Learning for Saliency Ranking Xin Tian, Ke Xu, Xin Yang, Lin Du, Baocai Yin, Rynson W.H. Lau
PDF
Bi-Level Alignment for Cross-Domain Crowd Counting Shenjian Gong, Shanshan Zhang, Jian Yang, Dengxin Dai, Bernt Schiele
PDF
Bi-Level Doubly Variational Learning for Energy-Based Latent Variable Models Ge Kan, Jinhu Lü, Tian Wang, Baochang Zhang, Aichun Zhu, Lei Huang, Guodong Guo, Hichem Snoussi
PDF
BigDatasetGAN: Synthesizing ImageNet with Pixel-Wise Annotations Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Sanja Fidler, Antonio Torralba
PDF
BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster Jason Dai, Ding Ding, Dongjie Shi, Shengsheng Huang, Jiao Wang, Xin Qiu, Kai Huang, Guoqiong Song, Yang Wang, Qiyuan Gong, Jiaming Song, Shan Yu, Le Zheng, Yina Chen, Junwei Deng, Ge Song
PDF
Bijective Mapping Network for Shadow Removal Yurui Zhu, Jie Huang, Xueyang Fu, Feng Zhao, Qibin Sun, Zheng-Jun Zha
PDF
Bilateral Video Magnification Filter Shoichiro Takeda, Kenta Niwa, Mariko Isogawa, Shinya Shimizu, Kazuki Okami, Yushi Aono
PDF
Blended Diffusion for Text-Driven Editing of Natural Images Omri Avrahami, Dani Lischinski, Ohad Fried
PDF
Blind Face Restoration via Integrating Face Shape and Generative Priors Feida Zhu, Junwei Zhu, Wenqing Chu, Xinyi Zhang, Xiaozhong Ji, Chengjie Wang, Ying Tai
PDF
Blind Image Super-Resolution with Elaborate Degradation Modeling on Noise and Kernel Zongsheng Yue, Qian Zhao, Jianwen Xie, Lei Zhang, Deyu Meng, Kwan-Yee K. Wong
PDF
Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots Zejin Wang, Jiazheng Liu, Guoqing Li, Hua Han
PDF
Block-NeRF: Scalable Large Scene Neural View Synthesis Matthew Tancik, Vincent Casser, Xinchen Yan, Sabeek Pradhan, Ben Mildenhall, Pratul P. Srinivasan, Jonathan T. Barron, Henrik Kretzschmar
PDF
BNUDC: A Two-Branched Deep Neural Network for Restoring Images from Under-Display Cameras Jaihyun Koh, Jangho Lee, Sungroh Yoon
PDF
BNV-Fusion: Dense 3D Reconstruction Using Bi-Level Neural Volume Fusion Kejie Li, Yansong Tang, Victor Adrian Prisacariu, Philip H.S. Torr
PDF
BodyGAN: General-Purpose Controllable Neural Human Body Generation Chaojie Yang, Hanhui Li, Shengjie Wu, Shengkai Zhang, Haonan Yan, Nianhong Jiao, Jie Tang, Runnan Zhou, Xiaodan Liang, Tianxiang Zheng
PDF
BodyMap: Learning Full-Body Dense Correspondence mAP Anastasia Ianina, Nikolaos Sarafianos, Yuanlu Xu, Ignacio Rocco, Tony Tung
PDF
BokehMe: When Neural Rendering Meets Classical Rendering Juewen Peng, Zhiguo Cao, Xianrui Luo, Hao Lu, Ke Xian, Jianming Zhang
PDF
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions Huaizu Jiang, Xiaojian Ma, Weili Nie, Zhiding Yu, Yuke Zhu, Anima Anandkumar
PDF
BoosterNet: Improving Domain Generalization of Deep Neural Nets Using Culpability-Ranked Features Nourhan Bayasi, Ghassan Hamarneh, Rafeef Garbi
PDF
Boosting 3D Object Detection by Simulating Multimodality on Point Clouds Wu Zheng, Mingxuan Hong, Li Jiang, Chi-Wing Fu
PDF
Boosting Black-Box Attack with Partially Transferred Conditional Adversarial Distribution Yan Feng, Baoyuan Wu, Yanbo Fan, Li Liu, Zhifeng Li, Shu-Tao Xia
PDF
Boosting Crowd Counting via Multifaceted Attention Hui Lin, Zhiheng Ma, Rongrong Ji, Yaowei Wang, Xiaopeng Hong
PDF
Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation Yutong Dai, Brian Price, He Zhang, Chunhua Shen
PDF
Boosting View Synthesis with Residual Transfer Xuejian Rong, Jia-Bin Huang, Ayush Saraf, Changil Kim, Johannes Kopf
PDF
BoostMIS: Boosting Medical Image Semi-Supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation Wenqiao Zhang, Lei Zhu, James Hallinan, Shengyu Zhang, Andrew Makmur, Qingpeng Cai, Beng Chin Ooi
PDF
Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-Training Haofei Zhang, Jiarui Duan, Mengqi Xue, Jie Song, Li Sun, Mingli Song
PDF
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding Xianzheng Ma, Zhixiang Wang, Yacheng Zhan, Yinqiang Zheng, Zheng Wang, Dengxin Dai, Chia-Wen Lin
PDF
Bounded Adversarial Attack on Deep Content Features Qiuling Xu, Guanhong Tao, Xiangyu Zhang
PDF
BoxeR: Box-Attention for 2D and 3D Transformers Duy-Kien Nguyen, Jihong Ju, Olaf Booij, Martin R. Oswald, Cees G. M. Snoek
PDF
BppAttack: Stealthy and Efficient Trojan Attacks Against Deep Neural Networks via Image Quantization and Contrastive Adversarial Learning Zhenting Wang, Juan Zhai, Shiqing Ma
PDF
Brain-Inspired Multilayer Perceptron with Spiking Neurons Wenshuo Li, Hanting Chen, Jianyuan Guo, Ziyang Zhang, Yunhe Wang
PDF
Brain-Supervised Image Editing Keith M. Davis Iii, Carlos de la Torre-Ortiz, Tuukka Ruotsalo
PDF
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos Muheng Li, Lei Chen, Yueqi Duan, Zhilan Hu, Jianjiang Feng, Jie Zhou, Jiwen Lu
PDF
Bridged Transformer for Vision and Point Cloud 3D Object Detection Yikai Wang, TengQi Ye, Lele Cao, Wenbing Huang, Fuchun Sun, Fengxiang He, Dacheng Tao
PDF
Bridging Global Context Interactions for High-Fidelity Image Completion Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai, Dinh Phung
PDF
Bridging the Gap Between Classification and Localization for Weakly Supervised Object Localization Eunji Kim, Siwon Kim, Jungbeom Lee, Hyunwoo Kim, Sungroh Yoon
PDF
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation Yicong Hong, Zun Wang, Qi Wu, Stephen Gould
PDF
Bridging Video-Text Retrieval with Multiple Choice Questions Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo
PDF
Bring Evanescent Representations to Life in Lifelong Class Incremental Learning Marco Toldo, Mete Ozay
PDF
Bringing Old Films Back to Life Ziyu Wan, Bo Zhang, Dongdong Chen, Jing Liao
PDF
BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild Xixi Xu, Zhongang Qi, Jianqi Ma, Honglun Zhang, Ying Shan, Xiaohu Qie
PDF
Burst Image Restoration and Enhancement Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang
PDF
C-CAM: Causal CAM for Weakly Supervised Semantic Segmentation on Medical Image Zhang Chen, Zhiqiang Tian, Jihua Zhu, Ce Li, Shaoyi Du
PDF
C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection Tong Wang, Yousong Zhu, Yingying Chen, Chaoyang Zhao, Bin Yu, Jinqiao Wang, Ming Tang
PDF
C2AM: Contrastive Learning of Class-Agnostic Activation mAP for Weakly Supervised Object Localization and Semantic Segmentation Jinheng Xie, Jianfeng Xiang, Junliang Chen, Xianxu Hou, Xiaodong Zhao, Linlin Shen
PDF
C2SLR: Consistency-Enhanced Continuous Sign Language Recognition Ronglai Zuo, Brian Mak
PDF
CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification Philip Chikontwe, Soopil Kim, Sang Hyun Park
PDF
CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism Jiahui Lei, Kostas Daniilidis
PDF
CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings Zhiwen Fan, Tianlong Chen, Peihao Wang, Zhangyang Wang
PDF
CAFE: Learning to Condense Dataset by Aligning Features Kai Wang, Bo Zhao, Xiangyu Peng, Zheng Zhu, Shuo Yang, Shuo Wang, Guan Huang, Hakan Bilen, Xinchao Wang, Yang You
PDF
Calibrating Deep Neural Networks by Pairwise Constraints Jiacheng Cheng, Nuno Vasconcelos
PDF
Camera Pose Estimation Using Implicit Distortion Models Linfei Pan, Marc Pollefeys, Viktor Larsson
PDF
Camera-Conditioned Stable Feature Generation for Isolated Camera Supervised Person Re-IDentification Chao Wu, Wenhang Ge, Ancong Wu, Xiaobin Chang
PDF
CamLiFlow: Bidirectional Camera-LiDAR Fusion for Joint Optical Flow and Scene Flow Estimation Haisong Liu, Tao Lu, Yihui Xu, Jia Liu, Wenjie Li, Lijun Chen
PDF
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective Gowthami Somepalli, Liam Fowl, Arpit Bansal, Ping Yeh-Chiang, Yehuda Dar, Richard Baraniuk, Micah Goldblum, Tom Goldstein
PDF
Can You Spot the Chameleon? Adversarially Camouflaging Images from Co-Salient Object Detection Ruijun Gao, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Huazhu Fu, Wei Feng, Yang Liu, Song Wang
PDF
Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos Sukjun Hwang, Miran Heo, Seoung Wug Oh, Seon Joo Kim
PDF
Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes Yang You, Zelin Ye, Yujing Lou, Chengkun Li, Yong-Lu Li, Lizhuang Ma, Weiming Wang, Cewu Lu
PDF
CAPRI-Net: Learning Compact CAD Shapes with Adaptive Primitive Assembly Fenggen Yu, Zhiqin Chen, Manyi Li, Aditya Sanghi, Hooman Shayani, Ali Mahdavi-Amiri, Hao Zhang
PDF
Capturing and Inferring Dense Full-Body Human-Scene Contact Chun-Hao P. Huang, Hongwei Yi, Markus Höschle, Matvey Safroshkin, Tsvetelina Alexiadis, Senya Polikovsky, Daniel Scharstein, Michael J. Black
PDF
Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video Wen-Li Wei, Jen-Chun Lin, Tyng-Luh Liu, Hong-Yuan Mark Liao
PDF
Cascade Transformers for End-to-End Person Search Rui Yu, Dawei Du, Rodney LaLonde, Daniel Davila, Christopher Funk, Anthony Hoogs, Brian Clipp
PDF
CAT-Det: Contrastively Augmented Transformer for Multi-Modal 3D Object Detection Yanan Zhang, Jiaxin Chen, Di Huang
PDF
Catching Both Gray and Black Swans: Open-Set Supervised Anomaly Detection Choubo Ding, Guansong Pang, Chunhua Shen
PDF
Category Contrast for Unsupervised Domain Adaptation in Visual Tasks Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu, Ling Shao
PDF
Category-Aware Transformer Network for Better Human-Object Interaction Detection Leizhen Dong, Zhimin Li, Kunlun Xu, Zhijun Zhang, Luxin Yan, Sheng Zhong, Xu Zou
PDF
Causal Transportability for Visual Recognition Chengzhi Mao, Kevin Xia, James Wang, Hao Wang, Junfeng Yang, Elias Bareinboim, Carl Vondrick
PDF
Causality Inspired Representation Learning for Domain Generalization Fangrui Lv, Jian Liang, Shuang Li, Bin Zang, Chi Harold Liu, Ziteng Wang, Di Liu
PDF
CD2-pFed: Cyclic Distillation-Guided Channel Decoupling for Model Personalization in Federated Learning Yiqing Shen, Yuyin Zhou, Lequan Yu
PDF
CDGNet: Class Distribution Guided Network for Human Parsing Kunliang Liu, Ouk Choi, Jianming Wang, Wonjun Hwang
PDF
CellTypeGraph: A New Geometric Computer Vision Benchmark Lorenzo Cerrone, Athul Vijayan, Tejasvinee Mody, Kay Schneitz, Fred A. Hamprecht
PDF
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing Xiaoxue Chen, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
PDF
Certified Patch Robustness via Smoothed Vision Transformers Hadi Salman, Saachi Jain, Eric Wong, Aleksander Madry
PDF
Channel Balancing for Accurate Quantization of Winograd Convolutions Vladimir Chikin, Vladimir Kryzhanovskiy
PDF
CHEX: CHannel EXploration for CNN Model Compression Zejiang Hou, Minghai Qin, Fei Sun, Xiaolong Ma, Kun Yuan, Yi Xu, Yen-Kuang Chen, Rong Jin, Yuan Xie, Sun-Yuan Kung
PDF
Chitransformer: Towards Reliable Stereo from Cues Qing Su, Shihao Ji
PDF
Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation Zhaozheng Chen, Tan Wang, Xiongwei Wu, Xian-Sheng Hua, Hanwang Zhang, Qianru Sun
PDF
Class Similarity Weighted Knowledge Distillation for Continual Semantic Segmentation Minh Hieu Phan, The-Anh Ta, Son Lam Phung, Long Tran-Thanh, Abdesselam Bouzerdoum
PDF
Class-Aware Contrastive Semi-Supervised Learning Fan Yang, Kai Wu, Shuyi Zhang, Guannan Jiang, Yong Liu, Feng Zheng, Wei Zhang, Chengjie Wang, Long Zeng
PDF
Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation Ruihuang Li, Shuai Li, Chenhang He, Yabin Zhang, Xu Jia, Lei Zhang
PDF
Class-Incremental Learning by Knowledge Distillation with Adaptive Feature Consolidation Minsoo Kang, Jaeyoo Park, Bohyung Han
PDF
Class-Incremental Learning with Strong Pre-Trained Models Tz-Ying Wu, Gurumurthy Swaminathan, Zhizhong Li, Avinash Ravichandran, Nuno Vasconcelos, Rahul Bhotika, Stefano Soatto
PDF
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs Kaifeng Gao, Long Chen, Yulei Niu, Jian Shao, Jun Xiao
PDF
Clean Implicit 3D Structure from Noisy 2D STEM Images Hannah Kniesel, Timo Ropinski, Tim Bergner, Kavitha Shaga Devan, Clarissa Read, Paul Walther, Tobias Ritschel, Pedro Hermosilla
PDF
CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation Jinheng Xie, Xianxu Hou, Kai Ye, Linlin Shen
PDF
CLIP-Event: Connecting Text and Images with Event Structures Manling Li, Ruochen Xu, Shuohang Wang, Luowei Zhou, Xudong Lin, Chenguang Zhu, Michael Zeng, Heng Ji, Shih-Fu Chang
PDF
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation Aditya Sanghi, Hang Chu, Joseph G. Lambourne, Ye Wang, Chin-Yi Cheng, Marco Fumero, Kamal Rahimi Malekshan
PDF
CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields Can Wang, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao
PDF
Clipped Hyperbolic Classifiers Are Super-Hyperbolic Classifiers Yunhui Guo, Xudong Wang, Yubei Chen, Stella X. Yu
PDF
CLIPstyler: Image Style Transfer with a Single Text Condition Gihyun Kwon, Jong Chul Ye
PDF
Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification Yanan Wang, Xuezhi Liang, Shengcai Liao
PDF
Closing the Generalization Gap of Cross-Silo Federated Medical Image Segmentation An Xu, Wenqi Li, Pengfei Guo, Dong Yang, Holger R. Roth, Ali Hatamizadeh, Can Zhao, Daguang Xu, Heng Huang, Ziyue Xu
PDF
Cloth-Changing Person Re-Identification from a Single Image with Gait Prediction and Regularization Xin Jin, Tianyu He, Kecheng Zheng, Zhiheng Yin, Xu Shen, Zhen Huang, Ruoyu Feng, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua
PDF
Clothes-Changing Person Re-Identification with RGB Modality Only Xinqian Gu, Hong Chang, Bingpeng Ma, Shutao Bai, Shiguang Shan, Xilin Chen
PDF
ClothFormer: Taming Video Virtual Try-on in All Module Jianbin Jiang, Tan Wang, He Yan, Junhui Liu
PDF
CLRNet: Cross Layer Refinement Network for Lane Detection Tu Zheng, Yifei Huang, Yang Liu, Wenjian Tang, Zheng Yang, Deng Cai, Xiaofei He
PDF
Cluster-Guided Image Synthesis with Unconditional Models Markos Georgopoulos, James Oldfield, Grigorios G. Chrysos, Yannis Panagakis
PDF
ClusterGNN: Cluster-Based Coarse-to-Fine Graph Neural Network for Efficient Feature Matching Yan Shi, Jun-Xiong Cai, Yoli Shavit, Tai-Jiang Mu, Wensen Feng, Kai Zhang
PDF
Clustering Plotted Data by Image Segmentation Tarek Naous, Srinjay Sarkar, Abubakar Abid, James Zou
PDF
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen
PDF
CMT: Convolutional Neural Networks Meet Vision Transformers Jianyuan Guo, Kai Han, Han Wu, Yehui Tang, Xinghao Chen, Yunhe Wang, Chang Xu
PDF
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters Paul Gavrikov, Janis Keuper
PDF
Co-Advise: Cross Inductive Bias Distillation Sucheng Ren, Zhengqi Gao, Tianyu Hua, Zihui Xue, Yonglong Tian, Shengfeng He, Hang Zhao
PDF
Co-Domain Symmetry for Complex-Valued Deep Learning Utkarsh Singhal, Yifei Xing, Stella X. Yu
PDF
CO-SNE: Dimensionality Reduction and Visualization for Hyperbolic Data Yunhui Guo, Haoran Guo, Stella X. Yu
PDF
COAP: Compositional Articulated Occupancy of People Marko Mihajlovic, Shunsuke Saito, Aayush Bansal, Michael Zollhöfer, Siyu Tang
PDF
Coarse-to-Fine Deep Video Coding with Hyperprior-Guided Mode Prediction Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu
PDF
Coarse-to-Fine Feature Mining for Video Semantic Segmentation Guolei Sun, Yun Liu, Henghui Ding, Thomas Probst, Luc Van Gool
PDF
Coarse-to-Fine Q-Attention: Efficient Learning for Visual Robotic Manipulation via Discretisation Stephen James, Kentaro Wada, Tristan Laidlow, Andrew J. Davison
PDF
CodedVTR: Codebook-Based Sparse Voxel Transformer with Geometric Guidance Tianchen Zhao, Niansong Zhang, Xuefei Ning, He Wang, Li Yi, Yu Wang
PDF
Coherent Point Drift Revisited for Non-Rigid Shape Matching and Registration Aoxiang Fan, Jiayi Ma, Xin Tian, Xiaoguang Mei, Wei Liu
PDF
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars Le Yang, Junwei Han, Dingwen Zhang
PDF
Collaborative Learning for Hand and Object Reconstruction with Attention-Guided Graph Convolution Tze Ho Elden Tse, Kwang In Kim, Ales̆ Leonardis, Hyung Jin Chang
PDF
Collaborative Transformers for Grounded Situation Recognition Junhyeong Cho, Youngseok Yoon, Suha Kwak
PDF
Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems Through Stochastic Contraction Hyungjin Chung, Byeongsu Sim, Jong Chul Ye
PDF
Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-Free Synthetic Data Kyungjune Baek, Hyunjung Shim
PDF
Comparing Correspondences: Video Prediction with Correspondence-Wise Losses Daniel Geng, Max Hamilton, Andrew Owens
PDF
Complex Backdoor Detection by Symmetric Feature Differencing Yingqi Liu, Guangyu Shen, Guanhong Tao, Zhenting Wang, Shiqing Ma, Xiangyu Zhang
PDF
Complex Video Action Reasoning via Learnable Markov Logic Network Yang Jin, Linchao Zhu, Yadong Mu
PDF
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning Juncheng Li, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang
PDF
Compound Domain Generalization via Meta-Knowledge Encoding Chaoqi Chen, Jiongcheng Li, Xiaoguang Han, Xiaoqing Liu, Yizhou Yu
PDF
Comprehending and Ordering Semantics for Image Captioning Yehao Li, Yingwei Pan, Ting Yao, Tao Mei
PDF
Compressing Models with Few Samples: Mimicking Then Replacing Huanyu Wang, Junjie Liu, Xin Ma, Yang Yong, Zhenhua Chai, Jianxin Wu
PDF
Compressive Single-Photon 3D Cameras Felipe Gutierrez-Barragan, Atul Ingle, Trevor Seets, Mohit Gupta, Andreas Velten
PDF
Computing Wasserstein-P Distance Between Images with Linear Cost Yidong Chen, Chen Li, Zhonghua Lu
PDF
Condensing CNNs with Partial Differential Equations Anil Kag, Venkatesh Saligrama
PDF
Conditional Prompt Learning for Vision-Language Models Kaiyang Zhou, Jingkang Yang, Chen Change Loy, Ziwei Liu
PDF
ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes Rahul Sajnani, Adrien Poulenard, Jivitesh Jain, Radhika Dua, Leonidas J. Guibas, Srinath Sridhar
PDF
CoNeRF: Controllable Neural Radiance Fields Kacper Kania, Kwang Moo Yi, Marek Kowalski, Tomasz Trzciński, Andrea Tagliasacchi
PDF
Confidence Propagation Cluster: Unleash Full Potential of Object Detectors Yichun Shen, Wanli Jiang, Zhen Xu, Rundong Li, Junghyun Kwon, Siyi Li
PDF
Connecting the Complementary-View Videos: Joint Camera Identification and Subject Association Ruize Han, Yiyang Gan, Jiacheng Li, Feifan Wang, Wei Feng, Song Wang
PDF
Consistency Driven Sequential Transformers Attention Model for Partially Observable Scenes Samrudhdhi B. Rangrej, Chetan L. Srinidhi, James J. Clark
PDF
Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection Jihwan Park, SeungJun Lee, Hwan Heo, Hyeong Kyu Choi, Hyunwoo J. Kim
PDF
Consistent Explanations by Contrastive Learning Vipin Pillai, Soroush Abbasi Koohpayegani, Ashley Ouligian, Dennis Fong, Hamed Pirsiavash
PDF
Constrained Few-Shot Class-Incremental Learning Michael Hersche, Geethan Karunaratne, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi
PDF
Context-Aware Sequence Alignment Using 4D Skeletal Augmentation Taein Kwon, Bugra Tekin, Siyu Tang, Marc Pollefeys
PDF
Context-Aware Video Reconstruction for Rolling Shutter Cameras Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He
PDF
Contextual Debiasing for Visual Recognition with Causal Mechanisms Ruyang Liu, Hao Liu, Ge Li, Haodi Hou, TingHao Yu, Tao Yang
PDF
Contextual Instance Decoupling for Robust Multi-Person Pose Estimation Dongkai Wang, Shiliang Zhang
PDF
Contextual Outpainting with Object-Level Contrastive Learning Jiacheng Li, Chang Chen, Zhiwei Xiong
PDF
Contextual Similarity Distillation for Asymmetric Image Retrieval Hui Wu, Min Wang, Wengang Zhou, Houqiang Li, Qi Tian
PDF
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu
PDF
ContIG: Self-Supervised Multimodal Contrastive Learning for Medical Imaging with Genetics Aiham Taleb, Matthias Kirchler, Remo Monti, Christoph Lippert
PDF
Continual Learning for Visual Search with Backward Consistent Feature Embedding Timmy S. T. Wan, Jun-Cheng Chen, Tzer-Yi Wu, Chu-Song Chen
PDF
Continual Learning with Lifelong Vision Transformer Zhen Wang, Liu Liu, Yiqun Duan, Yajing Kong, Dacheng Tao
PDF
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism Binbin Yang, Xinchi Deng, Han Shi, Changlin Li, Gengwei Zhang, Hang Xu, Shen Zhao, Liang Lin, Xiaodan Liang
PDF
Continual Predictive Learning from Videos Geng Chen, Wendong Zhang, Han Lu, Siyu Gao, Yunbo Wang, Mingsheng Long, Xiaokang Yang
PDF
Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture Chenghao Zhang, Kun Tian, Bin Fan, Gaofeng Meng, Zhaoxiang Zhang, Chunhong Pan
PDF
Continual Test-Time Domain Adaptation Qin Wang, Olga Fink, Luc Van Gool, Dengxin Dai
PDF
Continuous Scene Representations for Embodied AI Samir Yitzhak Gadre, Kiana Ehsani, Shuran Song, Roozbeh Mottaghi
PDF
Contour-Hugging Heatmaps for Landmark Detection James McCouat, Irina Voiculescu
PDF
Contrastive Boundary Learning for Point Cloud Segmentation Liyao Tang, Yibing Zhan, Zhe Chen, Baosheng Yu, Dacheng Tao
PDF
Contrastive Conditional Neural Processes Zesheng Ye, Lina Yao
PDF
Contrastive Dual Gating: Learning Sparse Features with Contrastive Learning Jian Meng, Li Yang, Jinwoo Shin, Deliang Fan, Jae-sun Seo
PDF
Contrastive Learning for Space-Time Correspondence via Self-Cycle Consistency Jeany Son
PDF
Contrastive Learning for Unsupervised Video Highlight Detection Taivanbat Badamdorj, Mrigank Rochan, Yang Wang, Li Cheng
PDF
Contrastive Regression for Domain Adaptation on Gaze Estimation Yaoming Wang, Yangzhou Jiang, Jin Li, Bingbing Ni, Wenrui Dai, Chenglin Li, Hongkai Xiong, Teng Li
PDF
Contrastive Test-Time Adaptation Dian Chen, Dequan Wang, Trevor Darrell, Sayna Ebrahimi
PDF
ContrastMask: Contrastive Learning to Segment Every Thing Xuehui Wang, Kai Zhao, Ruixin Zhang, Shouhong Ding, Yan Wang, Wei Shen
PDF
Controllable Animation of Fluid Elements in Still Images Aniruddha Mahapatra, Kuldeep Kulkarni
PDF
Controllable Dynamic Multi-Task Architectures Dripta S. Raychaudhuri, Yumin Suh, Samuel Schulter, Xiang Yu, Masoud Faraki, Amit K. Roy-Chowdhury, Manmohan Chandraker
PDF
Convolution of Convolution: Let Kernels Spatially Collaborate Rongzhen Zhao, Jian Li, Zhenzhi Wu
PDF
Convolutions for Spatial Interaction Modeling Zhaoen Su, Chao Wang, David Bradley, Carlos Vallespi-Gonzalez, Carl Wellington, Nemanja Djuric
PDF
Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles Jiaxun Cui, Hang Qiu, Dian Chen, Peter Stone, Yuke Zhu
PDF
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs Jiteng Mu, Shalini De Mello, Zhiding Yu, Nuno Vasconcelos, Xiaolong Wang, Jan Kautz, Sifei Liu
PDF
Correlation Verification for Image Retrieval Seongwon Lee, Hongje Seong, Suhyeon Lee, Euntai Kim
PDF
Correlation-Aware Deep Tracking Fei Xie, Chunyu Wang, Guangting Wang, Yue Cao, Wankou Yang, Wenjun Zeng
PDF
CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning Yue Fan, Dengxin Dai, Anna Kukleva, Bernt Schiele
PDF
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval Haoyu Lu, Nanyi Fei, Yuqi Huo, Yizhao Gao, Zhiwu Lu, Ji-Rong Wen
PDF
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation Hanqing Wang, Wei Liang, Jianbing Shen, Luc Van Gool, Wenguan Wang
PDF
Coupled Iterative Refinement for 6d Multi-Object Pose Estimation Lahav Lipson, Zachary Teed, Ankit Goyal, Jia Deng
PDF
Coupling Vision and Proprioception for Navigation of Legged Robots Zipeng Fu, Ashish Kumar, Ananye Agarwal, Haozhi Qi, Jitendra Malik, Deepak Pathak
PDF
CPPF: Towards Robust Category-Level 9d Pose Estimation in the Wild Yang You, Ruoxi Shi, Weiming Wang, Cewu Lu
PDF
CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow Xiuchao Sui, Shaohua Li, Xue Geng, Yan Wu, Xinxing Xu, Yong Liu, Rick Goh, Hongyuan Zhu
PDF
Crafting Better Contrastive Views for Siamese Representation Learning Xiangyu Peng, Kai Wang, Zheng Zhu, Mang Wang, Yang You
PDF
CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Rui-Wei Zhao, Tao Zhang, Xuequan Lu, Shang Gao
PDF
CRIS: CLIP-Driven Referring Image Segmentation Zhaoqing Wang, Yu Lu, Qiang Li, Xunqiang Tao, Yandong Guo, Mingming Gong, Tongliang Liu
PDF
Critical Regularizations for Neural Surface Reconstruction in the Wild Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan
PDF
CroMo: Cross-Modal Learning for Monocular Depth Estimation Yannick Verdié, Jifei Song, Barnabé Mas, Benjamin Busam, Ales̆ Leonardis, Steven McDonagh
PDF
Cross Domain Object Detection by Target-Perceived Dual Branch Distillation Mengzhe He, Yali Wang, Jiaxi Wu, Yiru Wang, Hanqing Li, Bo Li, Weihao Gan, Wei Wu, Yu Qiao
PDF
Cross Modal Retrieval with Querybank Normalisation Simion-Vlad Bogolin, Ioana Croitoru, Hailin Jin, Yang Liu, Samuel Albanie
PDF
Cross-Architecture Self-Supervised Video Representation Learning Sheng Guo, Zihua Xiong, Yujie Zhong, Limin Wang, Xiaobo Guo, Bing Han, Weilin Huang
PDF
Cross-Domain Adaptive Teacher for Object Detection Yu-Jhe Li, Xiaoliang Dai, Chih-Yao Ma, Yen-Cheng Liu, Kan Chen, Bichen Wu, Zijian He, Kris Kitani, Peter Vajda
PDF
Cross-Domain Correlation Distillation for Unsupervised Domain Adaptation in Nighttime Semantic Segmentation Huan Gao, Jichang Guo, Guoli Wang, Qian Zhang
PDF
Cross-Domain Few-Shot Learning with Task-Specific Adapters Wei-Hong Li, Xialei Liu, Hakan Bilen
PDF
Cross-Image Relational Knowledge Distillation for Semantic Segmentation Chuanguang Yang, Helong Zhou, Zhulin An, Xue Jiang, Yongjun Xu, Qian Zhang
PDF
Cross-Modal Background Suppression for Audio-Visual Event Localization Yan Xia, Zhou Zhao
PDF
Cross-Modal Clinical Graph Transformer for Ophthalmic Report Generation Mingjie Li, Wenjia Cai, Karin Verspoor, Shirui Pan, Xiaodan Liang, Xiaojun Chang
PDF
Cross-Modal mAP Learning for Vision and Language Navigation Georgios Georgakis, Karl Schmeckpeper, Karan Wanchoo, Soham Dan, Eleni Miltsakaki, Dan Roth, Kostas Daniilidis
PDF
Cross-Modal Perceptionist: Can Face Geometry Be Gleaned from Voices? Cho-Ying Wu, Chin-Cheng Hsu, Ulrich Neumann
PDF
Cross-Modal Representation Learning for Zero-Shot Action Recognition Chung-Ching Lin, Kevin Lin, Lijuan Wang, Zicheng Liu, Linjie Li
PDF
Cross-Modal Transferable Adversarial Attacks from Images to Videos Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
PDF
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, Stephen Lin
PDF
Cross-Patch Dense Contrastive Learning for Semi-Supervised Segmentation of Cellular Nuclei in Histopathologic Images Huisi Wu, Zhaoze Wang, Youyi Song, Lin Yang, Jing Qin
PDF
Cross-View Transformers for Real-Time mAP-View Semantic Segmentation Brady Zhou, Philipp Krähenbühl
PDF
CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data Qi Yan, Jianhao Zheng, Simon Reding, Shanci Li, Iordan Doytchinov
PDF
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding Mohamed Afham, Isuru Dissanayake, Dinithi Dissanayake, Amaya Dharmasiri, Kanchana Thilakarathna, Ranga Rodrigo
PDF
Crowd Counting in the Frequency Domain Weibo Shu, Jia Wan, Kay Chen Tan, Sam Kwong, Antoni B. Chan
PDF
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Weiming Zhang, Nenghai Yu, Lu Yuan, Dong Chen, Baining Guo
PDF
CVF-SID: Cyclic Multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee
PDF
CVNet: Contour Vibration Network for Building Extraction Ziqiang Xu, Chunyan Xu, Zhen Cui, Xiangwei Zheng, Jian Yang
PDF
Cycle-Consistent Counterfactuals by Latent Transformations Saeed Khorram, Li Fuxin
PDF
CycleMix: A Holistic Strategy for Medical Image Segmentation from Scribble Supervision Ke Zhang, Xiahai Zhuang
PDF
D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions Sammy Christen, Muhammed Kocabas, Emre Aksan, Jemin Hwangbo, Jie Song, Otmar Hilliges
PDF
DAD-3DHeads: A Large-Scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image Tetiana Martyniuk, Orest Kupyn, Yana Kurlyak, Igor Krashenyi, Jiří Matas, Viktoriia Sharmanska
PDF
DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation Lukas Hoyer, Dengxin Dai, Luc Van Gool
PDF
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection Haibao Yu, Yizhen Luo, Mao Shu, Yiyi Huo, Zebang Yang, Yifeng Shi, Zhenglong Guo, Hanyu Li, Xing Hu, Jirui Yuan, Zaiqing Nie
PDF
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion Peize Sun, Jinkun Cao, Yi Jiang, Zehuan Yuan, Song Bai, Kris Kitani, Ping Luo
PDF
Dancing Under the Stars: Video Denoising in Starlight Kristina Monakhova, Stephan R. Richter, Laura Waller, Vladlen Koltun
PDF
DArch: Dental Arch Prior-Assisted 3D Tooth Instance Segmentation with Weak Annotations Liangdong Qiu, Chongjie Ye, Pei Chen, Yunbi Liu, Xiaoguang Han, Shuguang Cui
PDF
DASO: Distribution-Aware Semantics-Oriented Pseudo-Label for Imbalanced Semi-Supervised Learning Youngtaek Oh, Dong-Jin Kim, In So Kweon
PDF
Data-Free Network Compression via Parametric Non-Uniform Mixed Precision Quantization Vladimir Chikin, Mikhail Antiukh
PDF
DATA: Domain-Aware and Task-Aware Self-Supervised Learning Qing Chang, Junran Peng, Lingxi Xie, Jiajun Sun, Haoran Yin, Qi Tian, Zhaoxiang Zhang
PDF
Dataset Distillation by Matching Training Trajectories George Cazenavette, Tongzhou Wang, Antonio Torralba, Alexei A. Efros, Jun-Yan Zhu
PDF
Day-to-Night Image Synthesis for Training Nighttime Neural ISPs Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. Brown
PDF
DC-SSL: Addressing Mismatched Class Distribution in Semi-Supervised Learning Zhen Zhao, Luping Zhou, Yue Duan, Lei Wang, Lei Qi, Yinghuan Shi
PDF
De-Rendering 3D Objects in the Wild Felix Wimbauer, Shangzhe Wu, Christian Rupprecht
PDF
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers Xianing Chen, Qiong Cao, Yujie Zhong, Jing Zhang, Shenghua Gao, Dacheng Tao
PDF
Debiased Learning from Naturally Imbalanced Pseudo-Labels Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu
PDF
Deblur-NeRF: Neural Radiance Fields from Blurry Images Li Ma, Xiaoyu Li, Jing Liao, Qi Zhang, Xuan Wang, Jue Wang, Pedro V. Sander
PDF
Deblurring via Stochastic Refinement Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, Peyman Milanfar
PDF
DECORE: Deep Compression with Reinforcement Learning Manoj Alwani, Yang Wang, Vashisht Madhavan
PDF
Decoupled Knowledge Distillation Borui Zhao, Quan Cui, Renjie Song, Yiyu Qiu, Jiajun Liang
PDF
Decoupled Multi-Task Learning with Cyclical Self-Regulation for Face Parsing Qingping Zheng, Jiankang Deng, Zheng Zhu, Ying Li, Stefanos Zafeiriou
PDF
Decoupling and Recoupling Spatiotemporal Representation for RGB-D-Based Motion Recognition Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang, Du Zhang, Zhen Lei, Hao Li, Rong Jin
PDF
Decoupling Makes Weakly Supervised Local Feature Better Kunhong Li, Longguang Wang, Li Liu, Qing Ran, Kai Xu, Yulan Guo
PDF
Decoupling Zero-Shot Semantic Segmentation Jian Ding, Nan Xue, Gui-Song Xia, Dengxin Dai
PDF
DeeCap: Dynamic Early Exiting for Efficient Image Captioning Zhengcong Fei, Xu Yan, Shuhui Wang, Qi Tian
PDF
Deep 3D-to-2D Watermarking: Embedding Messages in 3D Meshes and Extracting Them from 2D Renderings Innfarn Yoo, Huiwen Chang, Xiyang Luo, Ondrej Stava, Ce Liu, Peyman Milanfar, Feng Yang
PDF
Deep Anomaly Discovery from Unlabeled Videos via Normality Advantage and Self-Paced Refinement Guang Yu, Siqi Wang, Zhiping Cai, Xinwang Liu, Chuanfu Xu, Chengkun Wu
PDF
Deep Color Consistent Network for Low-Light Image Enhancement Zhao Zhang, Huan Zheng, Richang Hong, Mingliang Xu, Shuicheng Yan, Meng Wang
PDF
Deep Constrained Least Squares for Blind Image Super-Resolution Ziwei Luo, Haibin Huang, Lei Yu, Youwei Li, Haoqiang Fan, Shuaicheng Liu
PDF
Deep Decomposition for Stochastic Normal-Abnormal Transport Peirong Liu, Yueh Lee, Stephen Aylward, Marc Niethammer
PDF
Deep Depth from Focus with Differential Focus Volume Fengting Yang, Xiaolei Huang, Zihan Zhou
PDF
Deep Equilibrium Optical Flow Estimation Shaojie Bai, Zhengyang Geng, Yash Savani, J. Zico Kolter
PDF
Deep Generalized Unfolding Networks for Image Restoration Chong Mou, Qian Wang, Jian Zhang
PDF
Deep Hierarchical Semantic Segmentation Liulei Li, Tianfei Zhou, Wenguan Wang, Jianwu Li, Yi Yang
PDF
Deep Hybrid Models for Out-of-Distribution Detection Senqi Cao, Zhongfei Zhang
PDF
Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection Chunyu Li, Yusuke Monno, Masatoshi Okutomi
PDF
Deep Image-Based Illumination Harmonization Zhongyun Bao, Chengjiang Long, Gang Fu, Daquan Liu, Yuanzhen Li, Jiaming Wu, Chunxia Xiao
PDF
Deep Orientation-Aware Functional Maps: Tackling Symmetry Issues in Shape Matching Nicolas Donati, Etienne Corman, Maks Ovsjanikov
PDF
Deep Rectangling for Image Stitching: A Learning Baseline Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao
PDF
Deep Safe Multi-View Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase Huayi Tang, Yong Liu
PDF
Deep Saliency Prior for Reducing Visual Distraction Kfir Aberman, Junfeng He, Yossi Gandelsman, Inbar Mosseri, David E. Jacobs, Kai Kohlhoff, Yael Pritch, Michael Rubinstein
PDF
Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization Luke Melas-Kyriazi, Christian Rupprecht, Iro Laina, Andrea Vedaldi
PDF
Deep Stereo Image Compression via Bi-Directional Coding Jianjun Lei, Xiangrui Liu, Bo Peng, Dengchao Jin, Wanqing Li, Jingxiao Gu
PDF
Deep Unlearning via Randomized Conditionally Independent Hessians Ronak Mehta, Sourav Pal, Vikas Singh, Sathya N. Ravi
PDF
Deep Vanishing Point Detection: Geometric Priors Make Dataset Variations Vanish Yancong Lin, Ruben Wiersma, Silvia L. Pintea, Klaus Hildebrandt, Elmar Eisemann, Jan C. van Gemert
PDF
Deep Visual Geo-Localization Benchmark Gabriele Berton, Riccardo Mereu, Gabriele Trivigno, Carlo Masone, Gabriela Csurka, Torsten Sattler, Barbara Caputo
PDF
DeepCurrents: Learning Implicit Representations of Shapes with Boundaries David Palmer, Dmitriy Smirnov, Stephanie Wang, Albert Chern, Justin Solomon
PDF
DeepDPM: Deep Clustering with an Unknown Number of Clusters Meitar Ronen, Shahaf E. Finder, Oren Freifeld
PDF
DeepFace-EMD: Re-Ranking Using Patch-Wise Earth Mover's Distance Improves Out-of-Distribution Face Identification Hai Phan, Anh Nguyen
PDF
DeepFake Disrupter: The Detector of DeepFake Is My Friend Xueyu Wang, Jiajun Huang, Siqi Ma, Surya Nepal, Chang Xu
PDF
DeepFusion: LiDAR-Camera Deep Fusion for Multi-Modal 3D Object Detection Yingwei Li, Adams Wei Yu, Tianjian Meng, Ben Caine, Jiquan Ngiam, Daiyi Peng, Junyang Shen, Yifeng Lu, Denny Zhou, Quoc V. Le, Alan Yuille, Mingxing Tan
PDF
DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides Parmida Ghahremani, Joseph Marino, Ricardo Dodds, Saad Nadeem
PDF
DEFEAT: Deep Hidden Feature Backdoor Attacks by Imperceptible Perturbation and Latent Representation Constraints Zhendong Zhao, Xiaojun Chen, Yuexin Xuan, Ye Dong, Dakui Wang, Kaitai Liang
PDF
Defensive Patches for Robust Recognition in the Physical World Jiakai Wang, Zixin Yin, Pengfei Hu, Aishan Liu, Renshuai Tao, Haotong Qin, Xianglong Liu, Dacheng Tao
PDF
Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable Prototypes Jon Donnelly, Alina Jade Barnett, Chaofan Chen
PDF
Deformable Sprites for Unsupervised Video Decomposition Vickie Ye, Zhengqi Li, Richard Tucker, Angjoo Kanazawa, Noah Snavely
PDF
Deformable Video Transformer Jue Wang, Lorenzo Torresani
PDF
Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds Zhao Jin, Yinjie Lei, Naveed Akhtar, Haifeng Li, Munawar Hayat
PDF
Degradation-Agnostic Correspondence from Resolution-Asymmetric Stereo Xihao Chen, Zhiwei Xiong, Zhen Cheng, Jiayong Peng, Yueyi Zhang, Zheng-Jun Zha
PDF
Degree-of-Linear-Polarization-Based Color Constancy Taishi Ono, Yuhi Kondo, Legong Sun, Teppei Kurita, Yusuke Moriuchi
PDF
DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos Mathias Parger, Chengcheng Tang, Christopher D. Twigg, Cem Keskin, Robert Wang, Markus Steinberger
PDF
Delving Deep into the Generalization of Vision Transformers Under Distribution Shifts Chongzhi Zhang, Mingyuan Zhang, Shanghang Zhang, Daisheng Jin, Qiang Zhou, Zhongang Cai, Haiyu Zhao, Xianglong Liu, Ziwei Liu
PDF
Delving into the Estimation Shift of Batch Normalization in a Network Lei Huang, Yi Zhou, Tian Wang, Jie Luo, Xianglong Liu
PDF
Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection Siyue Yu, Jimin Xiao, Bingfeng Zhang, Eng Gee Lim
PDF
Demystifying the Neural Tangent Kernel from a Practical Perspective: Can It Be Trusted for Neural Architecture Search Without Training? Jisoo Mok, Byunggook Na, Ji-Hoon Kim, Dongyoon Han, Sungroh Yoon
PDF
Dense Depth Priors for Neural Radiance Fields from Sparse Input Views Barbara Roessle, Jonathan T. Barron, Ben Mildenhall, Pratul P. Srinivasan, Matthias Nießner
PDF
Dense Learning Based Semi-Supervised Object Detection Binghui Chen, Pengyu Li, Xiang Chen, Biao Wang, Lei Zhang, Xian-Sheng Hua
PDF
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting Yongming Rao, Wenliang Zhao, Guangyi Chen, Yansong Tang, Zheng Zhu, Guan Huang, Jie Zhou, Jiwen Lu
PDF
Density-Preserving Deep Point Cloud Compression Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xiangyang Xue, Yanwei Fu
PDF
Depth Estimation by Combining Binocular Stereo and Monocular Structured-Light Yuhua Xu, Xiaoli Yang, Yushan Yu, Wei Jia, Zhaobi Chu, Yulan Guo
PDF
Depth-Aware Generative Adversarial Network for Talking Head Video Generation Fa-Ting Hong, Longhao Zhang, Li Shen, Dan Xu
PDF
Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows Sheng Liu, Xiaohan Nie, Raffay Hamid
PDF
Depth-Supervised NeRF: Fewer Views and Faster Training for Free Kangle Deng, Andrew Liu, Jun-Yan Zhu, Deva Ramanan
PDF
DESTR: Object Detection with Split Transformer Liqiang He, Sinisa Todorovic
PDF
Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution Jie Liang, Hui Zeng, Lei Zhang
PDF
Detecting Camouflaged Object in Frequency Domain Yijie Zhong, Bo Li, Lv Tang, Senyun Kuang, Shuang Wu, Shouhong Ding
PDF
Detecting Deepfakes with Self-Blended Images Kaede Shiohara, Toshihiko Yamasaki
PDF
Detector-Free Weakly Supervised Group Activity Recognition Dongkeun Kim, Jinsung Lee, Minsu Cho, Suha Kwak
PDF
DetectorDetective: Investigating the Effects of Adversarial Examples on Object Detectors Sivapriya Vellaichamy, Matthew Hull, Zijie J. Wang, Nilaksh Das, ShengYun Peng, Haekyu Park, Duen Horng Chau
PDF
Deterministic Point Cloud Registration via Novel Transformation Decomposition Wen Chen, Haoang Li, Qiang Nie, Yun-Hui Liu
PDF
DETReg: Unsupervised Pretraining with Region Priors for Object Detection Amir Bar, Xin Wang, Vadim Kantorov, Colorado J. Reed, Roei Herzig, Gal Chechik, Anna Rohrbach, Trevor Darrell, Amir Globerson
PDF
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis Ming Tao, Hao Tang, Fei Wu, Xiao-Yuan Jing, Bing-Kun Bao, Changsheng Xu
PDF
DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6d Pose Estimation Tuo Cao, Fei Luo, Yanping Fu, Wenxiao Zhang, Shengjie Zheng, Chunxia Xiao
PDF
Differentiable Dynamics for Articulated 3D Human Motion Reconstruction Erik Gärtner, Mykhaylo Andriluka, Erwin Coumans, Cristian Sminchisescu
PDF
Differentiable Stereopsis: Meshes from Multiple Views Using Differentiable Rendering Shubham Goel, Georgia Gkioxari, Jitendra Malik
PDF
Differentially Private Federated Learning with Local Regularization and Sparsification Anda Cheng, Peisong Wang, Xi Sheryl Zhang, Jian Cheng
PDF
DiffPoseNet: Direct Differentiable Camera Pose Estimation Chethan M. Parameshwara, Gokul Hari, Cornelia Fermüller, Nitin J. Sanket, Yiannis Aloimonos
PDF
Diffusion Autoencoders: Toward a Meaningful and Decodable Representation Konpat Preechakul, Nattanat Chatthee, Suttisak Wizadwongsa, Supasorn Suwajanakorn
PDF
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation Gwanghyun Kim, Taesung Kwon, Jong Chul Ye
PDF
DIFNet: Boosting Visual Information Flow for Image Captioning Mingrui Wu, Xuying Zhang, Xiaoshuai Sun, Yiyi Zhou, Chao Chen, Jiaxin Gu, Xing Sun, Rongrong Ji
PDF
DiGS: Divergence Guided Shape Implicit Neural Representation for Unoriented Point Clouds Yizhak Ben-Shabat, Chamin Hewa Koneputugodage, Stephen Gould
PDF
DiLiGenT102: A Photometric Stereo Benchmark Dataset with Controlled Shape and Material Variation Jieji Ren, Feishi Wang, Jiahao Zhang, Qian Zheng, Mingjun Ren, Boxin Shi
PDF
Dimension Embeddings for Monocular 3D Object Detection Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Dalong Du, Jie Zhou, Jiwen Lu
PDF
DINE: Domain Adaptation from Single and Multiple Black-Box Predictors Jian Liang, Dapeng Hu, Jiashi Feng, Ran He
PDF
DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow Zihua Zheng, Ni Nie, Zhi Ling, Pengfei Xiong, Jiangyu Liu, Hao Wang, Jiankun Li
PDF
DiRA: Discriminative, Restorative, and Adversarial Learning for Self-Supervised Medical Image Analysis Fatemeh Haghighi, Mohammad Reza Hosseinzadeh Taher, Michael B. Gotway, Jianming Liang
PDF
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition Thanh-Dat Truong, Quoc-Huy Bui, Chi Nhan Duong, Han-Seok Seo, Son Lam Phung, Xin Li, Khoa Luu
PDF
Direct Voxel Grid Optimization: Super-Fast Convergence for Radiance Fields Reconstruction Cheng Sun, Min Sun, Hwann-Tzong Chen
PDF
Directional Self-Supervised Learning for Heavy Image Augmentations Yalong Bai, Yifan Yang, Wei Zhang, Tao Mei
PDF
DisARM: Displacement Aware Relation Module for 3D Detection Yao Duan, Chenyang Zhu, Yuqing Lan, Renjiao Yi, Xinwang Liu, Kai Xu
PDF
Discovering Objects That Can Move Zhipeng Bao, Pavel Tokmakov, Allan Jabri, Yu-Xiong Wang, Adrien Gaidon, Martial Hebert
PDF
Discrete Cosine Transform Network for Guided Depth mAP Super-Resolution Zixiang Zhao, Jiangshe Zhang, Shuang Xu, Zudi Lin, Hanspeter Pfister
PDF
Discrete Time Convolution for Fast Event-Based Stereo Kaixuan Zhang, Kaiwei Che, Jianguo Zhang, Jie Cheng, Ziyang Zhang, Qinghai Guo, Luziwei Leng
PDF
Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images Ayush Tewari, B R Mallikarjun, Xingang Pan, Ohad Fried, Maneesh Agrawala, Christian Theobalt
PDF
Disentangling Visual and Written Concepts in CLIP Joanna Materzyńska, Antonio Torralba, David Bau
PDF
Disentangling Visual Embeddings for Attributes and Objects Nirat Saini, Khoi Pham, Abhinav Shrivastava
PDF
DiSparse: Disentangled Sparsification for Multitask Model Compression Xinglong Sun, Ali Hassani, Zhangyang Wang, Gao Huang, Humphrey Shi
PDF
Dist-PU: Positive-Unlabeled Learning from a Label Distribution Perspective Yunrui Zhao, Qianqian Xu, Yangbangyan Jiang, Peisong Wen, Qingming Huang
PDF
Distillation Using Oracle Queries for Transformer-Based Human-Object Interaction Detection Xian Qu, Changxing Ding, Xingao Li, Xubin Zhong, Dacheng Tao
PDF
Distinguishing Unseen from Seen for Generalized Zero-Shot Learning Hongzu Su, Jingjing Li, Zhi Chen, Lei Zhu, Ke Lu
PDF
Distribution Consistent Neural Architecture Search Junyi Pan, Chong Sun, Yizhou Zhou, Ying Zhang, Chen Li
PDF
Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation Zitian Wang, Xuecheng Nie, Xiaochao Qu, Yunpeng Chen, Si Liu
PDF
Ditto: Building Digital Twins of Articulated Objects from Interaction Zhenyu Jiang, Cheng-Chun Hsu, Yuke Zhu
PDF
DIVeR: Real-Time and Accurate Neural Radiance Fields with Deterministic Integration for Volume Rendering Liwen Wu, Jae Yong Lee, Anand Bhattad, Yu-Xiong Wang, David Forsyth
PDF
Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation Naofumi Akimoto, Yuhi Matsuo, Yoshimitsu Aoki
PDF
Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection Zhuoling Li, Zhan Qu, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang
PDF
Divide and Conquer: Compositional Experts for Generalized Novel Class Discovery Muli Yang, Yuehua Zhu, Jiaping Yu, Aming Wu, Cheng Deng
PDF
DLFormer: Discrete Latent Transformer for Video Inpainting Jingjing Ren, Qingqing Zheng, Yuanyuan Zhao, Xuemiao Xu, Chen Li
PDF
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising Feng Li, Hao Zhang, Shilong Liu, Jian Guo, Lionel M. Ni, Lei Zhang
PDF
Do Explanations Explain? Model Knows Best Ashkan Khakzar, Pedram Khorsandi, Rozhin Nobahari, Nassir Navab
PDF
Do Learned Representations Respect Causal Relationships? Lan Wang, Vishnu Naresh Boddeti
PDF
DO-GAN: A Double Oracle Framework for Generative Adversarial Networks Aye Phyu Phyu Aung, Xinrun Wang, Runsheng Yu, Bo An, Senthilnath Jayavelu, Xiaoli Li
PDF
Does Robustness on ImageNet Transfer to Downstream Tasks? Yutaro Yamada, Mayu Otani
PDF
Does Text Attract Attention on E-Commerce Images: A Novel Saliency Prediction Dataset and Method Lai Jiang, Yifei Li, Shengxi Li, Mai Xu, Se Lei, Yichen Guo, Bo Huang
PDF
Domain Adaptation on Point Clouds via Geometry-Aware Implicits Yuefan Shen, Yanchao Yang, Mi Yan, He Wang, Youyi Zheng, Leonidas J. Guibas
PDF
Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing Zhuo Wang, Zezheng Wang, Zitong Yu, Weihong Deng, Jiahong Li, Tingting Gao, Zhongyuan Wang
PDF
Domain-Agnostic Prior for Transfer Semantic Segmentation Xinyue Huo, Lingxi Xie, Hengtong Hu, Wengang Zhou, Houqiang Li, Qi Tian
PDF
Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Subhadeep Koley, Rohit Kundu, Aneeshan Sain, Tao Xiang, Yi-Zhe Song
PDF
DoubleField: Bridging the Neural Surface and Radiance Fields for High-Fidelity Human Reconstruction and Rendering Ruizhi Shao, Hongwen Zhang, He Zhang, Mingjia Chen, Yan-Pei Cao, Tao Yu, Yebin Liu
PDF
DPGEN: Differentially Private Generative Energy-Guided Network for Natural Image Synthesis Jia-Wei Chen, Chia-Mu Yu, Ching-Chia Kao, Tzai-Wei Pang, Chun-Shien Lu
PDF
DPICT: Deep Progressive Image Compression Using Trit-Planes Jae-Han Lee, Seungmin Jeon, Kwang Pyo Choi, Youngo Park, Chang-Su Kim
PDF
DR.VIC: Decomposition and Reasoning for Video Individual Counting Tao Han, Lei Bai, Junyu Gao, Qi Wang, Wanli Ouyang
PDF
Dreaming to Prune Image Deraining Networks Weiqi Zou, Yang Wang, Xueyang Fu, Yang Cao
PDF
Dressing in the Wild by Watching Dance Videos Xin Dong, Fuwei Zhao, Zhenyu Xie, Xijin Zhang, Daniel K. Du, Min Zheng, Xiang Long, Xiaodan Liang, Jianchao Yang
PDF
Drop the GAN: In Defense of Patches Nearest Neighbors as Single Image Generative Models Niv Granot, Ben Feinstein, Assaf Shocher, Shai Bagon, Michal Irani
PDF
DST: Dynamic Substitute Training for Data-Free Black-Box Attack Wenxuan Wang, Xuelin Qian, Yanwei Fu, Xiangyang Xue
PDF
DTA: Physical Camouflage Attacks Using Differentiable Transformation Network Naufal Suryanto, Yongsu Kim, Hyoeun Kang, Harashta Tatimma Larasati, Youngyeo Yun, Thi-Thu-Huong Le, Hunmin Yang, Se-Yoon Oh, Howon Kim
PDF
DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification Hongrun Zhang, Yanda Meng, Yitian Zhao, Yihong Qiao, Xiaoyun Yang, Sarah E. Coupland, Yalin Zheng
PDF
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution Xiaoqian Xu, Pengxu Wei, Weikai Chen, Yang Liu, Mingzhi Mao, Liang Lin, Guanbin Li
PDF
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification Haowei Zhu, Wenjing Ke, Dong Li, Ji Liu, Lu Tian, Yi Shan
PDF
Dual Task Learning by Leveraging Both Dense Correspondence and Mis-Correspondence for Robust Change Detection with Imperfect Matches Jin-Man Park, Ue-Hwan Kim, Seon-Hoon Lee, Jong-Hwan Kim
PDF
Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo Chaoning Zhang, Kang Zhang, Trung X. Pham, Axi Niu, Zhinan Qiao, Chang D. Yoo, In So Kweon
PDF
Dual-AI: Dual-Path Actor Interaction Learning for Group Activity Recognition Mingfei Han, David Junhao Zhang, Yali Wang, Rui Yan, Lina Yao, Xiaojun Chang, Yu Qiao
PDF
Dual-Generator Face Reenactment Gee-Sern Hsu, Chun-Hung Tsai, Hung-Yi Wu
PDF
Dual-Key Multimodal Backdoors for Visual Question Answering Matthew Walmer, Karan Sikka, Indranil Sur, Abhinav Shrivastava, Susmit Jha
PDF
Dual-Path Image Inpainting with Auxiliary GAN Inversion Wentao Wang, Li Niu, Jianfu Zhang, Xue Yang, Liqing Zhang
PDF
Dual-Shutter Optical Vibration Sensing Mark Sheinin, Dorian Chan, Matthew O'Toole, Srinivasa G. Narasimhan
PDF
Dynamic 3D Gaze from Afar: Deep Gaze Estimation from Temporal Eye-Head-Body Coordination Soma Nonaka, Shohei Nobuhara, Ko Nishino
PDF
Dynamic Dual-Output Diffusion Models Yaniv Benny, Lior Wolf
PDF
Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-Learning Arnav Chavan, Rishabh Tiwari, Udbhav Bamba, Deepak K. Gupta
PDF
Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information Lingfeng Yang, Xiang Li, Renjie Song, Borui Zhao, Juntian Tao, Shihao Zhou, Jiajun Liang, Jian Yang
PDF
Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation Jie Liu, Yanqi Bao, Guo-Sen Xie, Huan Xiong, Jan-Jakob Sonke, Efstratios Gavves
PDF
Dynamic Scene Graph Generation via Anticipatory Pre-Training Yiming Li, Xiaoshan Yang, Changsheng Xu
PDF
Dynamic Sparse R-CNN Qinghang Hong, Fengming Liu, Dong Li, Ji Liu, Lu Tian, Yi Shan
PDF
DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation Aysim Toker, Lukas Kondmann, Mark Weber, Marvin Eisenberger, Andrés Camero, Jingliang Hu, Ariadna Pregel Hoderlein, Çağlar Şenaras, Timothy Davis, Daniel Cremers, Giovanni Marchisio, Xiao Xiang Zhu, Laura Leal-Taixé
PDF
DyRep: Bootstrapping Training with Dynamic Re-Parameterization Tao Huang, Shan You, Bohan Zhang, Yuxuan Du, Fei Wang, Chen Qian, Chang Xu
PDF
DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion Arthur Douillard, Alexandre Ramé, Guillaume Couairon, Matthieu Cord
PDF
E-CIR: Event-Enhanced Continuous Intensity Recovery Chen Song, Qixing Huang, Chandrajit Bajaj
PDF
E2(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition Chiara Plizzari, Mirco Planamente, Gabriele Goletto, Marco Cannici, Emanuele Gusso, Matteo Matteucci, Barbara Caputo
PDF
E2EC: An End-to-End Contour-Based Method for High-Quality High-Speed Instance Segmentation Tao Zhang, Shiqing Wei, Shunping Ji
PDF
E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential Equations Jongwan Kim, DongJin Lee, Byunggook Na, Seongsik Park, Sungroh Yoon
PDF
EASE: Unsupervised Discriminant Subspace Learning for Transductive Few-Shot Learning Hao Zhu, Piotr Koniusz
PDF
EDTER: Edge Detection with Transformer Mengyang Pu, Yaping Huang, Yuming Liu, Qingji Guan, Haibin Ling
PDF
Effective Conditioned and Composed Image Retrieval Combining CLIP-Based Features Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo
PDF
Efficient Classification of Very Large Images with Tiny Objects Fanjie Kong, Ricardo Henao
PDF
Efficient Deep Embedded Subspace Clustering Jinyu Cai, Jicong Fan, Wenzhong Guo, Shiping Wang, Yunhe Zhang, Zhao Zhang
PDF
Efficient Geometry-Aware 3D Generative Adversarial Networks Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas J. Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein
PDF
Efficient Large-Scale Localization by Global Instance Recognition Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla
PDF
Efficient Maximal Coding Rate Reduction by Variational Forms Christina Baek, Ziyang Wu, Kwan Ho Ryan Chan, Tianjiao Ding, Yi Ma, Benjamin D. Haeffele
PDF
Efficient Multi-View Stereo by Iterative Dynamic Cost Volume Shaoqian Wang, Bo Li, Yuchao Dai
PDF
Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer Frederic Z. Zhang, Dylan Campbell, Stephen Gould
PDF
Efficient Video Instance Segmentation via Tracklet Query and Proposal Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gérard Medioni
PDF
EfficientNeRF Efficient Neural Radiance Fields Tao Hu, Shu Liu, Yilun Chen, Tiancheng Shen, Jiaya Jia
PDF
Ego4D: Around the World in 3,000 Hours of Egocentric Video Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolář, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik
PDF
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu
PDF
Egocentric Prediction of Action Target in 3D Yiming Li, Ziang Cao, Andrew Liang, Benjamin Liang, Luoyao Chen, Hang Zhao, Chen Feng
PDF
Egocentric Scene Understanding via Multimodal Spatial Rectifier Tien Do, Khiem Vuong, Hyun Soo Park
PDF
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval Haoyu Ma, Handong Zhao, Zhe Lin, Ajinkya Kale, Zhangyang Wang, Tong Yu, Jiuxiang Gu, Sunav Choudhary, Xiaohui Xie
PDF
Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation Wonhui Park, Dongkwon Jin, Chang-Su Kim
PDF
Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes Dongkwon Jin, Wonhui Park, Seong-Gyun Jeong, Heeyeon Kwon, Chang-Su Kim
PDF
ElePose: Unsupervised 3D Human Pose Estimation by Predicting Camera Elevation and Learning Normalizing Flows on 2D Poses Bastian Wandt, James J. Little, Helge Rhodin
PDF
ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, Yan Wang
PDF
ELSR: Efficient Line Segment Reconstruction with Planes and Points Guidance Dong Wei, Yi Wan, Yongjun Zhang, Xinyi Liu, Bin Zhang, Xiqi Wang
PDF
Embracing Single Stride 3D Object Detector with Sparse Transformer Lue Fan, Ziqi Pang, Tianyuan Zhang, Yu-Xiong Wang, Hang Zhao, Feng Wang, Naiyan Wang, Zhaoxiang Zhang
PDF
EMOCA: Emotion Driven Monocular Face Capture and Animation Radek Daněček, Michael J. Black, Timo Bolkart
PDF
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching Yaya Shi, Xu Yang, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha
PDF
En-Compactness: Self-Distillation Embedding & Contrastive Generation for Generalized Zero-Shot Learning Xia Kong, Zuodong Gao, Xiaofan Li, Ming Hong, Jun Liu, Chengjie Wang, Yuan Xie, Yanyun Qu
PDF
Enabling Equivariance for Arbitrary Lie Groups Lachlan E. MacDonald, Sameera Ramasinghe, Simon Lucey
PDF
End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection Congcong Li, Xinyao Wang, Longyin Wen, Dexiang Hong, Tiejian Luo, Libo Zhang
PDF
End-to-End Generative Pretraining for Multimodal Video Captioning Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid
PDF
End-to-End Human-Gaze-Target Detection with Transformers Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen
PDF
End-to-End Multi-Person Pose Estimation with Transformers Dahu Shi, Xing Wei, Liangqi Li, Ye Ren, Wenming Tan
PDF
End-to-End Reconstruction-Classification Learning for Face Forgery Detection Junyi Cao, Chao Ma, Taiping Yao, Shen Chen, Shouhong Ding, Xiaokang Yang
PDF
End-to-End Referring Video Object Segmentation with Multimodal Transformers Adam Botach, Evgenii Zheltonozhskii, Chaim Baskin
PDF
End-to-End Semi-Supervised Learning for Video Action Detection Akash Kumar, Yogesh Singh Rawat
PDF
End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps Ke Guo, Wenxi Liu, Jia Pan
PDF
Energy-Based Latent Aligner for Incremental Learning K J Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Vineeth N Balasubramanian
PDF
Enhancing Adversarial Robustness for Deep Metric Learning Mo Zhou, Vishal M. Patel
PDF
Enhancing Adversarial Training with Second-Order Statistics of Weights Gaojie Jin, Xinping Yi, Wei Huang, Sven Schewe, Xiaowei Huang
PDF
Enhancing Classifier Conservativeness and Robustness by Polynomiality Ziqi Wang, Marco Loog
PDF
Enhancing Face Recognition with Self-Supervised 3D Reconstruction Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen
PDF
Ensembling Off-the-Shelf Models for GAN Training Nupur Kumari, Richard Zhang, Eli Shechtman, Jun-Yan Zhu
PDF
Entropy-Based Active Learning for Object Detection with Progressive Diversity Constraint Jiaxi Wu, Jiaxin Chen, Di Huang
PDF
EnvEdit: Environment Editing for Vision-and-Language Navigation Jialu Li, Hao Tan, Mohit Bansal
PDF
Episodic Memory Question Answering Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Mukul Khanna, Dhruv Batra, Devi Parikh
PDF
EPro-PnP: Generalized End-to-End Probabilistic Perspective-N-Points for Monocular Object Pose Estimation Hansheng Chen, Pichao Wang, Fan Wang, Wei Tian, Lu Xiong, Hao Li
PDF
Equalized Focal Loss for Dense Long-Tailed Object Detection Bo Li, Yongqiang Yao, Jingru Tan, Gang Zhang, Fengwei Yu, Jianwei Lu, Ye Luo
PDF
Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets Vishnu Suresh Lokhande, Rudrasis Chakraborty, Sathya N. Ravi, Vikas Singh
PDF
Equivariant Point Cloud Analysis via Learning Orientations for Message Passing Shitong Luo, Jiahan Li, Jiaqi Guan, Yufeng Su, Chaoran Cheng, Jian Peng, Jianzhu Ma
PDF
ES6D: A Computation Efficient and Symmetry-Aware 6d Pose Regression Framework Ningkai Mo, Wanshui Gan, Naoto Yokoya, Shifeng Chen
PDF
Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination Yiqun Mei, Pengfei Guo, Vishal M. Patel
PDF
ESCNet: Gaze Target Detection with the Understanding of 3D Scenes Jun Bao, Buyu Liu, Jun Yu
PDF
Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Diogo Luvizon, Christian Theobalt
PDF
Estimating Example Difficulty Using Variance of Gradients Chirag Agarwal, Daniel D'souza, Sara Hooker
PDF
Estimating Fine-Grained Noise Model via Contrastive Learning Yunhao Zou, Ying Fu
PDF
Estimating Structural Disparities for Face Models Shervin Ardeshir, Cristina Segalin, Nathan Kallus
PDF
ETHSeg: An Amodel Instance Segmentation Network and a Real-World Dataset for X-Ray Waste Inspection Lingteng Qiu, Zhangyang Xiong, Xuhao Wang, Kenkun Liu, Yihan Li, Guanying Chen, Xiaoguang Han, Shuguang Cui
PDF
Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition Junho Kim, Inwoo Hwang, Young Min Kim
PDF
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization Damien Teney, Ehsan Abbasnejad, Simon Lucey, Anton van den Hengel
PDF
Evaluation-Oriented Knowledge Distillation for Deep Face Recognition Yuge Huang, Jiaxiang Wu, Xingkun Xu, Shouhong Ding
PDF
Event-Aided Direct Sparse Odometry Javier Hidalgo-Carrió, Guillermo Gallego, Davide Scaramuzza
PDF
Event-Based Video Reconstruction via Potential-Assisted Spiking Neural Network Lin Zhu, Xiao Wang, Yi Chang, Jianing Li, Tiejun Huang, Yonghong Tian
PDF
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogerio S. Feris, David Harwath, James Glass, Hilde Kuehne
PDF
EvUnroll: Neuromorphic Events Based Rolling Shutter Image Correction Xinyu Zhou, Peiqi Duan, Yi Ma, Boxin Shi
PDF
Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization Yabin Zhang, Minghan Li, Ruihuang Li, Kui Jia, Lei Zhang
PDF
Exemplar-Based Pattern Synthesis with Implicit Periodic Field Network Haiwei Chen, Jiayi Liu, Weikai Chen, Shichen Liu, Yajie Zhao
PDF
Expanding Large Pre-Trained Unimodal Models with Multimodal Information Injection for Image-Text Multimodal Classification Tao Liang, Guosheng Lin, Mingyang Wan, Tianrui Li, Guojun Ma, Fengmao Lv
PDF
Expanding Low-Density Latent Regions for Open-Set Object Detection Jiaming Han, Yuqiang Ren, Jian Ding, Xingjia Pan, Ke Yan, Gui-Song Xia
PDF
Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention Yu Yang, Seungbae Kim, Jungseock Joo
PDF
Exploiting Explainable Metrics for Augmented SGD Mahdi S. Hosseini, Mathieu Tuli, Konstantinos N. Plataniotis
PDF
Exploiting Pseudo Labels in a Self-Supervised Learning Framework for Improved Monocular Depth Estimation Andra Petrovai, Sergiu Nedevschi
PDF
Exploiting Rigidity Constraints for LiDAR Scene Flow Estimation Guanting Dong, Yueyi Zhang, Hanlin Li, Xiaoyan Sun, Zhiwei Xiong
PDF
Exploiting Temporal Relations on Radar Perception for Autonomous Driving Peizhao Li, Pu Wang, Karl Berntorp, Hongfu Liu
PDF
Explore Spatio-Temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and Baseline Kailai Zhou, Yibo Wang, Tao Lv, Yunqian Li, Linsen Chen, Qiu Shen, Xun Cao
PDF
Exploring and Evaluating Image Restoration Potential in Dynamic Scenes Cheng Zhang, Shaolin Su, Yu Zhu, Qingsen Yan, Jinqiu Sun, Yanning Zhang
PDF
Exploring Denoised Cross-Video Contrast for Weakly-Supervised Temporal Action Localization Jingjing Li, Tianyu Yang, Wei Ji, Jue Wang, Li Cheng
PDF
Exploring Domain-Invariant Parameters for Source Free Domain Adaptation Fan Wang, Zhongyi Han, Yongshun Gong, Yilong Yin
PDF
Exploring Dual-Task Correlation for Pose Guided Person Image Generation Pengze Zhang, Lingxiao Yang, Jian-Huang Lai, Xiaohua Xie
PDF
Exploring Effective Data for Surrogate Training Towards Black-Box Attack Xuxiang Sun, Gong Cheng, Hongda Li, Lei Pei, Junwei Han
PDF
Exploring Endogenous Shift for Cross-Domain Detection: A Large-Scale Benchmark and Perturbation Suppression Network Renshuai Tao, Hainan Li, Tianbo Wang, Yanlu Wei, Yifu Ding, Bowei Jin, Hongping Zhi, Xianglong Liu, Aishan Liu
PDF
Exploring Frequency Adversarial Attacks for Face Forgery Detection Shuai Jia, Chao Ma, Taiping Yao, Bangjie Yin, Shouhong Ding, Xiaokang Yang
PDF
Exploring Geometric Consistency for Monocular 3D Object Detection Qing Lian, Botao Ye, Ruijia Xu, Weilong Yao, Tong Zhang
PDF
Exploring Patch-Wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks Chanyong Jung, Gihyun Kwon, Jong Chul Ye
PDF
Exploring Set Similarity for Dense Self-Supervised Representation Learning Zhaoqing Wang, Qiang Li, Guoxin Zhang, Pengfei Wan, Wen Zheng, Nannan Wang, Mingming Gong, Tongliang Liu
PDF
Exploring Structure-Aware Transformer over Interaction Proposals for Human-Object Interaction Detection Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
PDF
Exploring the Equivalence of Siamese Self-Supervised Learning via a Unified Gradient Framework Chenxin Tao, Honghui Wang, Xizhou Zhu, Jiahua Dong, Shiji Song, Gao Huang, Jifeng Dai
PDF
Exposure Normalization and Compensation for Multiple-Exposure Correction Jie Huang, Yajing Liu, Xueyang Fu, Man Zhou, Yang Wang, Feng Zhao, Zhiwei Xiong
PDF
Expressive Talking Head Generation with Granular Audio-Visual Control Borong Liang, Yan Pan, Zhizhi Guo, Hang Zhou, Zhibin Hong, Xiaoguang Han, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang
PDF
Extracting Triangular 3D Models, Materials, and Lighting from Images Jacob Munkberg, Jon Hasselgren, Tianchang Shen, Jun Gao, Wenzheng Chen, Alex Evans, Thomas Müller, Sanja Fidler
PDF
EyePAD++: A Distillation-Based Approach for Joint Eye Authentication and Presentation Attack Detection Using Periocular Images Prithviraj Dhar, Amit Kumar, Kirsten Kaplan, Khushi Gupta, Rakesh Ranjan, Rama Chellappa
PDF
F-SFT: Shape-from-Template with a Physics-Based Deformation Model Navami Kairanda, Edith Tretschk, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik
PDF
Face Relighting with Geometrically Consistent Shadows Andrew Hou, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu
PDF
Face2Exp: Combating Data Biases for Facial Expression Recognition Dan Zeng, Zhiyuan Lin, Xiao Yan, Yuting Liu, Fei Wang, Bo Tang
PDF
FaceFormer: Speech-Driven 3D Facial Animation with Transformers Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura
PDF
FaceVerse: A Fine-Grained and Detail-Controllable 3D Face Morphable Model from a Hybrid Dataset Lizhen Wang, Zhiyuan Chen, Tao Yu, Chenguang Ma, Liang Li, Yebin Liu
PDF
Failure Modes of Domain Generalization Algorithms Tigran Galstyan, Hrayr Harutyunyan, Hrant Khachatrian, Greg Ver Steeg, Aram Galstyan
PDF
Fair Contrastive Learning for Facial Attribute Classification Sungho Park, Jewook Lee, Pilhyeon Lee, Sunhee Hwang, Dohyung Kim, Hyeran Byun
PDF
Fairness-Aware Adversarial Perturbation Towards Bias Mitigation for Deployed Deep Models Zhibo Wang, Xiaowei Dong, Henry Xue, Zhifei Zhang, Weifeng Chiu, Tao Wei, Kui Ren
PDF
Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations Zhixuan Zhong, Liangyu Chai, Yang Zhou, Bailin Deng, Jia Pan, Shengfeng He
PDF
FAM: Visual Explanations for the Feature Representations from Deep Convolutional Networks Yuxi Wu, Changhuai Chen, Jun Che, Shiliang Pu
PDF
FashionVLP: Vision Language Transformer for Fashion Retrieval with Feedback Sonam Goenka, Zhaoheng Zheng, Ayush Jaiswal, Rakesh Chada, Yue Wu, Varsha Hedau, Pradeep Natarajan
PDF
Fast Algorithm for Low-Rank Tensor Completion in Delay-Embedded Space Ryuki Yamamoto, Hidekata Hontani, Akira Imakura, Tatsuya Yokota
PDF
Fast and Unsupervised Action Boundary Detection for Action Segmentation Zexing Du, Xue Wang, Guoqing Zhou, Qing Wang
PDF
Fast Light-Weight Near-Field Photometric Stereo Daniel Lichy, Soumyadip Sengupta, David W. Jacobs
PDF
Fast Point Transformer Chunghyun Park, Yoonwoo Jeong, Minsu Cho, Jaesik Park
PDF
Fast, Accurate and Memory-Efficient Partial Permutation Synchronization Shaohan Li, Yunpeng Shi, Gilad Lerman
PDF
FastDOG: Fast Discrete Optimization on GPU Ahmed Abbas, Paul Swoboda
PDF
Feature Erasing and Diffusion Network for Occluded Person Re-Identification Zhikang Wang, Feng Zhu, Shixiang Tang, Rui Zhao, Lihuo He, Jiangning Song
PDF
Feature Statistics Mixing Regularization for Generative Adversarial Networks Junho Kim, Yunjey Choi, Youngjung Uh
PDF
FedCor: Correlation-Based Active Client Selection Strategy for Heterogeneous Federated Learning Minxue Tang, Xuefei Ning, Yitu Wang, Jingwei Sun, Yu Wang, Hai Li, Yiran Chen
PDF
FedCorr: Multi-Stage Federated Learning for Label Noise Correction Jingyi Xu, Zihan Chen, Tony Q.S. Quek, Kai Fong Ernest Chong
PDF
FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction Liang Gao, Huazhu Fu, Li Li, Yingwen Chen, Ming Xu, Cheng-Zhong Xu
PDF
Federated Class-Incremental Learning Jiahua Dong, Lixu Wang, Zhen Fang, Gan Sun, Shichao Xu, Xiao Wang, Qi Zhu
PDF
Federated Learning with Position-Aware Neurons Xin-Chun Li, Yi-Chu Xu, Shaoming Song, Bingshuai Li, Yinchuan Li, Yunfeng Shao, De-Chuan Zhan
PDF
FENeRF: Face Editing in Neural Radiance Fields Jingxiang Sun, Xuan Wang, Yong Zhang, Xiaoyu Li, Qi Zhang, Yebin Liu, Jue Wang
PDF
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos Yan Wang, Yixuan Sun, Yiwen Huang, Zhongying Liu, Shuyong Gao, Wei Zhang, Weifeng Ge, Wenqiang Zhang
PDF
Few Could Be Better than All: Feature Sampling and Grouping for Scene Text Detection Jingqun Tang, Wenqing Zhang, Hongye Liu, MingKun Yang, Bo Jiang, Guanglong Hu, Xiang Bai
PDF
Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment Jiayu Xiao, Liang Li, Chaofei Wang, Zheng-Jun Zha, Qingming Huang
PDF
Few-Shot Backdoor Defense Using Shapley Estimation Jiyang Guan, Zhuozhuo Tu, Ran He, Dacheng Tao
PDF
Few-Shot Font Generation by Learning Fine-Grained Local Styles Licheng Tang, Yiyang Cai, Jiaming Liu, Zhibin Hong, Mingming Gong, Minhu Fan, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang
PDF
Few-Shot Head Swapping in the Wild Changyong Shu, Hemao Wu, Hang Zhou, Jiaming Liu, Zhibin Hong, Changxing Ding, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang
PDF
Few-Shot Incremental Learning for Label-to-Image Translation Pei Chen, Yangkang Zhang, Zejian Li, Lingyun Sun
PDF
Few-Shot Keypoint Detection with Uncertainty Learning for Unseen Species Changsheng Lu, Piotr Koniusz
PDF
Few-Shot Learning with Noisy Labels Kevin J. Liang, Samrudhdhi B. Rangrej, Vladan Petrovic, Tal Hassner
PDF
Few-Shot Object Detection with Fully Cross-Transformer Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Shih-Fu Chang
PDF
FIBA: Frequency-Injection Based Backdoor Attack in Medical Image Analysis Yu Feng, Benteng Ma, Jing Zhang, Shanshan Zhao, Yong Xia, Dacheng Tao
PDF
FIFO: Learning Fog-Invariant Features for Foggy Scene Segmentation Sohyun Lee, Taeyoung Son, Suha Kwak
PDF
Finding Badly Drawn Bunnies Lan Yang, Kaiyue Pang, Honggang Zhang, Yi-Zhe Song
PDF
Finding Fallen Objects via Asynchronous Audio-Visual Integration Chuang Gan, Yi Gu, Siyuan Zhou, Jeremy Schwartz, Seth Alter, James Traer, Dan Gutfreund, Joshua B. Tenenbaum, Josh H. McDermott, Antonio Torralba
PDF
Finding Good Configurations of Planar Primitives in Unorganized Point Clouds Mulin Yu, Florent Lafarge
PDF
Fine-Grained Object Classification via Self-Supervised Pose Alignment Xuhui Yang, Yaowei Wang, Ke Chen, Yong Xu, Yonghong Tian
PDF
Fine-Grained Predicates Learning for Scene Graph Generation Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song
PDF
Fine-Grained Temporal Contrastive Learning for Weakly-Supervised Temporal Action Localization Junyu Gao, Mengyuan Chen, Changsheng Xu
PDF
Fine-Tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning Lin Zhang, Li Shen, Liang Ding, Dacheng Tao, Ling-Yu Duan
PDF
Fine-Tuning Image Transformers Using Learnable Memory Mark Sandler, Andrey Zhmoginov, Max Vladymyrov, Andrew Jackson
PDF
FineDiving: A Fine-Grained Dataset for Procedure-Aware Action Quality Assessment Jinglin Xu, Yongming Rao, Xumin Yu, Guangyi Chen, Jie Zhou, Jiwen Lu
PDF
Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations Zirui Peng, Shaofeng Li, Guoxing Chen, Cheng Zhang, Haojin Zhu, Minhui Xue
PDF
Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction Sara Elkerdawy, Mostafa Elhoushi, Hong Zhang, Nilanjan Ray
PDF
Fisher Information Guidance for Learned Time-of-Flight Imaging Jiaqu Li, Tao Yue, Sijie Zhao, Xuemei Hu
PDF
FisherMatch: Semi-Supervised Rotation Regression via Entropy-Based Filtering Yingda Yin, Yingcheng Cai, He Wang, Baoquan Chen
PDF
Fixing Malfunctional Objects with Learned Physical Simulation and Functional Prediction Yining Hong, Kaichun Mo, Li Yi, Leonidas J. Guibas, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan
PDF
FLAG: Flow-Based 3D Avatar Generation from Sparse Observations Sadegh Aliakbarian, Pashmina Cameron, Federica Bogo, Andrew Fitzgibbon, Thomas J. Cashman
PDF
FLAVA: A Foundational Language and Vision Alignment Model Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, Douwe Kiela
PDF
FlexIT: Towards Flexible Semantic Image Translation Guillaume Couairon, Asya Grechka, Jakob Verbeek, Holger Schwenk, Matthieu Cord
PDF
FLOAT: Factorized Learning of Object Attributes for Improved Multi-Object Multi-Part Scene Parsing Rishubh Singh, Pranav Gupta, Pradeep Shenoy, Ravikiran Sarvadevabhatla
PDF
FMCNet: Feature-Level Modality Compensation for Visible-Infrared Person Re-Identification Qiang Zhang, Changzhou Lai, Jianan Liu, Nianchang Huang, Jungong Han
PDF
Focal and Global Knowledge Distillation for Detectors Zhendong Yang, Zhe Li, Xiaohu Jiang, Yuan Gong, Zehuan Yuan, Danpei Zhao, Chun Yuan
PDF
Focal Length and Object Pose Estimation via Render and Compare Georgy Ponimatkin, Yann Labbé, Bryan Russell, Mathieu Aubry, Josef Sivic
PDF
Focal Sparse Convolutional Networks for 3D Object Detection Yukang Chen, Yanwei Li, Xiangyu Zhang, Jian Sun, Jiaya Jia
PDF
FocalClick: Towards Practical Interactive Image Segmentation Xi Chen, Zhiyan Zhao, Yilei Zhang, Manni Duan, Donglian Qi, Hengshuang Zhao
PDF
FocusCut: Diving into a Focus View in Interactive Segmentation Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chun-Le Guo, Ming-Ming Cheng
PDF
FoggyStereo: Stereo Matching with Fog Volume Representation Chengtang Yao, Lidong Yu
PDF
Forecasting Characteristic 3D Poses of Human Actions Christian Diller, Thomas Funkhouser, Angela Dai
PDF
Forecasting from LiDAR via Future Object Detection Neehar Peri, Jonathon Luiten, Mengtian Li, Aljoša Ošep, Laura Leal-Taixé, Deva Ramanan
PDF
Forward Compatible Few-Shot Class-Incremental Learning Da-Wei Zhou, Fu-Yun Wang, Han-Jia Ye, Liang Ma, Shiliang Pu, De-Chuan Zhan
PDF
Forward Compatible Training for Large-Scale Embedding Retrieval Systems Vivek Ramanujan, Pavan Kumar Anasosalu Vasu, Ali Farhadi, Oncel Tuzel, Hadi Pouransari
PDF
Forward Propagation, Backward Regression, and Pose Association for Hand Tracking in the Wild Mingzhen Huang, Supreeth Narasimhaswamy, Saif Vazir, Haibin Ling, Minh Hoai
PDF
Fourier Document Restoration for Robust Document Dewarping and Recognition Chuhui Xue, Zichen Tian, Fangneng Zhan, Shijian Lu, Song Bai
PDF
Fourier PlenOctrees for Dynamic Radiance Field Rendering in Real-Time Liao Wang, Jiakai Zhang, Xinhang Liu, Fuqiang Zhao, Yanshun Zhang, Yingliang Zhang, Minye Wu, Jingyi Yu, Lan Xu
PDF
Frame Averaging for Equivariant Shape Space Learning Matan Atzmon, Koki Nagano, Sanja Fidler, Sameh Khamis, Yaron Lipman
PDF
Frame-Wise Action Representations for Long Videos via Sequence Contrastive Learning Minghao Chen, Fangyun Wei, Chong Li, Deng Cai
PDF
FreeSOLO: Learning to Segment Objects Without Annotations Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
PDF
Frequency-Driven Imperceptible Adversarial Attack on Semantic Similarity Cheng Luo, Qinliang Lin, Weicheng Xie, Bizhu Wu, Jinheng Xie, Linlin Shen
PDF
From Representation to Reasoning: Towards Both Evidence and Commonsense Reasoning for Video Question-Answering Jiangtong Li, Li Niu, Liqing Zhang
PDF
FS6D: Few-Shot 6d Pose Estimation of Novel Objects Yisheng He, Yao Wang, Haoqiang Fan, Jian Sun, Qifeng Chen
PDF
Full-Range Virtual Try-on with Recurrent Tri-Level Transform Han Yang, Xinrui Yu, Ziwei Liu
PDF
Future Transformer for Long-Term Action Anticipation Dayoung Gong, Joonseok Lee, Manjin Kim, Seong Jong Ha, Minsu Cho
PDF
FvOR: Robust Joint Shape and Pose Optimization for Few-View Object Reconstruction Zhenpei Yang, Zhile Ren, Miguel Angel Bautista, Zaiwei Zhang, Qi Shan, Qixing Huang
PDF
FWD: Real-Time Novel View Synthesis with Forward Warping and Depth Ang Cao, Chris Rockwell, Justin Johnson
PDF
Gait Recognition in the Wild with Dense 3D Representations and a Benchmark Jinkai Zheng, Xinchen Liu, Wu Liu, Lingxiao He, Chenggang Yan, Tao Mei
PDF
GAN-Supervised Dense Visual Alignment William Peebles, Jun-Yan Zhu, Richard Zhang, Antonio Torralba, Alexei A. Efros, Eli Shechtman
PDF
GanOrCon: Are Generative Models Useful for Few-Shot Segmentation? Oindrila Saha, Zezhou Cheng, Subhransu Maji
PDF
GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation Xingzhe He, Bastian Wandt, Helge Rhodin
PDF
GASP, a Generalized Framework for Agglomerative Clustering of Signed Graphs and Its Application to Instance Segmentation Alberto Bailoni, Constantin Pape, Nathan Hütsch, Steffen Wolf, Thorsten Beier, Anna Kreshuk, Fred A. Hamprecht
PDF
GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings Zhaohua Zheng, Jianfang Li, Lingjie Zhu, Honghua Li, Frank Petzold, Ping Tan
PDF
GaTector: A Unified Framework for Gaze Object Prediction Binglu Wang, Tao Hu, Baoshan Li, Xiaojuan Chen, Zhijie Zhang
PDF
Gated2Gated: Self-Supervised Depth Estimation from Gated Images Amanpreet Walia, Stefanie Walz, Mario Bijelic, Fahim Mannan, Frank Julca-Aguilar, Michael Langer, Werner Ritter, Felix Heide
PDF
GateHUB: Gated History Unit with Background Suppression for Online Action Detection Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen
PDF
Gaussian Process Modeling of Approximate Inference Errors for Variational Autoencoders Minyoung Kim
PDF
GazeOnce: Real-Time Multi-Person Gaze Estimation Mingfang Zhang, Yunfei Liu, Feng Lu
PDF
GCFSR: A Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors Jingwen He, Wu Shi, Kai Chen, Lean Fu, Chao Dong
PDF
GCR: Gradient Coreset Based Replay Buffer Selection for Continual Learning Rishabh Tiwari, Krishnateja Killamsetty, Rishabh Iyer, Pradeep Shenoy
PDF
gDNA: Towards Generative Detailed Neural Avatars Xu Chen, Tianjian Jiang, Jie Song, Jinlong Yang, Michael J. Black, Andreas Geiger, Otmar Hilliges
PDF
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection Yue Liao, Aixi Zhang, Miao Lu, Yongliang Wang, Xiaobo Li, Si Liu
PDF
GenDR: A Generalized Differentiable Renderer Felix Petersen, Bastian Goldluecke, Christian Borgelt, Oliver Deussen
PDF
General Facial Representation Learning in a Visual-Linguistic Manner Yinglin Zheng, Hao Yang, Ting Zhang, Jianmin Bao, Dongdong Chen, Yangyu Huang, Lu Yuan, Dong Chen, Ming Zeng, Fang Wen
PDF
General Incremental Learning with Domain-Aware Categorical Representations Jiangwei Xie, Shipeng Yan, Xuming He
PDF
Generalizable Cross-Modality Medical Image Segmentation via Style Augmentation and Dual Normalization Ziqi Zhou, Lei Qi, Xin Yang, Dong Ni, Yinghuan Shi
PDF
Generalizable Human Pose Triangulation Kristijan Bartol, David Bojanić, Tomislav Petković, Tomislav Pribanić
PDF
Generalized Binary Search Network for Highly-Efficient Multi-View Stereo Zhenxing Mi, Chang Di, Dan Xu
PDF
Generalized Category Discovery Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman
PDF
Generalized Few-Shot Semantic Segmentation Zhuotao Tian, Xin Lai, Li Jiang, Shu Liu, Michelle Shu, Hengshuang Zhao, Jiaya Jia
PDF
Generalizing Gaze Estimation with Rotation Consistency Yiwei Bao, Yunfei Liu, Haofei Wang, Feng Lu
PDF
Generalizing Interactive Backpropagating Refinement for Dense Prediction Networks Fanqing Lin, Brian Price, Tony Martinez
PDF
Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers Han Joo Chae, Seunghwan Lee, Hyewon Son, Seungyeob Han, Taebin Lim
PDF
Generating Diverse 3D Reconstructions from a Single Occluded Face Image Rahul Dey, Vishnu Naresh Boddeti
PDF
Generating Diverse and Natural 3D Human Motions from Text Chuan Guo, Shihao Zou, Xinxin Zuo, Sen Wang, Wei Ji, Xingyu Li, Li Cheng
PDF
Generating High Fidelity Data from Low-Density Regions Using Diffusion Models Vikash Sehwag, Caner Hazirbas, Albert Gordo, Firat Ozgenel, Cristian Canton
PDF
Generating Representative Samples for Few-Shot Classification Jingyi Xu, Hieu Le
PDF
Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior Davis Rempe, Jonah Philion, Leonidas J. Guibas, Sanja Fidler, Or Litany
PDF
Generative Cooperative Learning for Unsupervised Video Anomaly Detection M. Zaigham Zaheer, Arif Mahmood, M. Haris Khan, Mattia Segu, Fisher Yu, Seung-Ik Lee
PDF
Generative Flows with Invertible Attentions Rhea Sanjay Sukthanker, Zhiwu Huang, Suryansh Kumar, Radu Timofte, Luc Van Gool
PDF
GeoEngine: A Platform for Production-Ready Geospatial Research Sagar Verma, Siddharth Gupta, Hal Shin, Akash Panigrahi, Shubham Goswami, Shweta Pardeshi, Natanael Exe, Ujwal Dutta, Tanka Raj Joshi, Nitin Bhojwani
PDF
Geometric Anchor Correspondence Mining with Uncertainty Modeling for Universal Domain Adaptation Liang Chen, Yihang Lou, Jianzhong He, Tao Bai, Minghua Deng
PDF
Geometric and Textural Augmentation for Domain Gap Reduction Xiao-Chang Liu, Yong-Liang Yang, Peter Hall
PDF
Geometric Structure Preserving Warp for Natural Image Stitching Peng Du, Jifeng Ning, Jiguang Cui, Shaoli Huang, Xinchao Wang, Jiaxin Wang
PDF
Geometric Transformer for Fast and Robust Point Cloud Registration Zheng Qin, Hao Yu, Changjian Wang, Yulan Guo, Yuxing Peng, Kai Xu
PDF
Geometry-Aware Guided Loss for Deep Crack Recognition Zhuangzhuang Chen, Jin Zhang, Zhuonan Lai, Jie Chen, Zun Liu, Jianqiang Li
PDF
GeoNeRF: Generalizing NeRF with Geometry Priors Mohammad Mahdi Johari, Yann Lepoittevin, François Fleuret
PDF
GIFS: Neural Implicit Function for General Shape Representation Jianglong Ye, Yuntao Chen, Naiyan Wang, Xiaolong Wang
PDF
GIQE: Generic Image Quality Enhancement via Nth Order Iterative Degradation Pranjay Shyam, Kyung-Soo Kim, Kuk-Jin Yoon
PDF
GIRAFFE HD: A High-Resolution 3D-Aware Generative Model Yang Xue, Yuheng Li, Krishna Kumar Singh, Yong Jae Lee
PDF
Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness Giulio Lovisotto, Nicole Finnie, Mauricio Munoz, Chaithanya Kumar Mummadi, Jan Hendrik Metzen
PDF
GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras Ye Yuan, Umar Iqbal, Pavlo Molchanov, Kris Kitani, Jan Kautz
PDF
Glass Segmentation Using Intensity and Spectral Polarization Cues Haiyang Mei, Bo Dong, Wen Dong, Jiaxi Yang, Seung-Hwan Baek, Felix Heide, Pieter Peers, Xiaopeng Wei, Xin Yang
PDF
Glass: Geometric Latent Augmentation for Shape Spaces Sanjeev Muralikrishnan, Siddhartha Chaudhuri, Noam Aigerman, Vladimir G. Kim, Matthew Fisher, Niloy J. Mitra
PDF
GlideNet: Global, Local and Intrinsic Based Dense Embedding NETwork for Multi-Category Attributes Prediction Kareem Metwaly, Aerin Kim, Elliot Branson, Vishal Monga
PDF
Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation Minghui Hu, Yujie Wang, Tat-Jen Cham, Jianfei Yang, P.N. Suganthan
PDF
Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning Haoxiang Wang, Yite Wang, Ruoyu Sun, Bo Li
PDF
Global Matching with Overlapping Attention for Optical Flow Estimation Shiyu Zhao, Long Zhao, Zhixing Zhang, Enyu Zhou, Dimitris Metaxas
PDF
Global Sensing and Measurements Reuse for Image Compressed Sensing Zi-En Fan, Feng Lian, Jia-Ni Quan
PDF
Global Tracking Transformers Xingyi Zhou, Tianwei Yin, Vladlen Koltun, Philipp Krähenbühl
PDF
Global Tracking via Ensemble of Local Trackers Zikun Zhou, Jianqiu Chen, Wenjie Pei, Kaige Mao, Hongpeng Wang, Zhenyu He
PDF
Global-Aware Registration of Less-Overlap RGB-D Scans Che Sun, Yunde Jia, Yi Guo, Yuwei Wu
PDF
Globetrotter: Connecting Languages by Connecting Images Dídac Surís, Dave Epstein, Carl Vondrick
PDF
GMFlow: Learning Optical Flow via Global Matching Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, Dacheng Tao
PDF
GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping Omid Taheri, Vasileios Choutas, Michael J. Black, Dimitrios Tzionas
PDF
GPU-Based Homotopy Continuation for Minimal Problems in Computer Vision Chiang-Heng Chien, Hongyi Fan, Ahmad Abdelfattah, Elias Tsigaridas, Stanimire Tomov, Benjamin Kimia
PDF
GPV-Pose: Category-Level Object Pose Estimation via Geometry-Guided Point-Wise Voting Yan Di, Ruida Zhang, Zhiqiang Lou, Fabian Manhardt, Xiangyang Ji, Nassir Navab, Federico Tombari
PDF
Gradient-SDF: A Semi-Implicit Surface Representation for 3D Reconstruction Christiane Sommer, Lu Sang, David Schubert, Daniel Cremers
PDF
GradViT: Gradient Inversion of Vision Transformers Ali Hatamizadeh, Hongxu Yin, Holger R. Roth, Wenqi Li, Jan Kautz, Daguang Xu, Pavlo Molchanov
PDF
GraFormer: Graph-Oriented Transformer for 3D Pose Estimation Weixi Zhao, Weiqiang Wang, Yunjie Tian
PDF
GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature Biyang Liu, Huimin Yu, Guodong Qi
PDF
GrainSpace: A Large-Scale Dataset for Fine-Grained and Domain-Adaptive Recognition of Cereal Grains Lei Fan, Yiwen Ding, Dongdong Fan, Donglin Di, Maurice Pagnucco, Yang Song
PDF
GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation Yu Deng, Jiaolong Yang, Jianfeng Xiang, Xin Tong
PDF
Graph Sampling Based Deep Metric Learning for Generalizable Person Re-Identification Shengcai Liao, Ling Shao
PDF
Graph-Based Spatial Transformer with Memory Replay for Multi-Future Pedestrian Trajectory Prediction Lihuan Li, Maurice Pagnucco, Yang Song
PDF
Graph-Context Attention Networks for Size-Varied Deep Graph Matching Zheheng Jiang, Hossein Rahmani, Plamen Angelov, Sue Black, Bryan M. Williams
PDF
Gravitationally Lensed Black Hole Emission Tomography Aviad Levis, Pratul P. Srinivasan, Andrew A. Chael, Ren Ng, Katherine L. Bouman
PDF
GreedyNASv2: Greedier Search with a Greedy Path Filter Tao Huang, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu
PDF
GridShift: A Faster Mode-Seeking Algorithm for Image Segmentation and Object Tracking Abhishek Kumar, Oladayo S. Ajani, Swagatam Das, Rammohan Mallipeddi
PDF
Grounded Language-Image Pre-Training Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao
PDF
Grounding Answers for Visual Questions Asked by Visually Impaired People Chongyan Chen, Samreen Anjum, Danna Gurari
PDF
Group Contextualization for Video Recognition Yanbin Hao, Hao Zhang, Chong-Wah Ngo, Xiangnan He
PDF
Group R-CNN for Weakly Semi-Supervised Object Detection with Points Shilong Zhang, Zhuoran Yu, Liyang Liu, Xinjiang Wang, Aojun Zhou, Kai Chen
PDF
GroupNet: Multiscale Hypergraph Neural Networks for Trajectory Prediction with Relational Reasoning Chenxin Xu, Maosen Li, Zhenyang Ni, Ya Zhang, Siheng Chen
PDF
GroupViT: Semantic Segmentation Emerges from Text Supervision Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang
PDF
GuideFormer: Transformers for Image Guided Depth Completion Kyeongha Rho, Jinsung Ha, Youngjung Kim
PDF
H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection Yunqiu Xu, Yifan Sun, Zongxin Yang, Jiaxu Miao, Yi Yang
PDF
H4D: Human 4D Modeling by Learning Neural Compositional Representation Boyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu
PDF
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale Ram Ramrakhya, Eric Undersander, Dhruv Batra, Abhishek Das
PDF
HairCLIP: Design Your Hair by Text and Reference Image Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, Nenghai Yu
PDF
HairMapper: Removing Hair from Portraits Using GANs Yiqian Wu, Yong-Liang Yang, Xiaogang Jin
PDF
Hallucinated Neural Radiance Fields in the Wild Xingyu Chen, Qi Zhang, Xiaoyu Li, Yue Chen, Ying Feng, Xuan Wang, Jue Wang
PDF
HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network JoonKyu Park, Yeonguk Oh, Gyeongsik Moon, Hongsuk Choi, Kyoung Mu Lee
PDF
HARA: A Hierarchical Approach for Robust Rotation Averaging Seong Hun Lee, Javier Civera
PDF
Harmony: A Generic Unsupervised Approach for Disentangling Semantic Content from Parameterized Transformations Mostofa Rafid Uddin, Gregory Howe, Xiangrui Zeng, Min Xu
PDF
HCSC: Hierarchical Contrastive Selective Coding Yuanfan Guo, Minghao Xu, Jiawen Li, Bingbing Ni, Xuanyu Zhu, Zhenbang Sun, Yi Xu
PDF
HDNet: High-Resolution Dual-Domain Learning for Spectral Compressive Imaging Xiaowan Hu, Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc Van Gool
PDF
HDR-NeRF: High Dynamic Range Neural Radiance Fields Xin Huang, Qi Zhang, Ying Feng, Hongdong Li, Xuan Wang, Qing Wang
PDF
HeadNeRF: A Real-Time NeRF-Based Parametric Head Model Yang Hong, Bo Peng, Haiyao Xiao, Ligang Liu, Juyong Zhang
PDF
HEAT: Holistic Edge Attention Transformer for Structured Reconstruction Jiacheng Chen, Yiming Qian, Yasutaka Furukawa
PDF
HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging Xuanyu Zhang, Yongbing Zhang, Ruiqin Xiong, Qilin Sun, Jian Zhang
PDF
Hierarchical Modular Network for Video Captioning Hanhua Ye, Guorong Li, Yuankai Qi, Shuhui Wang, Qingming Huang, Ming-Hsuan Yang
PDF
Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction Saquib Sarfraz, Marios Koulakis, Constantin Seibold, Rainer Stiefelhagen
PDF
Hierarchical Self-Supervised Representation Learning for Movie Understanding Fanyi Xiao, Kaustav Kundu, Joseph Tighe, Davide Modolo
PDF
High Quality Segmentation for Ultra High-Resolution Images Tiancheng Shen, Yuechen Zhang, Lu Qi, Jason Kuen, Xingyu Xie, Jianlong Wu, Zhe Lin, Jiaya Jia
PDF
High-Fidelity GAN Inversion for Image Attribute Editing Tengfei Wang, Yong Zhang, Yanbo Fan, Jue Wang, Qifeng Chen
PDF
High-Fidelity Human Avatars from a Single RGB Camera Hao Zhao, Jinsong Zhang, Yu-Kun Lai, Zerong Zheng, Yingdi Xie, Yebin Liu, Kun Li
PDF
High-Resolution Face Swapping via Latent Semantics Disentanglement Yangyang Xu, Bailin Deng, Junle Wang, Yanqing Jing, Jia Pan, Shengfeng He
PDF
High-Resolution Image Harmonization via Collaborative Dual Transformations Wenyan Cong, Xinhao Tao, Li Niu, Jing Liang, Xuesong Gao, Qihao Sun, Liqing Zhang
PDF
High-Resolution Image Synthesis with Latent Diffusion Models Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
PDF
Highly-Efficient Incomplete Large-Scale Multi-View Clustering with Consensus Bipartite Graph Siwei Wang, Xinwang Liu, Li Liu, Wenxuan Tu, Xinzhong Zhu, Jiyuan Liu, Sihang Zhou, En Zhu
PDF
HINT: Hierarchical Neuron Concept Explainer Andong Wang, Wei-Ning Lee, Xiaojuan Qi
PDF
Hire-MLP: Vision MLP via Hierarchical Rearrangement Jianyuan Guo, Yehui Tang, Kai Han, Xinghao Chen, Han Wu, Chao Xu, Chang Xu, Yunhe Wang
PDF
HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction Zikang Zhou, Luyao Ye, Jianping Wang, Kui Wu, Kejie Lu
PDF
HL-Net: Heterophily Learning Network for Scene Graph Generation Xin Lin, Changxing Ding, Yibing Zhan, Zijian Li, Dacheng Tao
PDF
HLRTF: Hierarchical Low-Rank Tensor Factorization for Inverse Problems in Multi-Dimensional Imaging Yisi Luo, Xi-Le Zhao, Deyu Meng, Tai-Xiang Jiang
PDF
HODEC: Towards Efficient High-Order DEcomposed Convolutional Neural Networks Miao Yin, Yang Sui, Wanzhao Yang, Xiao Zang, Yu Gong, Bo Yuan
PDF
HODOR: High-Level Object Descriptors for Object Re-Segmentation in Video Learned from Static Images Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
PDF
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction Yunze Liu, Yun Liu, Che Jiang, Kangbo Lyu, Weikang Wan, Hao Shen, Boqiang Liang, Zhoujie Fu, He Wang, Li Yi
PDF
Holocurtains: Programming Light Curtains via Binary Holography Dorian Chan, Srinivasa G. Narasimhan, Matthew O'Toole
PDF
Homography Loss for Monocular 3D Object Detection Jiaqi Gu, Bojian Wu, Lubin Fan, Jianqiang Huang, Shen Cao, Zhiyu Xiang, Xian-Sheng Hua
PDF
HOP: History-and-Order Aware Pre-Training for Vision-and-Language Navigation Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu
PDF
How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs Hazel Doughty, Cees G. M. Snoek
PDF
How Good Is Aesthetic Ability of a Fashion Model? Xingxing Zou, Kaicheng Pang, Wen Zhang, Waikeung Wong
PDF
How Many Observations Are Enough? Knowledge Distillation for Trajectory Forecasting Alessio Monti, Angelo Porrello, Simone Calderara, Pasquale Coscia, Lamberto Ballan, Rita Cucchiara
PDF
How Much Does Input Data Type Impact Final Face Model Accuracy? Jiahao Luo, Fahim Hasan Khan, Issei Mori, Akila de Silva, Eric Sandoval Ruezga, Minghao Liu, Alex Pang, James Davis
PDF
How Much More Data Do I Need? Estimating Requirements for Downstream Tasks Rafid Mahmood, James Lucas, David Acuna, Daiqing Li, Jonah Philion, Jose M. Alvarez, Zhiding Yu, Sanja Fidler, Marc T. Law
PDF
How Well Do Sparse ImageNet Models Transfer? Eugenia Iofinova, Alexandra Peste, Mark Kurtz, Dan Alistarh
PDF
HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zidu Wang, Zhaoxiang Zhang, Zhen Lei
PDF
HSC4D: Human-Centered 4D Scene Capture in Large-Scale Indoor-Outdoor Space Using Wearable IMUs and LiDAR Yudi Dai, Yitai Lin, Chenglu Wen, Siqi Shen, Lan Xu, Jingyi Yu, Yuexin Ma, Cheng Wang
PDF
Human Hands as Probes for Interactive Object Understanding Mohit Goyal, Sahil Modi, Rishabh Goyal, Saurabh Gupta
PDF
Human Instance Matting via Mutual Guidance and Multi-Instance Refinement Yanan Sun, Chi-Keung Tang, Yu-Wing Tai
PDF
Human Mesh Recovery from Multiple Shots Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa
PDF
Human Trajectory Prediction with Momentary Observation Jianhua Sun, Yuxuan Li, Liang Chai, Hao-Shu Fang, Yong-Lu Li, Cewu Lu
PDF
Human-Aware Object Placement for Visual Environment Reconstruction Hongwei Yi, Chun-Hao P. Huang, Dimitrios Tzionas, Muhammed Kocabas, Mohamed Hassan, Siyu Tang, Justus Thies, Michael J. Black
PDF
Human-Object Interaction Detection via Disentangled Transformer Desen Zhou, Zhichao Liu, Jian Wang, Leshan Wang, Tao Hu, Errui Ding, Jingdong Wang
PDF
HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs Fuqiang Zhao, Wei Yang, Jiakai Zhang, Pei Lin, Yingliang Zhang, Jingyi Yu, Lan Xu
PDF
HumanNeRF: Free-Viewpoint Rendering of Moving People from Monocular Video Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-Shlizerman
PDF
HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture Ziyan Wang, Giljoo Nam, Tuur Stuyck, Stephen Lombardi, Michael Zollhöfer, Jessica Hodgins, Christoph Lassner
PDF
Hybrid Relation Guided Set Matching for Few-Shot Action Recognition Xiang Wang, Shiwei Zhang, Zhiwu Qing, Mingqian Tang, Zhengrong Zuo, Changxin Gao, Rong Jin, Nong Sang
PDF
HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, Lizhuang Ma
PDF
Hyperbolic Image Segmentation Mina Ghadimi Atigh, Julian Schoep, Erman Acar, Nanne van Noord, Pascal Mettes
PDF
Hyperbolic Vision Transformers: Combining Improvements in Metric Learning Aleksandr Ermolov, Leyla Mirvakhabova, Valentin Khrulkov, Nicu Sebe, Ivan Oseledets
PDF
HyperDet3D: Learning a Scene-Conditioned 3D Object Detector Yu Zheng, Yueqi Duan, Jiwen Lu, Jie Zhou, Qi Tian
PDF
Hypergraph-Induced Semantic Tuplet Loss for Deep Metric Learning Jongin Lim, Sangdoo Yun, Seulki Park, Jin Young Choi
PDF
HyperInverter: Improving StyleGAN Inversion via Hypernetwork Tan M. Dinh, Anh Tuan Tran, Rang Nguyen, Binh-Son Hua
PDF
HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation Using HyperNet Cheng Peng, Andriy Myronenko, Ali Hatamizadeh, Vishwesh Nath, Md Mahfuzur Rahman Siddiquee, Yufan He, Daguang Xu, Rama Chellappa, Dong Yang
PDF
Hyperspherical Consistency Regularization Cheng Tan, Zhangyang Gao, Lirong Wu, Siyuan Li, Stan Z. Li
PDF
HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing Yuval Alaluf, Omer Tov, Ron Mokady, Rinon Gal, Amit Bermano
PDF
HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening Wele Gedara Chaminda Bandara, Vishal M. Patel
PDF
I M Avatar: Implicit Morphable Head Avatars from Videos Yufeng Zheng, Victoria Fernández Abrevaya, Marcel C. Bühler, Xu Chen, Michael J. Black, Otmar Hilliges
PDF
ICON: Implicit Clothed Humans Obtained from Normals Yuliang Xiu, Jinlong Yang, Dimitrios Tzionas, Michael J. Black
PDF
Id-Free Person Similarity Learning Bing Shuai, Xinyu Li, Kaustav Kundu, Joseph Tighe
PDF
IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment Yiming Zeng, Yue Qian, Qijian Zhang, Junhui Hou, Yixuan Yuan, Ying He
PDF
Identifying Ambiguous Similarity Conditions via Semantic Matching Han-Jia Ye, Yi Shi, De-Chuan Zhan
PDF
IDR: Self-Supervised Image Denoising via Iterative Data Refinement Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li
PDF
IFOR: Iterative Flow Minimization for Robotic Object Rearrangement Ankit Goyal, Arsalan Mousavian, Chris Paxton, Yu-Wei Chao, Brian Okorn, Jia Deng, Dieter Fox
PDF
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Xiaoming Huang, Ying Tai, Chengjie Wang, Jie Yang
PDF
iFS-RCNN: An Incremental Few-Shot Instance Segmenter Khoi Nguyen, Sinisa Todorovic
PDF
Image Animation with Perturbed Masks Yoav Shalev, Lior Wolf
PDF
Image Based Reconstruction of Liquids from 2D Surface Detections Florian Richter, Ryan K. Orosco, Michael C. Yip
PDF
Image Dehazing Transformer with Transmission-Aware 3D Position Embedding Chun-Le Guo, Qixin Yan, Saeed Anwar, Runmin Cong, Wenqi Ren, Chongyi Li
PDF
Image Disentanglement Autoencoder for Steganography Without Embedding Xiyao Liu, Ziping Ma, Junxing Ma, Jian Zhang, Gerald Schaefer, Hui Fang
PDF
Image Segmentation Using Text and Image Prompts Timo Lüddecke, Alexander Ecker
PDF
Image-to-LiDAR Self-Supervised Distillation for Autonomous Driving Data Corentin Sautier, Gilles Puy, Spyros Gidaris, Alexandre Boulch, Andrei Bursuc, Renaud Marlet
PDF
ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations Mingwu Zheng, Hongyu Yang, Di Huang, Liming Chen
PDF
Implicit Feature Decoupling with Depthwise Quantization Iordanis Fostiropoulos, Barry Boehm
PDF
Implicit Motion Handling for Video Camouflaged Object Detection Xuelian Cheng, Huan Xiong, Deng-Ping Fan, Yiran Zhong, Mehrtash Harandi, Tom Drummond, Zongyuan Ge
PDF
Implicit Sample Extension for Unsupervised Person Re-Identification Xinyu Zhang, Dongdong Li, Zhigang Wang, Jian Wang, Errui Ding, Javen Qinfeng Shi, Zhaoxiang Zhang, Jingdong Wang
PDF
ImplicitAtlas: Learning Deformable Shape Templates in Medical Imaging Jiancheng Yang, Udaranga Wickramasinghe, Bingbing Ni, Pascal Fua
PDF
Imposing Consistency for Optical Flow Estimation Jisoo Jeong, Jamie Menjay Lin, Fatih Porikli, Nojun Kwak
PDF
Improving Adversarial Transferability via Neuron Attribution-Based Attacks Jianping Zhang, Weibin Wu, Jen-tse Huang, Yizhan Huang, Wenxuan Wang, Yuxin Su, Michael R. Lyu
PDF
Improving Adversarially Robust Few-Shot Image Classification with Generalizable Representations Junhao Dong, Yuan Wang, Jian-Huang Lai, Xiaohua Xie
PDF
Improving GAN Equilibrium by Raising Spatial Awareness Jianyuan Wang, Ceyuan Yang, Yinghao Xu, Yujun Shen, Hongdong Li, Bolei Zhou
PDF
Improving Neural Implicit Surfaces Geometry with Patch Warping François Darmon, Bénédicte Bascle, Jean-Clément Devaux, Pascal Monasse, Mathieu Aubry
PDF
Improving Robustness Against Stealthy Weight Bit-Flip Attacks by Output Code Matching Ozan Özdenizci, Robert Legenstein
PDF
Improving Segmentation of the Inferior Alveolar Nerve Through Deep Label Propagation Marco Cipriano, Stefano Allegretti, Federico Bolelli, Federico Pollastri, Costantino Grana
PDF
Improving Subgraph Recognition with Variational Graph Information Bottleneck Junchi Yu, Jie Cao, Ran He
PDF
Improving the Transferability of Targeted Adversarial Examples Through Object-Based Diverse Input Junyoung Byun, Seungju Cho, Myung-Joon Kwon, Hee-Seon Kim, Changick Kim
PDF
Improving Video Model Transfer with Dynamic Representation Learning Yi Li, Nuno Vasconcelos
PDF
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning Li Yang, Yan Xu, Chunfeng Yuan, Wei Liu, Bing Li, Weiming Hu
PDF
Incorporating Semi-Supervised and Positive-Unlabeled Learning for Boosting Full Reference Image Quality Assessment Yue Cao, Zhaolin Wan, Dongwei Ren, Zifei Yan, Wangmeng Zuo
PDF
Incremental Cross-View Mutual Distillation for Self-Supervised Medical CT Synthesis Chaowei Fang, Liang Wang, Dingwen Zhang, Jun Xu, Yixuan Yuan, Junwei Han
PDF
Incremental Learning in Semantic Segmentation from Image Labels Fabio Cermelli, Dario Fontanel, Antonio Tavera, Marco Ciccone, Barbara Caputo
PDF
Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding Qiaole Dong, Chenjie Cao, Yanwei Fu
PDF
Industrial Style Transfer with Large-Scale Geometric Warping and Content Preservation Jinchao Yang, Fei Guo, Shuo Chen, Jun Li, Jian Yang
PDF
Inertia-Guided Flow Completion and Style Fusion for Video Inpainting Kaidong Zhang, Jingjing Fu, Dong Liu
PDF
InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition Hyung-gun Chi, Myoung Hoon Ha, Seunggeun Chi, Sang Wan Lee, Qixing Huang, Karthik Ramani
PDF
InfoNeRF: Ray Entropy Minimization for Few-Shot Neural Volume Rendering Mijeong Kim, Seonguk Seo, Bohyung Han
PDF
Infrared Invisible Clothing: Hiding from Infrared Detectors at Multiple Angles in Real World Xiaopei Zhu, Zhanhao Hu, Siyuan Huang, Jianmin Li, Xiaolin Hu
PDF
Injecting Semantic Concepts into End-to-End Image Captioning Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lin Liang, Zhe Gan, Lijuan Wang, Yezhou Yang, Zicheng Liu
PDF
InOut: Diverse Image Outpainting via GAN Inversion Yen-Chi Cheng, Chieh Hubert Lin, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Ming-Hsuan Yang
PDF
Input-Level Inductive Biases for 3D Reconstruction Wang Yifan, Carl Doersch, Relja Arandjelović, João Carreira, Andrew Zisserman
PDF
INS-Conv: Incremental Sparse Convolution for Online 3D Segmentation Leyao Liu, Tian Zheng, Yun-Jou Lin, Kai Ni, Lu Fang
PDF
InsetGAN for Full-Body Image Generation Anna Frühstück, Krishna Kumar Singh, Eli Shechtman, Niloy J. Mitra, Peter Wonka, Jingwan Lu
PDF
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer Soohyun Kim, Jongbeom Baek, Jihye Park, Gyeongnyeon Kim, Seungryong Kim
PDF
Instance Segmentation with Mask-Supervised Polygonal Boundary Transformers Justin Lazarow, Weijian Xu, Zhuowen Tu
PDF
Instance-Aware Dynamic Neural Network Quantization Zhenhua Liu, Yunhe Wang, Kai Han, Siwei Ma, Wen Gao
PDF
Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation De Cheng, Tongliang Liu, Yixiong Ning, Nannan Wang, Bo Han, Gang Niu, Xinbo Gao, Masashi Sugiyama
PDF
Instance-Wise Occlusion and Depth Orders in Natural Scenes Hyunmin Lee, Jaesik Park
PDF
Integrating Language Guidance into Vision-Based Deep Metric Learning Karsten Roth, Oriol Vinyals, Zeynep Akata
PDF
Integrative Few-Shot Learning for Classification and Segmentation Dahyun Kang, Minsu Cho
PDF
IntentVizor: Towards Generic Query Guided Interactive Video Summarization Guande Wu, Jianzhe Lin, Claudio T. Silva
PDF
Interact Before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition Lijin Yang, Yifei Huang, Yusuke Sugano, Yoichi Sato
PDF
Interacting Attention Graph for Single Image Two-Hand Reconstruction Mengcheng Li, Liang An, Hongwen Zhang, Lianpeng Wu, Feng Chen, Tao Yu, Yebin Liu
PDF
Interactive Disentanglement: Learning Concepts by Interacting with Their Prototype Representations Wolfgang Stammer, Marius Memmel, Patrick Schramowski, Kristian Kersting
PDF
Interactive Image Synthesis with Panoptic Layout Generation Bo Wang, Tao Wu, Minfeng Zhu, Peng Du
PDF
Interactive Multi-Class Tiny-Object Detection Chunggi Lee, Seonwook Park, Heon Song, Jeongun Ryu, Sanghoon Kim, Haejoon Kim, Sérgio Pereira, Donggeun Yoo
PDF
Interactive Segmentation and Visualization for Tiny Objects in Multi-Megapixel Images Chengyuan Xu, Boning Dong, Noah Stier, Curtis McCully, D. Andrew Howell, Pradeep Sen, Tobias Höllerer
PDF
Interactiveness Field in Human-Object Interactions Xinpeng Liu, Yong-Lu Li, Xiaoqian Wu, Yu-Wing Tai, Cewu Lu, Chi-Keung Tang
PDF
Interactron: Embodied Adaptive Object Detection Klemen Kotar, Roozbeh Mottaghi
PDF
Interpretable Part-Whole Hierarchies and Conceptual-Semantic Relationships in Neural Networks Nicola Garau, Niccolò Bisagno, Zeno Sambugaro, Nicola Conci
PDF
Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs Paul Wimmer, Jens Mehnert, Alexandru Condurache
PDF
IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization Yunshan Zhong, Mingbao Lin, Gongrui Nan, Jianzhuang Liu, Baochang Zhang, Yonghong Tian, Rongrong Ji
PDF
Invariant Grounding for Video Question Answering Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-Seng Chua
PDF
Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving Hanjiang Hu, Zuxin Liu, Sharad Chitlangia, Akhil Agnihotri, Ding Zhao
PDF
Investigating Top-K White-Box and Transferable Black-Box Attack Chaoning Zhang, Philipp Benz, Adil Karjauv, Jae Won Cho, Kang Zhang, In So Kweon
PDF
Investigating Tradeoffs in Real-World Video Super-Resolution Kelvin C.K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy
PDF
iPLAN: Interactive and Procedural Layout Planning Feixiang He, Yanlong Huang, He Wang
PDF
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes Rui Zhu, Zhengqin Li, Janarbek Matai, Fatih Porikli, Manmohan Chandraker
PDF
IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images Kai Zhang, Fujun Luan, Zhengqi Li, Noah Snavely
PDF
Is Mapping Necessary for Realistic PointGoal Navigation? Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv Batra, Oleksandr Maksymets
PDF
ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-High Resolution Segmentation Shaohua Guo, Liang Liu, Zhenye Gan, Yabiao Wang, Wuhao Zhang, Chengjie Wang, Guannan Jiang, Wei Zhang, Ran Yi, Lizhuang Ma, Ke Xu
PDF
ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image Prior Metin Ersin Arican, Ozgur Kara, Gustav Bredell, Ender Konukoglu
PDF
ISNet: Shape Matters for Infrared Small Target Detection Mingjin Zhang, Rui Zhang, Yuxiang Yang, Haichen Bai, Jing Zhang, Jie Guo
PDF
It Is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection Youssef Mohamed, Faizan Farooq Khan, Kilichbek Haydarov, Mohamed Elhoseiny
PDF
It's About Time: Analog Clock Reading in the Wild Charig Yang, Weidi Xie, Andrew Zisserman
PDF
It's All in the Teacher: Zero-Shot Quantization Brought Closer to the Teacher Kanghyun Choi, Hye Yoon Lee, Deokki Hong, Joonsang Yu, Noseong Park, Youngsok Kim, Jinho Lee
PDF
It's Time for Artistic Correspondence in Music and Video Dídac Surís, Carl Vondrick, Bryan Russell, Justin Salamon
PDF
Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects Manuel Stoiber, Martin Sundermeyer, Rudolph Triebel
PDF
Iterative Deep Homography Estimation Si-Yuan Cao, Jianxin Hu, Zehua Sheng, Hui-Liang Shen
PDF
IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Marc Pollefeys
PDF
Ithaca365: Dataset and Driving Perception Under Repeated and Challenging Weather Conditions Carlos A. Diaz-Ruiz, Youya Xia, Yurong You, Jose Nino, Junan Chen, Josephine Monica, Xiangyu Chen, Katie Luo, Yan Wang, Marc Emond, Wei-Lun Chao, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell
PDF
ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks WeiQin Chuah, Ruwan Tennakoon, Reza Hoseinnezhad, Alireza Bab-Hadiashar, David Suter
PDF
JIFF: Jointly-Aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction Yukang Cao, Guanying Chen, Kai Han, Wenqi Yang, Kwan-Yee K. Wong
PDF
JoinABLe: Learning Bottom-up Assembly of Parametric CAD Joints Karl D.D. Willis, Pradeep Kumar Jayaraman, Hang Chu, Yunsheng Tian, Yifei Li, Daniele Grandi, Aditya Sanghi, Linh Tran, Joseph G. Lambourne, Armando Solar-Lezama, Wojciech Matusik
PDF
Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification Jiangtao Xie, Fei Long, Jiaming Lv, Qilong Wang, Peihua Li
PDF
Joint Forecasting of Panoptic Segmentations with Difference Attention Colin Graber, Cyril Jazra, Wenjie Luo, Liangyan Gui, Alexander G. Schwing
PDF
Joint Global and Local Hierarchical Priors for Learned Image Compression Jun-Hyuk Kim, Byeongho Heo, Jong-Seok Lee
PDF
Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos Shaowei Liu, Subarna Tripathi, Somdeb Majumdar, Xiaolong Wang
PDF
Joint Video Summarization and Moment Localization by Cross-Task Sample Transfer Hao Jiang, Yadong Mu
PDF
JRDB-Act: A Large-Scale Dataset for Spatio-Temporal Action, Social Group and Activity Detection Mahsa Ehsanpour, Fatemeh Saleh, Silvio Savarese, Ian Reid, Hamid Rezatofighi
PDF
Kernelized Few-Shot Object Detection with Efficient Integral Aggregation Shan Zhang, Lei Wang, Naila Murray, Piotr Koniusz
PDF
Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation Shreyas Hampali, Sayan Deb Sarkar, Mahdi Rad, Vincent Lepetit
PDF
KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos David Novotny, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi
PDF
KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning Shyamgopal Karthik, Massimiliano Mancini, Zeynep Akata
PDF
Killing Two Birds with One Stone: Efficient and Robust Training of Face Recognition CNNs by Partial FC Xiang An, Jiankang Deng, Jia Guo, Ziyong Feng, XuHan Zhu, Jing Yang, Tongliang Liu
PDF
KNN Local Attention for Image Restoration Hunsang Lee, Hyesong Choi, Kwanghoon Sohn, Dongbo Min
PDF
Knowledge Distillation as Efficient Pre-Training: Faster Convergence, Higher Data-Efficiency, and Better Transferability Ruifei He, Shuyang Sun, Jihan Yang, Song Bai, Xiaojuan Qi
PDF
Knowledge Distillation via the Target-Aware Transformer Sihao Lin, Hongwei Xie, Bing Wang, Kaicheng Yu, Xiaojun Chang, Xiaodan Liang, Gang Wang
PDF
Knowledge Distillation with the Reused Teacher Classifier Defang Chen, Jian-Ping Mei, Hailin Zhang, Can Wang, Yan Feng, Chun Chen
PDF
Knowledge Distillation: A Good Teacher Is Patient and Consistent Lucas Beyer, Xiaohua Zhai, Amélie Royer, Larisa Markeeva, Rohan Anil, Alexander Kolesnikov
PDF
Knowledge Mining with Scene Text for Fine-Grained Recognition Hao Wang, Junchao Liao, Tianheng Cheng, Zewen Gao, Hao Liu, Bo Ren, Xiang Bai, Wenyu Liu
PDF
Knowledge-Driven Self-Supervised Representation Learning for Facial Action Unit Recognition Yanan Chang, Shangfei Wang
PDF
Kubric: A Scalable Dataset Generator Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, Matan Sela, Vincent Sitzmann, Austin Stone, Deqing Sun, Suhani Vora, Ziyu Wang, Tianhao Wu, Kwang Moo Yi, Fangcheng Zhong, Andrea Tagliasacchi
PDF
L-Verse: Bidirectional Generation Between Image and Text Taehoon Kim, Gwangmo Song, Sihaeng Lee, Sangyun Kim, Yewon Seo, Soonyoung Lee, Seung Hwan Kim, Honglak Lee, Kyunghoon Bae
PDF
L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation Peng-Tao Jiang, Yuqi Yang, Qibin Hou, Yunchao Wei
PDF
Label Matching Semi-Supervised Object Detection Binbin Chen, Weijie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, Shiliang Pu, Mingli Song, Yueting Zhuang
PDF
Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification Jingzhou Chen, Peng Wang, Jian Liu, Yuntao Qian
PDF
Label-Only Model Inversion Attacks via Boundary Repulsion Mostafa Kahla, Si Chen, Hoang Anh Just, Ruoxi Jia
PDF
Label, Verify, Correct: A Simple Few Shot Object Detection Method Prannay Kaul, Weidi Xie, Andrew Zisserman
PDF
Lagrange Motion Analysis and View Embeddings for Improved Gait Recognition Tianrui Chai, Annan Li, Shaoxiong Zhang, Zilong Li, Yunhong Wang
PDF
LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints Junshu Tang, Zhijun Gong, Ran Yi, Yuan Xie, Lizhuang Ma
PDF
Language as Queries for Referring Video Object Segmentation Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo
PDF
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu
PDF
LAR-SR: A Local Autoregressive Model for Image Super-Resolution Baisong Guo, Xiaoyun Zhang, Haoning Wu, Yu Wang, Ya Zhang, Yan-Feng Wang
PDF
Large Loss Matters in Weakly Supervised Multi-Label Classification Youngwook Kim, Jae Myung Kim, Zeynep Akata, Jungwoo Lee
PDF
Large-Scale Pre-Training for Person Re-Identification with Noisy Labels Dengpan Fu, Dongdong Chen, Hao Yang, Jianmin Bao, Lu Yuan, Lei Zhang, Houqiang Li, Fang Wen, Dong Chen
PDF
Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark Jiaxu Miao, Xiaohan Wang, Yu Wu, Wei Li, Xu Zhang, Yunchao Wei, Yi Yang
PDF
LARGE: Latent-Based Regression Through GAN Semantics Yotam Nitzan, Rinon Gal, Ofir Brenner, Daniel Cohen-Or
PDF
LAS-AT: Adversarial Training with Learnable Attack Strategy Xiaojun Jia, Yong Zhang, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao
PDF
LASER: LAtent SpacE Rendering for 2D Visual Localization Zhixiang Min, Naji Khosravan, Zachary Bessinger, Manjunath Narayana, Sing Bing Kang, Enrique Dunn, Ivaylo Boyadzhiev
PDF
LaTr: Layout-Aware Transformer for Scene-Text VQA Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
PDF
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H.S. Torr
PDF
Layer-Wised Model Aggregation for Personalized Federated Learning Xiaosong Ma, Jie Zhang, Song Guo, Wenchao Xu
PDF
Layered Depth Refinement with Mask Guidance Soo Ye Kim, Jianming Zhang, Simon Niklaus, Yifei Fan, Simon Chen, Zhe Lin, Munchurl Kim
PDF
LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network Hochang Rhee, Yeong Il Jang, Seyun Kim, Nam Ik Cho
PDF
LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition Dan Liu, Libo Zhang, Yanjun Wu
PDF
Learn from Others and Be Yourself in Heterogeneous Federated Learning Wenke Huang, Mang Ye, Bo Du
PDF
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos Saghir Alfasly, Jian Lu, Chen Xu, Yuru Zou
PDF
Learnable Lookup Table for Neural Network Quantization Longguang Wang, Xiaoyu Dong, Yingqian Wang, Li Liu, Wei An, Yulan Guo
PDF
Learned Queries for Efficient Local Attention Moab Arar, Ariel Shamir, Amit H. Bermano
PDF
Learning 3D Object Shape and Layout Without 3D Supervision Georgia Gkioxari, Nikhila Ravi, Justin Johnson
PDF
Learning a Structured Latent Space for Unsupervised Point Cloud Completion Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li
PDF
Learning ABCs: Approximate Bijective Correspondence for Isolating Factors of Variation with Weak Supervision Kieran A. Murphy, Varun Jampani, Srikumar Ramalingam, Ameesh Makadia
PDF
Learning Adaptive Warping for Real-World Rolling Shutter Correction Mingdeng Cao, Zhihang Zhong, Jiahao Wang, Yinqiang Zheng, Yujiu Yang
PDF
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers Lixiang Ru, Yibing Zhan, Baosheng Yu, Bo Du
PDF
Learning Affordance Grounding from Exocentric Images Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao
PDF
Learning Based Multi-Modality Image and Video Compression Guo Lu, Tianxiong Zhong, Jing Geng, Qiang Hu, Dong Xu
PDF
Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning Qingsen Yan, Dong Gong, Yuhang Liu, Anton van den Hengel, Javen Qinfeng Shi
PDF
Learning Canonical F-Correlation Projection for Compact Multiview Representation Yun-Hao Yuan, Jin Li, Yun Li, Jipeng Qiang, Yi Zhu, Xiaobo Shen, Jianping Gou
PDF
Learning Deep Implicit Functions for 3D Shapes with Dynamic Code Clouds Tianyang Li, Xin Wen, Yu-Shen Liu, Hua Su, Zhizhong Han
PDF
Learning Distinctive Margin Toward Active Domain Adaptation Ming Xie, Yuxi Li, Yabiao Wang, Zekun Luo, Zhenye Gan, Zhongyi Sun, Mingmin Chi, Chengjie Wang, Pei Wang
PDF
Learning Fair Classifiers with Partially Annotated Group Labels Sangwon Jung, Sanghyuk Chun, Taesup Moon
PDF
Learning from All Vehicles Dian Chen, Philipp Krähenbühl
PDF
Learning from Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian
PDF
Learning from Temporal Gradient for Semi-Supervised Action Recognition Junfei Xiao, Longlong Jing, Lin Zhang, Ju He, Qi She, Zongwei Zhou, Alan Yuille, Yingwei Li
PDF
Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency Zhiwu Qing, Shiwei Zhang, Ziyuan Huang, Yi Xu, Xiang Wang, Mingqian Tang, Changxin Gao, Rong Jin, Nong Sang
PDF
Learning Graph Regularisation for Guided Super-Resolution Riccardo de Lutio, Alexander Becker, Stefano D'Aronco, Stefania Russo, Jan D. Wegner, Konrad Schindler
PDF
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou
PDF
Learning Invisible Markers for Hidden Codes in Offline-to-Online Photography Jun Jia, Zhongpai Gao, Dandan Zhu, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang
PDF
Learning Local Displacements for Point Cloud Completion Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari
PDF
Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation Nan Xue, Tianfu Wu, Gui-Song Xia, Liangpei Zhang
PDF
Learning Memory-Augmented Unidirectional Metrics for Cross-Modality Person Re-Identification Jialun Liu, Yifan Sun, Feng Zhu, Hongbin Pei, Yi Yang, Wenhui Li
PDF
Learning Modal-Invariant and Temporal-Memory for Video-Based Visible-Infrared Person Re-Identification Xinyu Lin, Jinxing Li, Zeyu Ma, Huafeng Li, Shuang Li, Kaixiong Xu, Guangming Lu, David Zhang
PDF
Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera Jae Shin Yoon, Duygu Ceylan, Tuanfeng Y. Wang, Jingwan Lu, Jimei Yang, Zhixin Shu, Hyun Soo Park
PDF
Learning Multi-View Aggregation in the Wild for Large-Scale 3D Semantic Segmentation Damien Robert, Bruno Vallet, Loic Landrieu
PDF
Learning Multiple Adverse Weather Removal via Two-Stage Knowledge Learning and Multi-Contrastive Regularization: Toward a Unified Model Wei-Ting Chen, Zhi-Kai Huang, Cheng-Che Tsai, Hao-Hsiang Yang, Jian-Jiun Ding, Sy-Yen Kuo
PDF
Learning Multiple Dense Prediction Tasks from Partially Annotated Data Wei-Hong Li, Xialei Liu, Hakan Bilen
PDF
Learning Neural Light Fields with Ray-Space Embedding Benjamin Attal, Jia-Bin Huang, Michael Zollhöfer, Johannes Kopf, Changil Kim
PDF
Learning Non-Target Knowledge for Few-Shot Semantic Segmentation Yuanwei Liu, Nian Liu, Qinglong Cao, Xiwen Yao, Junwei Han, Ling Shao
PDF
Learning Object Context for Novel-View Scene Layout Generation Xiaotian Qiao, Gerhard P. Hancke, Rynson W.H. Lau
PDF
Learning of Global Objective for Network Flow in Multi-Object Tracking Shuai Li, Yu Kong, Hamid Rezatofighi
PDF
Learning Optical Flow with Kernel Patch Attention Ao Luo, Fan Yang, Xin Li, Shuaicheng Liu
PDF
Learning Optimal K-Space Acquisition and Reconstruction Using Physics-Informed Neural Networks Wei Peng, Li Feng, Guoying Zhao, Fang Liu
PDF
Learning Part Segmentation Through Unsupervised Domain Adaptation from Synthetic Vehicles Qing Liu, Adam Kortylewski, Zhishuai Zhang, Zizhang Li, Mengqi Guo, Qihao Liu, Xiaoding Yuan, Jiteng Mu, Weichao Qiu, Alan Yuille
PDF
Learning Pixel Trajectories with Multiscale Contrastive Random Walks Zhangxing Bian, Allan Jabri, Alexei A. Efros, Andrew Owens
PDF
Learning Pixel-Level Distinctions for Video Highlight Detection Fanyue Wei, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
PDF
Learning Program Representations for Food Images and Cooking Recipes Dim P. Papadopoulos, Enrique Mora, Nadiia Chepurko, Kuan Wei Huang, Ferda Ofli, Antonio Torralba
PDF
Learning Robust Image-Based Rendering on Sparse Scene Geometry via Depth Completion Yuqi Sun, Shili Zhou, Ri Cheng, Weimin Tan, Bo Yan, Lang Fu
PDF
Learning Second Order Local Anomaly for General Face Forgery Detection Jianwei Fei, Yunshu Dai, Peipeng Yu, Tianrun Shen, Zhihua Xia, Jian Weng
PDF
Learning Semantic Associations for Mirror Detection Huankang Guan, Jiaying Lin, Rynson W.H. Lau
PDF
Learning Soft Estimator of Keypoint Scale and Orientation with Probabilistic Covariant Loss Pei Yan, Yihua Tan, Shengzhou Xiong, Yuan Tai, Yansheng Li
PDF
Learning sRGB-to-Raw-RGB De-Rendering with Content-Aware Metadata Seonghyeon Nam, Abhijith Punnappurath, Marcus A. Brubaker, Michael S. Brown
PDF
Learning Structured Gaussians to Approximate Deep Ensembles Ivor J. A. Simpson, Sara Vicente, Neill D. F. Campbell
PDF
Learning to Affiliate: Mutual Centralized Learning for Few-Shot Classification Yang Liu, Weifeng Zhang, Chao Xiang, Tu Zheng, Deng Cai, Xiaofei He
PDF
Learning to Align Sequential Actions in the Wild Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua, Marc Pollefeys
PDF
Learning to Answer Questions in Dynamic Audio-Visual Scenarios Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu
PDF
Learning to Anticipate Future with Dynamic Context Removal Xinyu Xu, Yong-Lu Li, Cewu Lu
PDF
Learning to Collaborate in Decentralized Learning of Personalized Models Shuangtong Li, Tianyi Zhou, Xinmei Tian, Dacheng Tao
PDF
Learning to Deblur Using Light Field Generated and Real Defocus Images Lingyan Ruan, Bin Chen, Jizhou Li, Miuling Lam
PDF
Learning to Detect Mobile Objects from LiDAR Scans Without Labels Yurong You, Katie Luo, Cheng Perng Phoo, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger
PDF
Learning to Detect Scene Landmarks for Camera Localization Tien Do, Ondrej Miksik, Joseph DeGol, Hyun Soo Park, Sudipta N. Sinha
PDF
Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes Hongsuk Choi, Gyeongsik Moon, JoonKyu Park, Kyoung Mu Lee
PDF
Learning to Find Good Models in RANSAC Daniel Barath, Luca Cavalli, Marc Pollefeys
PDF
Learning to Generate Line Drawings That Convey Geometry and Semantics Caroline Chan, Frédo Durand, Phillip Isola
PDF
Learning to Imagine: Diversify Memory for Incremental Learning Using Unlabeled Data Yu-Ming Tang, Yi-Xing Peng, Wei-Shi Zheng
PDF
Learning to Learn Across Diverse Data Biases in Deep Face Recognition Chang Liu, Xiang Yu, Yi-Hsuan Tsai, Masoud Faraki, Ramin Moslemi, Manmohan Chandraker, Yun Fu
PDF
Learning to Learn and Remember Super Long Multi-Domain Task Sequence Zhenyi Wang, Li Shen, Tiehang Duan, Donglin Zhan, Le Fang, Mingchen Gao
PDF
Learning to Learn by Jointly Optimizing Neural Architecture and Weights Yadong Ding, Yu Wu, Chengyue Huang, Siliang Tang, Yi Yang, Longhui Wei, Yueting Zhuang, Qi Tian
PDF
Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion Evonne Ng, Hanbyul Joo, Liwen Hu, Hao Li, Trevor Darrell, Angjoo Kanazawa, Shiry Ginosar
PDF
Learning to Memorize Feature Hallucination for One-Shot Image Generation Yu Xie, Yanwei Fu, Ying Tai, Yun Cao, Junwei Zhu, Chengjie Wang
PDF
Learning to Prompt for Continual Learning Zifeng Wang, Zizhao Zhang, Chen-Yu Lee, Han Zhang, Ruoxi Sun, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer Dy, Tomas Pfister
PDF
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li
PDF
Learning to Recognize Procedural Activities with Distant Supervision Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani
PDF
Learning to Refactor Action and Co-Occurrence Features for Temporal Action Localization Kun Xia, Le Wang, Sanping Zhou, Nanning Zheng, Wei Tang
PDF
Learning to Restore 3D Face from In-the-Wild Degraded Images Zhenyu Zhang, Yanhao Ge, Ying Tai, Xiaoming Huang, Chengjie Wang, Hao Tang, Dongjin Huang, Zhifeng Xie
PDF
Learning to Solve Hard Minimal Problems Petr Hruby, Timothy Duff, Anton Leykin, Tomas Pajdla
PDF
Learning to Zoom Inside Camera Imaging Pipeline Chengzhou Tang, Yuqiang Yang, Bing Zeng, Ping Tan, Shuaicheng Liu
PDF
Learning Trajectory-Aware Transformer for Video Super-Resolution Chengxu Liu, Huan Yang, Jianlong Fu, Xueming Qian
PDF
Learning Transferable Human-Object Interaction Detector with Natural Language Supervision Suchen Wang, Yueqi Duan, Henghui Ding, Yap-Peng Tan, Kim-Hui Yap, Junsong Yuan
PDF
Learning Video Representations of Human Motion from Synthetic Data Xi Guo, Wei Wu, Dongliang Wang, Jing Su, Haisheng Su, Weihao Gan, Jian Huang, Qin Yang
PDF
Learning What Not to Segment: A New Perspective on Few-Shot Segmentation Chunbo Lang, Gong Cheng, Binfei Tu, Junwei Han
PDF
Learning Where to Learn in Cross-View Self-Supervised Learning Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki
PDF
Learning with Neighbor Consistency for Noisy Labels Ahmet Iscen, Jack Valmadre, Anurag Arnab, Cordelia Schmid
PDF
Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification Mouxing Yang, Zhenyu Huang, Peng Hu, Taihao Li, Jiancheng Lv, Xi Peng
PDF
Lepard: Learning Partial Point Cloud Matching in Rigid and Deformable Scenes Yang Li, Tatsuya Harada
PDF
Less Is More: Generating Grounded Navigation Instructions from Landmarks Su Wang, Ceslee Montgomery, Jordi Orbay, Vighnesh Birodkar, Aleksandra Faust, Izzeddin Gur, Natasha Jaques, Austin Waters, Jason Baldridge, Peter Anderson
PDF
Leveling Down in Computer Vision: Pareto Inefficiencies in Fair Deep Classifiers Dominik Zietlow, Michael Lohaus, Guha Balakrishnan, Matthäus Kleindessner, Francesco Locatello, Bernhard Schölkopf, Chris Russell
PDF
Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy Tong Zhang, Congpei Qiu, Wei Ke, Sabine Süsstrunk, Mathieu Salzmann
PDF
Leveraging Adversarial Examples to Quantify Membership Information Leakage Ganesh Del Grosso, Hamid Jalalzai, Georg Pichler, Catuscia Palamidessi, Pablo Piantanida
PDF
Leveraging Equivariant Features for Absolute Pose Regression Mohamed Adel Musallam, Vincent Gaudillière, Miguel Ortiz del Castillo, Kassem Al Ismaeil, Djamila Aouada
PDF
Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection Alexandros Haliassos, Rodrigo Mira, Stavros Petridis, Maja Pantic
PDF
Leveraging Self-Supervision for Cross-Domain Crowd Counting Weizhe Liu, Nikita Durasov, Pascal Fua
PDF
LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network Zhigang Jiang, Zhongzheng Xiang, Jinhua Xu, Ming Zhao
PDF
LiDAR Snowfall Simulation for Robust 3D Object Detection Martin Hahner, Christos Sakaridis, Mario Bijelic, Felix Heide, Fisher Yu, Dengxin Dai, Luc Van Gool
PDF
LiDARCap: Long-Range Marker-Less 3D Human Motion Capture with LiDAR Point Clouds Jialian Li, Jingyi Zhang, Zhiyong Wang, Siqi Shen, Chenglu Wen, Yuexin Ma, Lan Xu, Jingyi Yu, Cheng Wang
PDF
Lifelong Graph Learning Chen Wang, Yuheng Qiu, Dasong Gao, Sebastian Scherer
PDF
Lifelong Unsupervised Domain Adaptive Person Re-Identification with Coordinated Anti-Forgetting and Adaptation Zhipeng Huang, Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Zheng-Jun Zha
PDF
LIFT: Learning 4D LiDAR Image Fusion Transformer for 3D Object Detection Yihan Zeng, Da Zhang, Chunwei Wang, Zhenwei Miao, Ting Liu, Xin Zhan, Dayang Hao, Chao Ma
PDF
Light Field Neural Rendering Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia
PDF
Likert Scoring with Grade Decoupling for Long-Term Action Assessment Angchi Xu, Ling-An Zeng, Wei-Shi Zheng
PDF
LISA: Learning Implicit Shape and Appearance of Hands Enric Corona, Tomas Hodan, Minh Vo, Francesc Moreno-Noguer, Chris Sweeney, Richard Newcombe, Lingni Ma
PDF
LiT: Zero-Shot Transfer with Locked-Image Text Tuning Xiaohua Zhai, Xiao Wang, Basil Mustafa, Andreas Steiner, Daniel Keysers, Alexander Kolesnikov, Lucas Beyer
PDF
Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation Yihan Wang, Muyang Li, Han Cai, Wei-Ming Chen, Song Han
PDF
Lite Vision Transformer with Enhanced Self-Attention Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille
PDF
Lite-MDETR: A Lightweight Multi-Modal Detector Qian Lou, Yen-Chang Hsu, Burak Uzkent, Ting Hua, Yilin Shen, Hongxia Jin
PDF
LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking Duy M. H. Nguyen, Roberto Henschel, Bodo Rosenhahn, Daniel Sonntag, Paul Swoboda
PDF
Local Attention Pyramid for Scene Image Generation Sang-Heon Shim, Sangeek Hyun, DaeHyun Bae, Jae-Pil Heo
PDF
Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning Matias Mendieta, Taojiannan Yang, Pu Wang, Minwoo Lee, Zhengming Ding, Chen Chen
PDF
Local Texture Estimator for Implicit Representation Function Jaewon Lee, Kyong Hwan Jin
PDF
Local-Adaptive Face Recognition via Graph-Based Meta-Clustering and Regularized Adaptation Wenbin Zhu, Chien-Yi Wang, Kuan-Lun Tseng, Shang-Hong Lai, Baoyuan Wang
PDF
Locality-Aware Inter- and Intra-Video Reconstruction for Self-Supervised Correspondence Learning Liulei Li, Tianfei Zhou, Wenguan Wang, Lu Yang, Jianwu Li, Yi Yang
PDF
Localization Distillation for Dense Object Detection Zhaohui Zheng, Rongguang Ye, Ping Wang, Dongwei Ren, Wangmeng Zuo, Qibin Hou, Ming-Ming Cheng
PDF
Localized Adversarial Domain Generalization Wei Zhu, Le Lu, Jing Xiao, Mei Han, Jiebo Luo, Adam P. Harrison
PDF
Location-Free Human Pose Estimation Xixia Xu, Yingguo Gao, Ke Yan, Xue Lin, Qi Zou
PDF
LOLNerf: Learn from One Look Daniel Rebain, Mark Matthews, Kwang Moo Yi, Dmitry Lagun, Andrea Tagliasacchi
PDF
Long-Short Temporal Contrastive Learning of Video Transformers Jue Wang, Gedas Bertasius, Du Tran, Lorenzo Torresani
PDF
Long-Tail Recognition via Compositional Knowledge Transfer Sarah Parisot, Pedro M. Esperança, Steven McDonagh, Tamas J. Madarasz, Yongxin Yang, Zhenguo Li
PDF
Long-Tailed Recognition via Weight Balancing Shaden Alshammari, Yu-Xiong Wang, Deva Ramanan, Shu Kong
PDF
Long-Tailed Visual Recognition via Gaussian Clouded Logit Adjustment Mengke Li, Yiu-ming Cheung, Yang Lu
PDF
Long-Term Video Frame Interpolation via Feature Propagation Dawit Mureja Argaw, In So Kweon
PDF
Long-Term Visual mAP Sparsification with Heterogeneous GNN Ming-Fang Chang, Yipu Zhao, Rajvi Shah, Jakob J. Engel, Michael Kaess, Simon Lucey
PDF
Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling Takashi Isobe, Xu Jia, Xin Tao, Changlin Li, Ruihuang Li, Yongjie Shi, Jing Mu, Huchuan Lu, Yu-Wing Tai
PDF
Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator Yuxin Kong, Canjie Luo, Weihong Ma, Qiyuan Zhu, Shenggao Zhu, Nicholas Yuan, Lianwen Jin
PDF
Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos Tomáš Souček, Jean-Baptiste Alayrac, Antoine Miech, Ivan Laptev, Josef Sivic
PDF
Look Outside the Room: Synthesizing a Consistent Long-Term 3D Scene Video from a Single Image Xuanchi Ren, Xiaolong Wang
PDF
Low-Resource Adaptation for Personalized Co-Speech Gesture Generation Chaitanya Ahuja, Dong Won Lee, Louis-Philippe Morency
PDF
LSVC: A Learning-Based Stereo Video Compression Framework Zhenghao Chen, Guo Lu, Zhihao Hu, Shan Liu, Wei Jiang, Dong Xu
PDF
LTP: Lane-Based Trajectory Prediction for Autonomous Driving Jingke Wang, Tengju Ye, Ziqing Gu, Junbo Chen
PDF
M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction Qiao Sun, Xin Huang, Junru Gu, Brian C. Williams, Hang Zhao
PDF
M3L: Language-Based Video Editing via Multi-Modal Multi-Level Transformers Tsu-Jui Fu, Xin Eric Wang, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang
PDF
M3T: Three-Dimensional Medical Image Classifier Using Multi-Plane and Multi-Slice Transformer Jinseong Jang, Dosik Hwang
PDF
M5Product: Self-Harmonized Contrastive Learning for E-Commercial Multi-Modal Pretraining Xiao Dong, Xunlin Zhan, Yangxin Wu, Yunchao Wei, Michael C. Kampffmeyer, Xiaoyong Wei, Minlong Lu, Yaowei Wang, Xiaodan Liang
PDF
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions Mattia Soldan, Alejandro Pardo, Juan León Alcázar, Fabian Caba, Chen Zhao, Silvio Giancola, Bernard Ghanem
PDF
Maintaining Reasoning Consistency in Compositional Visual Question Answering Chenchen Jing, Yunde Jia, Yuwei Wu, Xinyu Liu, Qi Wu
PDF
Make It Move: Controllable Image-to-Video Generation with Text Descriptions Yaosi Hu, Chong Luo, Zhenzhong Chen
PDF
Manifold Learning Benefits GANs Yao Ni, Piotr Koniusz, Richard Hartley, Richard Nock
PDF
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-Wise Semantic Alignment and Generation Jianan Wang, Guansong Lu, Hang Xu, Zhenguo Li, Chunjing Xu, Yanwei Fu
PDF
Many-to-Many Splatting for Efficient Video Frame Interpolation Ping Hu, Simon Niklaus, Stan Sclaroff, Kate Saenko
PDF
Marginal Contrastive Correspondence for Guided Image Generation Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Changgong Zhang
PDF
Mask Transfiner for High-Quality Instance Segmentation Lei Ke, Martin Danelljan, Xia Li, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu
PDF
Mask-Guided Spectral-Wise Transformer for Efficient Hyperspectral Image Reconstruction Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc Van Gool
PDF
Masked Autoencoders Are Scalable Vision Learners Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross Girshick
PDF
Masked Feature Prediction for Self-Supervised Visual Pre-Training Chen Wei, Haoqi Fan, Saining Xie, Chao-Yuan Wu, Alan Yuille, Christoph Feichtenhofer
PDF
Masked-Attention Mask Transformer for Universal Image Segmentation Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar
PDF
MaskGIT: Masked Generative Image Transformer Huiwen Chang, Han Zhang, Lu Jiang, Ce Liu, William T. Freeman
PDF
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network Byung-Kwan Lee, Junho Kim, Yong Man Ro
PDF
MAT: Mask-Aware Transformer for Large Hole Image Inpainting Wenbo Li, Zhe Lin, Kun Zhou, Lu Qi, Yi Wang, Jiaya Jia
PDF
Matching Feature Sets for Few-Shot Image Classification Arman Afrasiyabi, Hugo Larochelle, Jean-François Lalonde, Christian Gagné
PDF
MatteFormer: Transformer-Based Image Matting via Prior-Tokens GyuTae Park, SungJoon Son, JaeYoung Yoo, SeHo Kim, Nojun Kwak
PDF
MAXIM: Multi-Axis MLP for Image Processing Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li
PDF
Maximum Consensus by Weighted Influences of Monotone Boolean Functions Erchuan Zhang, David Suter, Ruwan Tennakoon, Tat-Jun Chin, Alireza Bab-Hadiashar, Giang Truong, Syed Zulqarnain Gilani
PDF
Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation Yanwu Xu, Shaoan Xie, Wenhao Wu, Kun Zhang, Mingming Gong, Kayhan Batmanghelich
PDF
MDAN: Multi-Level Dependent Attention Network for Visual Emotion Analysis Liwen Xu, Zhengtao Wang, Bin Wu, Simon Lui
PDF
Measuring Compositional Consistency for Video Question Answering Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
PDF
Medial Spectral Coordinates for 3D Shape Analysis Morteza Rezanejad, Mohammad Khodadad, Hamidreza Mahyar, Herve Lombaert, Michael Gruninger, Dirk Walther, Kaleem Siddiqi
PDF
Mega-NERF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs Haithem Turki, Deva Ramanan, Mahadev Satyanarayanan
PDF
Memory-Augmented Deep Conditional Unfolding Network for Pan-Sharpening Gang Yang, Man Zhou, Keyu Yan, Aiping Liu, Xueyang Fu, Fan Wang
PDF
Memory-Augmented Non-Local Attention for Video Super-Resolution Jiyang Yu, Jingen Liu, Liefeng Bo, Tao Mei
PDF
MeMOT: Multi-Object Tracking with Memory Jiarui Cai, Mingze Xu, Wei Li, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto
PDF
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition Chao-Yuan Wu, Yanghao Li, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer
PDF
MERLOT Reserve: Neural Script Knowledge Through Vision and Language and Sound Rowan Zellers, Jiasen Lu, Ximing Lu, Youngjae Yu, Yanpeng Zhao, Mohammadreza Salehi, Aditya Kusupati, Jack Hessel, Ali Farhadi, Yejin Choi
PDF
Merry Go Round: Rotate a Frame and Fool a DNN Daksh Thapar, Aditya Nigam, Chetan Arora
PDF
Meta Agent Teaming Active Learning for Pose Estimation Jia Gong, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu
PDF
Meta Convolutional Neural Networks for Single Domain Generalization Chaoqun Wan, Xu Shen, Yonggang Zhang, Zhiheng Yin, Xinmei Tian, Feng Gao, Jianqiang Huang, Xian-Sheng Hua
PDF
Meta Distribution Alignment for Generalizable Person Re-Identification Hao Ni, Jingkuan Song, Xiaopeng Luo, Feng Zheng, Wen Li, Heng Tao Shen
PDF
Meta-Attention for ViT-Backed Continual Learning Mengqi Xue, Haofei Zhang, Jie Song, Mingli Song
PDF
MetaFormer Is Actually What You Need for Vision Weihao Yu, Mi Luo, Pan Zhou, Chenyang Si, Yichen Zhou, Xinchao Wang, Jiashi Feng, Shuicheng Yan
PDF
MetaFSCIL: A Meta-Learning Approach for Few-Shot Class Incremental Learning Zhixiang Chi, Li Gu, Huan Liu, Yang Wang, Yuanhao Yu, Jin Tang
PDF
MetaPose: Fast 3D Pose from Multiple Views Without 3D Supervision Ben Usman, Andrea Tagliasacchi, Kate Saenko, Avneesh Sud
PDF
MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc Van Gool
PDF
Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning Yujun Shi, Kuangqi Zhou, Jian Liang, Zihang Jiang, Jiashi Feng, Philip H.S. Torr, Song Bai, Vincent Y. F. Tan
PDF
Mining Multi-View Information: A Strong Self-Supervised Framework for Depth-Based 3D Hand Pose and Mesh Estimation Pengfei Ren, Haifeng Sun, Jiachang Hao, Jingyu Wang, Qi Qi, Jianxin Liao
PDF
MiniViT: Compressing Vision Transformers with Weight Multiplexing Jinnian Zhang, Houwen Peng, Kan Wu, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan
PDF
Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman
PDF
MISF: Multi-Level Interactive Siamese Filtering for High-Fidelity Image Inpainting Xiaoguang Li, Qing Guo, Di Lin, Ping Li, Wei Feng, Song Wang
PDF
Mix and Localize: Localizing Sound Sources in Mixtures Xixi Hu, Ziyang Chen, Andrew Owens
PDF
Mixed Differential Privacy in Computer Vision Aditya Golatkar, Alessandro Achille, Yu-Xiang Wang, Aaron Roth, Michael Kearns, Stefano Soatto
PDF
MixFormer: End-to-End Tracking with Iterative Mixed Attention Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu
PDF
MixFormer: Mixing Features Across Windows and Dimensions Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang
PDF
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video Jinlu Zhang, Zhigang Tu, Jianyu Yang, Yujin Chen, Junsong Yuan
PDF
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei
PDF
MLSLT: Towards Multilingual Sign Language Translation Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He
PDF
MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation Inkyu Shin, Yi-Hsuan Tsai, Bingbing Zhuang, Samuel Schulter, Buyu Liu, Sparsh Garg, In So Kweon, Kuk-Jin Yoon
PDF
MNSRNet: Multimodal Transformer Network for 3D Surface Super-Resolution Wuyuan Xie, Tengcong Huang, Miaohui Wang
PDF
Mobile-Former: Bridging MobileNet and Transformer Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Xiaoyi Dong, Lu Yuan, Zicheng Liu
PDF
MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo
PDF
Modality-Agnostic Learning for Radar-LiDAR Fusion in Vehicle Detection Yu-Jhe Li, Jinhyung Park, Matthew O'Toole, Kris Kitani
PDF
Modeling 3D Layout for Group Re-Identification Quan Zhang, Kaiheng Dang, Jian-Huang Lai, Zhanxiang Feng, Xiaohua Xie
PDF
Modeling Image Composition for Complex Scene Generation Zuopeng Yang, Daqing Liu, Chaoyue Wang, Jie Yang, Dacheng Tao
PDF
Modeling Indirect Illumination for Inverse Rendering Yuanqing Zhang, Jiaming Sun, Xingyi He, Huan Fu, Rongfei Jia, Xiaowei Zhou
PDF
Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation Wangbo Zhao, Kai Wang, Xiangxiang Chu, Fuzhao Xue, Xinchao Wang, Yang You
PDF
Modeling sRGB Camera Noise with Normalizing Flows Shayan Kousha, Ali Maleky, Michael S. Brown, Marcus A. Brubaker
PDF
Modular Action Concept Grounding in Semantic Video Prediction Wei Yu, Wenxin Chen, Songheng Yin, Steve Easterbrook, Animesh Garg
PDF
Modulated Contrast for Versatile Image Synthesis Fangneng Zhan, Jiahui Zhang, Yingchen Yu, Rongliang Wu, Shijian Lu
PDF
MogFace: Towards a Deeper Appreciation on Face Detection Yang Liu, Fei Wang, Jiankang Deng, Zhipeng Zhou, Baigui Sun, Hao Li
PDF
MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer Kuan-Chih Huang, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu
PDF
MonoGround: Detecting Monocular 3D Objects from the Ground Zequn Qin, Xi Li
PDF
MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection Qing Lian, Peiliang Li, Xiaozhi Chen
PDF
MonoScene: Monocular 3D Semantic Scene Completion Anh-Quan Cao, Raoul de Charette
PDF
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford, Miaosen Wang, Ye Jia, Tal Remez
PDF
Motion-Adjustable Neural Implicit Video Representation Long Mai, Feng Liu
PDF
Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging Shuangrui Ding, Maomao Li, Tianyu Yang, Rui Qian, Haohang Xu, Qingyi Chen, Jue Wang, Hongkai Xiong
PDF
Motion-from-Blur: 3D Shape and Motion Estimation of Motion-Blurred Objects in Videos Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Marc Pollefeys
PDF
Motion-Modulated Temporal Fragment Alignment Network for Few-Shot Action Recognition Jiamin Wu, Tianzhu Zhang, Zhe Zhang, Feng Wu, Yongdong Zhang
PDF
MotionAug: Augmentation with Physical Correction for Human Motion Prediction Takahiro Maeda, Norimichi Ukita
PDF
Motron: Multimodal Probabilistic Human Motion Forecasting Tim Salzmann, Marco Pavone, Markus Ryll
PDF
Moving Window Regression: A Novel Approach to Ordinal Regression Nyeong-Ho Shin, Seon-Ho Lee, Chang-Su Kim
PDF
MPC: Multi-View Probabilistic Clustering Junjie Liu, Junlong Liu, Shaotian Yan, Rongxin Jiang, Xiang Tian, Boxuan Gu, Yaowu Chen, Chen Shen, Jianqiang Huang
PDF
MPViT: Multi-Path Vision Transformer for Dense Prediction Youngwan Lee, Jonghee Kim, Jeffrey Willette, Sung Ju Hwang
PDF
Mr.BiQ: Post-Training Non-Uniform Quantization Based on Minimizing the Reconstruction Error Yongkweon Jeon, Chungman Lee, Eulrang Cho, Yeonju Ro
PDF
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection Rui Dai, Srijan Das, Kumara Kahatapitiya, Michael S. Ryoo, François Brémond
PDF
MS2DG-Net: Progressive Correspondence Learning via Multiple Sparse Semantics Dynamic Graph Luanyuan Dai, Yizhang Liu, Jiayi Ma, Lifang Wei, Taotao Lai, Changcai Yang, Riqing Chen
PDF
MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning Shiming Chen, Ziming Hong, Guo-Sen Xie, Wenhan Yang, Qinmu Peng, Kai Wang, Jian Zhao, Xinge You
PDF
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens Jiemin Fang, Lingxi Xie, Xinggang Wang, Xiaopeng Zhang, Wenyu Liu, Qi Tian
PDF
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection Bumsoo Kim, Jonghwan Mun, Kyoung-Woon On, Minchul Shin, Junhyun Lee, Eun-Sol Kim
PDF
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-Based Visual Question Answering Yang Ding, Jing Yu, Bang Liu, Yue Hu, Mingxin Cui, Qi Wu
PDF
MulT: An End-to-End Multitask Learning Transformer Deblina Bhattacharjee, Tong Zhang, Sabine Süsstrunk, Mathieu Salzmann
PDF
Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu
PDF
Multi-Dimensional, Nuanced and Subjective - Measuring the Perception of Facial Expressions De'Aira Bryant, Siqi Deng, Nashlie Sephus, Wei Xia, Pietro Perona
PDF
Multi-Frame Self-Supervised Depth with Transformers Vitor Guizilini, Rareș Ambruș, Dian Chen, Sergey Zakharov, Adrien Gaidon
PDF
Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading Ganchao Tan, Yang Wang, Han Han, Yang Cao, Feng Wu, Zheng-Jun Zha
PDF
Multi-Granularity Alignment Domain Adaptation for Object Detection Wenzhang Zhou, Dawei Du, Libo Zhang, Tiejian Luo, Yanjun Wu
PDF
Multi-Instance Point Cloud Registration by Efficient Correspondence Clustering Weixuan Tang, Danping Zou
PDF
Multi-Label Classification with Partial Annotations Using Class-Aware Selective Loss Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor
PDF
Multi-Label Iterated Learning for Image Classification with Label Ambiguity Sai Rajeswar, Pau Rodríguez, Soumye Singhal, David Vazquez, Aaron Courville
PDF
Multi-Level Feature Learning for Contrastive Multi-View Clustering Jie Xu, Huayi Tang, Yazhou Ren, Liang Peng, Xiaofeng Zhu, Lifang He
PDF
Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation Dongming Wu, Xingping Dong, Ling Shao, Jianbing Shen
PDF
Multi-Marginal Contrastive Learning for Multi-Label Subcellular Protein Localization Ziyi Liu, Zengmao Wang, Bo Du
PDF
Multi-Modal Alignment Using Representation Codebook Jiali Duan, Liqun Chen, Son Tran, Jinyu Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi
PDF
Multi-Modal Dynamic Graph Transformer for Visual Grounding Sijia Chen, Baochun Li
PDF
Multi-Modal Extreme Classification Anshul Mittal, Kunal Dahiya, Shreya Malani, Janani Ramaswamy, Seba Kuruvilla, Jitendra Ajmera, Keng-hao Chang, Sumeet Agarwal, Purushottam Kar, Manik Varma
PDF
Multi-Object Tracking Meets Moving UAV Shuai Liu, Xin Li, Huchuan Lu, You He
PDF
Multi-Objective Diverse Human Motion Prediction with Knowledge Distillation Hengbo Ma, Jiachen Li, Ramtin Hosseini, Masayoshi Tomizuka, Chiho Choi
PDF
Multi-Person Extreme Motion Prediction Wen Guo, Xiaoyu Bie, Xavier Alameda-Pineda, Francesc Moreno-Noguer
PDF
Multi-Robot Active Mapping via Neural Bipartite Graph Matching Kai Ye, Siyan Dong, Qingnan Fan, He Wang, Li Yi, Fei Xia, Jue Wang, Baoquan Chen
PDF
Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation Jiaqi Gu, Hyoukjun Kwon, Dilin Wang, Wei Ye, Meng Li, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra, David Z. Pan
PDF
Multi-Scale Memory-Based Video Deblurring Bo Ji, Angela Yao
PDF
Multi-Source Uncertainty Mining for Deep Unsupervised Saliency Detection Yifan Wang, Wenbo Zhang, Lijun Wang, Ting Liu, Huchuan Lu
PDF
Multi-View Consistent Generative Adversarial Networks for 3D-Aware Image Synthesis Xuanmeng Zhang, Zhedong Zheng, Daiheng Gao, Bang Zhang, Pan Pan, Yi Yang
PDF
Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry Gwangbin Bae, Ignas Budvytis, Roberto Cipolla
PDF
Multi-View Mesh Reconstruction with Neural Deferred Shading Markus Worchel, Rodrigo Diaz, Weiwen Hu, Oliver Schreer, Ingo Feldmann, Peter Eisert
PDF
Multi-View Transformer for 3D Visual Grounding Shijia Huang, Yilun Chen, Jiaya Jia, Liwei Wang
PDF
Multidimensional Belief Quantification for Label-Efficient Meta-Learning Deep Shankar Pandey, Qi Yu
PDF
Multimodal Colored Point Cloud to Image Alignment Noam Rotstein, Amit Bracha, Ron Kimmel
PDF
Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification Zongbo Han, Fan Yang, Junzhou Huang, Changqing Zhang, Jianhua Yao
PDF
Multimodal Material Segmentation Yupeng Liang, Ryosuke Wakaki, Shohei Nobuhara, Ko Nishino
PDF
Multimodal Token Fusion for Vision Transformers Yikai Wang, Xinghao Chen, Lele Cao, Wenbing Huang, Fuchun Sun, Yunhe Wang
PDF
Multiview Transformers for Video Recognition Shen Yan, Xuehan Xiong, Anurag Arnab, Zhichao Lu, Mi Zhang, Chen Sun, Cordelia Schmid
PDF
MUM: Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection JongMok Kim, JooYoung Jang, Seunghyeon Seo, Jisoo Jeong, Jongkeun Na, Nojun Kwak
PDF
MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
PDF
Mutual Information-Driven Pan-Sharpening Man Zhou, Keyu Yan, Jie Huang, Zihe Yang, Xueyang Fu, Feng Zhao
PDF
Mutual Quantization for Cross-Modal Search with Noisy Labels Erkun Yang, Dongren Yao, Tongliang Liu, Cheng Deng
PDF
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer
PDF
MVS2D: Efficient Multi-View Stereo via Attention-Driven 2D Convolutions Zhenpei Yang, Zhile Ren, Qi Shan, Qixing Huang
PDF
NAN: Noise-Aware NeRFs for Burst-Denoising Naama Pearl, Tali Treibitz, Simon Korman
PDF
Negative-Aware Attention Framework for Image-Text Matching Kun Zhang, Zhendong Mao, Quan Wang, Yongdong Zhang
PDF
NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images Ben Mildenhall, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan, Jonathan T. Barron
PDF
NeRF-Editing: Geometry Editing of Neural Radiance Fields Yu-Jie Yuan, Yang-Tian Sun, Yu-Kun Lai, Yuewen Ma, Rongfei Jia, Lin Gao
PDF
NeRFReN: Neural Radiance Fields with Reflections Yuan-Chen Guo, Di Kang, Linchao Bao, Yu He, Song-Hai Zhang
PDF
NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction Xiaoshuai Zhang, Sai Bi, Kalyan Sunkavalli, Hao Su, Zexiang Xu
PDF
Nested Collaborative Learning for Long-Tailed Visual Recognition Jun Li, Zichang Tan, Jun Wan, Zhen Lei, Guodong Guo
PDF
Nested Hyperbolic Spaces for Dimensionality Reduction and Hyperbolic NN Design Xiran Fan, Chun-Hao Yang, Baba C. Vemuri
PDF
Neural 3D Scene Reconstruction with the Manhattan-World Assumption Haoyu Guo, Sida Peng, Haotong Lin, Qianqian Wang, Guofeng Zhang, Hujun Bao, Xiaowei Zhou
PDF
Neural 3D Video Synthesis from Multi-View Video Tianye Li, Mira Slavcheva, Michael Zollhöfer, Simon Green, Christoph Lassner, Changil Kim, Tanner Schmidt, Steven Lovegrove, Michael Goesele, Richard Newcombe, Zhaoyang Lv
PDF
Neural Architecture Search with Representation Mutual Information Xiawu Zheng, Xiang Fei, Lei Zhang, Chenglin Wu, Fei Chao, Jianzhuang Liu, Wei Zeng, Yonghong Tian, Rongrong Ji
PDF
Neural Collaborative Graph Machines for Table Structure Recognition Hao Liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
PDF
Neural Compression-Based Feature Learning for Video Restoration Cong Huang, Jiahao Li, Bin Li, Dong Liu, Yan Lu
PDF
Neural Convolutional Surfaces Luca Morreale, Noam Aigerman, Paul Guerrero, Vladimir G. Kim, Niloy J. Mitra
PDF
Neural Data-Dependent Transform for Learned Image Compression Dezhao Wang, Wenhan Yang, Yueyu Hu, Jiaying Liu
PDF
Neural Emotion Director: Speech-Preserving Semantic Control of Facial Expressions in "In-the-Wild" Videos Foivos Paraperas Papantoniou, Panagiotis P. Filntisis, Petros Maragos, Anastasios Roussos
PDF
Neural Face Identification in a 2D Wireframe Projection of a Manifold Object Kehan Wang, Jia Zheng, Zihan Zhou
PDF
Neural Fields as Learnable Kernels for 3D Reconstruction Francis Williams, Zan Gojcic, Sameh Khamis, Denis Zorin, Joan Bruna, Sanja Fidler, Or Litany
PDF
Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature Zhixiang Wang, Xiang Ji, Jia-Bin Huang, Shin'ichi Satoh, Xiao Zhou, Yinqiang Zheng
PDF
Neural Head Avatars from Monocular RGB Videos Philip-William Grassal, Malte Prinzler, Titus Leistner, Carsten Rother, Matthias Nießner, Justus Thies
PDF
Neural Inertial Localization Sachini Herath, David Caruso, Chen Liu, Yufan Chen, Yasutaka Furukawa
PDF
Neural Mean Discrepancy for Efficient Out-of-Distribution Detection Xin Dong, Junfeng Guo, Ang Li, Wei-Te Ting, Cong Liu, H.T. Kung
PDF
Neural Mesh Simplification Rolandos Alexandros Potamias, Stylianos Ploumpis, Stefanos Zafeiriou
PDF
Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture Buzhen Huang, Liang Pan, Yuan Yang, Jingyi Ju, Yangang Wang
PDF
Neural Point Light Fields Julian Ost, Issam Laradji, Alejandro Newell, Yuval Bahat, Felix Heide
PDF
Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling Wanquan Feng, Jin Li, Hongrui Cai, Xiaonan Luo, Juyong Zhang
PDF
Neural Prior for Trajectory Estimation Chaoyang Wang, Xueqian Li, Jhony Kaesemodel Pontes, Simon Lucey
PDF
Neural Rays for Occlusion-Aware Image-Based Rendering Yuan Liu, Sida Peng, Lingjie Liu, Qianqian Wang, Peng Wang, Christian Theobalt, Xiaowei Zhou, Wenping Wang
PDF
Neural Recognition of Dashed Curves with Gestalt Law of Continuity Hanyuan Liu, Chengze Li, Xueting Liu, Tien-Tsin Wong
PDF
Neural Reflectance for Shape Recovery with Shadow Handling Junxuan Li, Hongdong Li
PDF
Neural RGB-D Surface Reconstruction Dejan Azinović, Ricardo Martin-Brualla, Dan B Goldman, Matthias Nießner, Justus Thies
PDF
Neural Shape Mating: Self-Supervised Object Assembly with Adversarial Shape Priors Yun-Chun Chen, Haoda Li, Dylan Turpin, Alec Jacobson, Animesh Garg
PDF
Neural Template: Topology-Aware Reconstruction and Disentangled Generation of 3D Meshes Ka-Hei Hui, Ruihui Li, Jingyu Hu, Chi-Wing Fu
PDF
Neural Texture Extraction and Distribution for Controllable Person Image Synthesis Yurui Ren, Xiaoqing Fan, Ge Li, Shan Liu, Thomas H. Li
PDF
Neural Volumetric Object Selection Zhongzheng Ren, Aseem Agarwala, Bryan Russell, Alexander G. Schwing, Oliver Wang
PDF
Neural Window Fully-Connected CRFs for Monocular Depth Estimation Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan
PDF
NeuralHDHair: Automatic High-Fidelity Hair Modeling from a Single Image Using Implicit Neural Representations Keyu Wu, Yifan Ye, Lingchen Yang, Hongbo Fu, Kun Zhou, Youyi Zheng
PDF
NeuralHOFusion: Neural Volumetric Rendering Under Human-Object Interactions Yuheng Jiang, Suyi Jiang, Guoxing Sun, Zhuo Su, Kaiwen Guo, Minye Wu, Jingyi Yu, Lan Xu
PDF
NeurMiPs: Neural Mixture of Planar Experts for View Synthesis Zhi-Hao Lin, Wei-Chiu Ma, Hao-Yu Hsu, Yu-Chiang Frank Wang, Shenlong Wang
PDF
NFormer: Robust Person Re-Identification with Neighbor Transformer Haochen Wang, Jiayi Shen, Yongtuo Liu, Yan Gao, Efstratios Gavves
PDF
NICE-SLAM: Neural Implicit Scalable Encoding for SLAM Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys
PDF
NICGSlowDown: Evaluating the Efficiency Robustness of Neural Image Caption Generation Models Simin Chen, Zihe Song, Mirazul Haque, Cong Liu, Wei Yang
PDF
NightLab: A Dual-Level Architecture with Hardness Detection for Segmentation at Night Xueqing Deng, Peng Wang, Xiaochen Lian, Shawn Newsam
PDF
NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning Tony Ng, Hyo Jin Kim, Vincent T. Lee, Daniel DeTone, Tsun-Yi Yang, Tianwei Shen, Eddy Ilg, Vassileios Balntas, Krystian Mikolajczyk, Chris Sweeney
PDF
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks Fawaz Sammani, Tanmoy Mukherjee, Nikos Deligiannis
PDF
No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-Level Space-Time Surfaces Jia-Xing Zhong, Kaichen Zhou, Qingyong Hu, Bing Wang, Niki Trigoni, Andrew Markham
PDF
No-Reference Point Cloud Quality Assessment via Domain Adaptation Qi Yang, Yipeng Liu, Siheng Chen, Yiling Xu, Jun Sun
PDF
NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge Duc Minh Vo, Hong Chen, Akihiro Sugimoto, Hideki Nakayama
PDF
Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization Wei Dong, Junsheng Wu, Yi Luo, Zongyuan Ge, Peng Wang
PDF
Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification Yonghang Guan, Jun Zhang, Kuan Tian, Sen Yang, Pei Dong, Jinxi Xiang, Wei Yang, Junzhou Huang, Yuyao Zhang, Xiao Han
PDF
NODEO: A Neural Ordinary Differential Equation Based Optimization Framework for Deformable Image Registration Yifan Wu, Tom Z. Jiahao, Jiancong Wang, Paul A. Yushkevich, M. Ani Hsieh, James C. Gee
PDF
Noise Distribution Adaptive Self-Supervised Image Denoising Using Tweedie Distribution and Score Matching Kwanyoung Kim, Taesung Kwon, Jong Chul Ye
PDF
Noise Is Also Useful: Negative Correlation-Steered Latent Contrastive Learning Jiexi Yan, Lei Luo, Chenghao Xu, Cheng Deng, Heng Huang
PDF
Noise2NoiseFlow: Realistic Camera Noise Modeling Without Clean Images Ali Maleky, Shayan Kousha, Michael S. Brown, Marcus A. Brubaker
PDF
Noisy Boundaries: Lemon or Lemonade for Semi-Supervised Instance Segmentation? Zhenyu Wang, Yali Li, Shengjin Wang
PDF
NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition Hao Liu, Xinghua Jiang, Xin Li, Zhimin Bao, Deqiang Jiang, Bo Ren
PDF
Non-Generative Generalized Zero-Shot Learning via Task-Correlated Disentanglement and Controllable Samples Synthesis Yaogong Feng, Xiaowen Huang, Pengbo Yang, Jian Yu, Jitao Sang
PDF
Non-Isotropy Regularization for Proxy-Based Deep Metric Learning Karsten Roth, Oriol Vinyals, Zeynep Akata
PDF
Non-Iterative Recovery from Nonlinear Observations Using Generative Models Jiulong Liu, Zhaoqiang Liu
PDF
Non-Parametric Depth Distribution Modelling Based Depth Inference for Multi-View Stereo Jiayu Yang, Jose M. Alvarez, Miaomiao Liu
PDF
Non-Probability Sampling Network for Stochastic Human Trajectory Prediction Inhwan Bae, Jin-Hwi Park, Hae-Gon Jeon
PDF
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation Zechun Liu, Kwang-Ting Cheng, Dong Huang, Eric P. Xing, Zhiqiang Shen
PDF
Not All Labels Are Equal: Rationalizing the Labeling Costs for Training Object Detection Ismail Elezi, Zhiding Yu, Anima Anandkumar, Laura Leal-Taixé, Jose M. Alvarez
PDF
Not All Points Are Equal: Learning Highly Efficient Point-Based Detectors for 3D LiDAR Point Clouds Yifan Zhang, Qingyong Hu, Guoquan Xu, Yanxin Ma, Jianwei Wan, Yulan Guo
PDF
Not All Relations Are Equal: Mining Informative Labels for Scene Graph Generation Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen
PDF
Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer Wang Zeng, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang
PDF
Not Just Selection, but Exploration: Online Class-Incremental Continual Learning via Dual View Consistency Yanan Gu, Xu Yang, Kun Wei, Cheng Deng
PDF
Novel Class Discovery in Semantic Segmentation Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee
PDF
NPBG++: Accelerating Neural Point-Based Graphics Ruslan Rakhimov, Andrei-Timotei Ardelean, Victor Lempitsky, Evgeny Burnaev
PDF
OakInk: A Large-Scale Knowledge Repository for Understanding Hand-Object Interaction Lixin Yang, Kailin Li, Xinyu Zhan, Fei Wu, Anran Xu, Liu Liu, Cewu Lu
PDF
Object Localization Under Single Coarse Point Supervision Xuehui Yu, Pengfei Chen, Di Wu, Najmul Hassan, Guorong Li, Junchi Yan, Humphrey Shi, Qixiang Ye, Zhenjun Han
PDF
Object-Aware Video-Language Pre-Training for Retrieval Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou
PDF
Object-Region Video Transformers Roei Herzig, Elad Ben-Avraham, Karttikeya Mangalam, Amir Bar, Gal Chechik, Anna Rohrbach, Trevor Darrell, Amir Globerson
PDF
Object-Relation Reasoning Graph for Action Recognition Yangjun Ou, Li Mi, Zhenzhong Chen
PDF
ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer Ruohan Gao, Zilin Si, Yen-Yu Chang, Samuel Clarke, Jeannette Bohg, Li Fei-Fei, Wenzhen Yuan, Jiajun Wu
PDF
ObjectFormer for Image Manipulation Detection and Localization Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
PDF
OccAM's Laser: Occlusion-Based Attribution Maps for 3D Object Detectors on LiDAR Data David Schinagl, Georg Krispel, Horst Possegger, Peter M. Roth, Horst Bischof
PDF
Occluded Human Mesh Recovery Rawal Khirodkar, Shashank Tripathi, Kris Kitani
PDF
Occlusion-Aware Cost Constructor for Light Field Depth Estimation Yingqian Wang, Longguang Wang, Zhengyu Liang, Jungang Yang, Wei An, Yulan Guo
PDF
Occlusion-Robust Face Alignment Using a Viewpoint-Invariant Hierarchical Network Architecture Congcong Zhu, Xintong Wan, Shaorong Xie, Xiaoqiang Li, Yinzheng Gu
PDF
OcclusionFusion: Occlusion-Aware Motion Estimation for Real-Time Dynamic 3D Reconstruction Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu
PDF
OCSampler: Compressing Videos to One CLIP with Single-Step Sampling Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang
PDF
Omni-DETR: Omni-Supervised Object Detection with Transformers Pei Wang, Zhaowei Cai, Hao Yang, Gurumurthy Swaminathan, Nuno Vasconcelos, Bernt Schiele, Stefano Soatto
PDF
OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion Yuyan Li, Yuliang Guo, Zhixin Yan, Xinyu Huang, Ye Duan, Liu Ren
PDF
Omnivore: A Single Model for Many Visual Modalities Rohit Girdhar, Mannat Singh, Nikhila Ravi, Laurens van der Maaten, Armand Joulin, Ishan Misra
PDF
On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles Qingzhao Zhang, Shengtuo Hu, Jiachen Sun, Qi Alfred Chen, Z. Morley Mao
PDF
On Aliased Resizing and Surprising Subtleties in GAN Evaluation Gaurav Parmar, Richard Zhang, Jun-Yan Zhu
PDF
On Generalizing Beyond Domains in Cross-Domain Continual Learning Christian Simon, Masoud Faraki, Yi-Hsuan Tsai, Xiang Yu, Samuel Schulter, Yumin Suh, Mehrtash Harandi, Manmohan Chandraker
PDF
On Guiding Visual Attention with Language Specification Suzanne Petryk, Lisa Dunlap, Keyan Nasseri, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach
PDF
On Learning Contrastive Representations for Learning with Noisy Labels Li Yi, Sheng Liu, Qi She, A. Ian McLeod, Boyu Wang
PDF
On the Importance of Asymmetry for Siamese Representation Learning Xiao Wang, Haoqi Fan, Yuandong Tian, Daisuke Kihara, Xinlei Chen
PDF
On the Instability of Relative Pose Estimation and RANSAC's Role Hongyi Fan, Joe Kileel, Benjamin Kimia
PDF
On the Integration of Self-Attention and Convolution Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang
PDF
On the Road to Online Adaptation for Semantic Image Segmentation Riccardo Volpi, Pau De Jorge, Diane Larlus, Gabriela Csurka
PDF
ONCE-3DLanes: Building Monocular 3D Lane Detection Fan Yan, Ming Nie, Xinyue Cai, Jianhua Han, Hang Xu, Zhen Yang, Chaoqiang Ye, Yanwei Fu, Michael Bi Mi, Li Zhang
PDF
One Loss for Quantization: Deep Hashing with Discrete Wasserstein Distributional Matching Khoa D. Doan, Peng Yang, Ping Li
PDF
One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M. Sadler, Wei-Lun Chao, Yu Su
PDF
One-Bit Active Query with Contrastive Pairs Yuhang Zhang, Xiaopeng Zhang, Lingxi Xie, Jie Li, Robert C. Qiu, Hengtong Hu, Qi Tian
PDF
OnePose: One-Shot Object Pose Estimation Without CAD Models Jiaming Sun, Zihao Wang, Siyu Zhang, Xingyi He, Hongcheng Zhao, Guofeng Zhang, Xiaowei Zhou
PDF
Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries Jihwan Bang, Hyunseo Koh, Seulki Park, Hwanjun Song, Jung-Woo Ha, Jonghyun Choi
PDF
Online Convolutional Re-Parameterization Mu Hu, Junyi Feng, Jiashen Hua, Baisheng Lai, Jianqiang Huang, Xiaojin Gong, Xian-Sheng Hua
PDF
Online Learning of Reusable Abstract Models for Object Goal Navigation Tommaso Campari, Leonardo Lamanna, Paolo Traverso, Luciano Serafini, Lamberto Ballan
PDF
OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization Nanyang Ye, Kaican Li, Haoyue Bai, Runpeng Yu, Lanqing Hong, Fengwei Zhou, Zhenguo Li, Jun Zhu
PDF
Open Challenges in Deep Stereo: The Booster Dataset Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano
PDF
Open-Domain, Content-Based, Multi-Modal Fact-Checking of Out-of-Context Images via Online Resources Sahar Abdelnabi, Rakibul Hasan, Mario Fritz
PDF
Open-Set Text Recognition via Character-Context Decoupling Chang Liu, Chun Yang, Xu-Cheng Yin
PDF
Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling Dat Huynh, Jason Kuen, Zhe Lin, Jiuxiang Gu, Ehsan Elhamifar
PDF
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation Zongyang Ma, Guan Luo, Jin Gao, Liang Li, Yuxin Chen, Shaoru Wang, Congxuan Zhang, Weiming Hu
PDF
Open-World Instance Segmentation: Exploiting Pseudo Ground Truth from Learned Pairwise Affinity Weiyao Wang, Matt Feiszli, Heng Wang, Jitendra Malik, Du Tran
PDF
Opening up Open World Tracking Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé
PDF
OpenTAL: Towards Open Set Temporal Action Localization Wentao Bao, Qi Yu, Yu Kong
PDF
Optical Flow Estimation for Spiking Camera Liwen Hu, Rui Zhao, Ziluo Ding, Lei Ma, Boxin Shi, Ruiqin Xiong, Tiejun Huang
PDF
Optimal Correction Cost for Object Detection Evaluation Mayu Otani, Riku Togashi, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh
PDF
Optimal LED Spectral Multiplexing for NIR2RGB Translation Lei Liu, Yuze Chen, Junchi Yan, Yinqiang Zheng
PDF
Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev, Jana Vráblíková, Tomas Pajdla
PDF
Optimizing Video Prediction via Video Frame Interpolation Yue Wu, Qiang Wen, Qifeng Chen
PDF
Oriented RepPoints for Aerial Object Detection Wentong Li, Yijie Chen, Kaixuan Hu, Jianke Zhu
PDF
OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks Wanyu Lin, Hao Lan, Hao Wang, Baochun Li
PDF
OSKDet: Orientation-Sensitive Keypoint Localization for Rotated Object Detection Dongchen Lu, Dongmei Li, Yali Li, Shengjin Wang
PDF
OSOP: A Multi-Stage One Shot Object Pose Estimation Framework Ivan Shugurov, Fu Li, Benjamin Busam, Slobodan Ilic
PDF
OSSGAN: Open-Set Semi-Supervised Image Generation Kai Katsumata, Duc Minh Vo, Hideki Nakayama
PDF
OSSO: Obtaining Skeletal Shape from Outside Marilyn Keller, Silvia Zuffi, Michael J. Black, Sergi Pujades
PDF
Out-of-Distribution Generalization with Causal Invariant Transformations Ruoyu Wang, Mingyang Yi, Zhitang Chen, Shengyu Zhu
PDF
OVE6D: Object Viewpoint Encoding for Depth-Based 6d Object Pose Estimation Dingding Cai, Janne Heikkilä, Esa Rahtu
PDF
Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation Tao Feng, Mang Wang, Hangjie Yuan
PDF
OW-DETR: Open-World Detection Transformer Akshita Gupta, Sanath Narayan, K J Joseph, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah
PDF
P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior Vaishakh Patil, Christos Sakaridis, Alexander Liniger, Luc Van Gool
PDF
P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision He Zhao, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson
PDF
Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation Abhijit Kundu, Kyle Genova, Xiaoqi Yin, Alireza Fathi, Caroline Pantofaru, Leonidas J. Guibas, Andrea Tagliasacchi, Frank Dellaert, Thomas Funkhouser
PDF
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo, Tong Lu
PDF
Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap Jinke Li, Xiao He, Yang Wen, Yuan Gao, Xiaoqiang Cheng, Dan Zhang
PDF
Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation Shubhankar Borse, Hyojin Park, Hong Cai, Debasmit Das, Risheek Garrepalli, Fatih Porikli
PDF
PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation Naiyu Gao, Fei He, Jian Jia, Yanhu Shan, Haoyang Zhang, Xin Zhao, Kaiqi Huang
PDF
Parameter-Free Online Test-Time Adaptation Malik Boudiaf, Romain Mueller, Ismail Ben Ayed, Luca Bertinetto
PDF
Parametric Scattering Networks Shanel Gauthier, Benjamin Thérien, Laurent Alsène-Racicot, Muawiz Chaudhary, Irina Rish, Eugene Belilovsky, Michael Eickenberg, Guy Wolf
PDF
Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention Tong Yu, Ruslan Khalitov, Lei Cheng, Zhirong Yang
PDF
Part-Based Pseudo Label Refinement for Unsupervised Person Re-Identification Yoonki Cho, Woo Jae Kim, Seunghoon Hong, Sung-Eui Yoon
PDF
PartGlot: Learning Shape Part Segmentation from Language Reference Games Juil Koo, Ian Huang, Panos Achlioptas, Leonidas J. Guibas, Minhyuk Sung
PDF
Partial Class Activation Attention for Semantic Segmentation Sun-Ao Liu, Hongtao Xie, Hai Xu, Yongdong Zhang, Qi Tian
PDF
Partially Does It: Towards Scene-Level FG-SBIR with Partial Input Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Aneeshan Sain, Tao Xiang, Yi-Zhe Song
PDF
Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy
PDF
Patch Slimming for Efficient Vision Transformers Yehui Tang, Kai Han, Yunhe Wang, Chang Xu, Jianyuan Guo, Chao Xu, Dacheng Tao
PDF
Patch-Level Representation Learning for Self-Supervised Vision Transformers Sukmin Yun, Hankook Lee, Jaehyung Kim, Jinwoo Shin
PDF
PatchFormer: An Efficient Point Transformer with Patch Attention Cheng Zhang, Haocheng Wan, Xinyi Shen, Zizhao Wu
PDF
PatchNet: A Simple Face Anti-Spoofing Framework via Fine-Grained Patch Recognition Chien-Yi Wang, Yu-Ding Lu, Shang-Ta Yang, Shang-Hong Lai
PDF
PCA-Based Knowledge Distillation Towards Lightweight and Content-Style Balanced Photorealistic Style Transfer Models Tai-Yin Chiu, Danna Gurari
PDF
PCL: Proxy-Based Contrastive Learning for Domain Generalization Xufeng Yao, Yang Bai, Xinyun Zhang, Yuechen Zhang, Qi Sun, Ran Chen, Ruiyu Li, Bei Yu
PDF
Per-CLIP Video Object Segmentation Kwanyong Park, Sanghyun Woo, Seoung Wug Oh, In So Kweon, Joon-Young Lee
PDF
Perception Prioritized Training of Diffusion Models Jooyoung Choi, Jungbeom Lee, Chaehun Shin, Sungwon Kim, Hyunwoo Kim, Sungroh Yoon
PDF
Performance-Aware Mutual Knowledge Distillation for Improving Neural Architecture Search Pengtao Xie, Xuefeng Du
PDF
Personalized Image Aesthetics Assessment with Rich Attributes Yuzhe Yang, Liwu Xu, Leida Li, Nan Qie, Yaqian Li, Peng Zhang, Yandong Guo
PDF
Perturbed and Strict Mean Teachers for Semi-Supervised Semantic Segmentation Yuyuan Liu, Yu Tian, Yuanhong Chen, Fengbei Liu, Vasileios Belagiannis, Gustavo Carneiro
PDF
PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects Pengyuan Wang, HyunJun Jung, Yitong Li, Siyuan Shen, Rahul Parthasarathy Srikanth, Lorenzo Garattoni, Sven Meier, Nassir Navab, Benjamin Busam
PDF
Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing Thiemo Alldieck, Mihai Zanfir, Cristian Sminchisescu
PDF
PhotoScene: Photorealistic Material and Lighting Transfer for Indoor Scenes Yu-Ying Yeh, Zhengqin Li, Yannick Hold-Geoffroy, Rui Zhu, Zexiang Xu, Miloš Hašan, Kalyan Sunkavalli, Manmohan Chandraker
PDF
PhyIR: Physics-Based Inverse Rendering for Panoramic Indoor Images Zhen Li, Lingli Wang, Xiang Huang, Cihui Pan, Jiaqi Yang
PDF
PhysFormer: Facial Video-Based Physiological Measurement with Temporal Difference Transformer Zitong Yu, Yuming Shen, Jingang Shi, Hengshuang Zhao, Philip H.S. Torr, Guoying Zhao
PDF
Physical Inertial Poser (PIP): Physics-Aware Real-Time Human Motion Tracking from Sparse Inertial Sensors Xinyu Yi, Yuxiao Zhou, Marc Habermann, Soshi Shimada, Vladislav Golyanik, Christian Theobalt, Feng Xu
PDF
Physical Simulation Layer for Accurate 3D Modeling Mariem Mezghanni, Théo Bodrito, Malika Boulkenafed, Maks Ovsjanikov
PDF
Physically Disentangled Intra- and Inter-Domain Adaptation for Varicolored Haze Removal Yi Li, Yi Chang, Yan Gao, Changfeng Yu, Luxin Yan
PDF
Physically-Guided Disentangled Implicit Rendering for 3D Face Modeling Zhenyu Zhang, Yanhao Ge, Ying Tai, Weijian Cao, Renwang Chen, Kunlin Liu, Hao Tang, Xiaoming Huang, Chengjie Wang, Zhifeng Xie, Dongjin Huang
PDF
PIE-Net: Photometric Invariant Edge Guided Network for Intrinsic Image Decomposition Partha Das, Sezer Karaoglu, Theo Gevers
PDF
PILC: Practical Image Lossless Compression with an End-to-End GPU Oriented Neural Framework Ning Kang, Shanzhao Qiu, Shifeng Zhang, Zhenguo Li, Shu-Tao Xia
PDF
Pin the Memory: Learning to Generalize Semantic Segmentation Jin Kim, Jiyoung Lee, Jungin Park, Dongbo Min, Kwanghoon Sohn
PDF
PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence Zijian Dong, Chen Guo, Jie Song, Xu Chen, Andreas Geiger, Otmar Hilliges
PDF
Pix2NeRF: Unsupervised Conditional P-GAN for Single Image to Neural Radiance Fields Translation Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc Van Gool
PDF
Pixel Screening Based Intermediate Correction for Blind Deblurring Meina Zhang, Yingying Fang, Guoxi Ni, Tieyong Zeng
PDF
PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures Dan Hendrycks, Andy Zou, Mantas Mazeika, Leonard Tang, Bo Li, Dawn Song, Jacob Steinhardt
PDF
PLAD: Learning to Infer Shape Programs with Pseudo-Labels and Approximate Distributions R. Kenny Jones, Homer Walke, Daniel Ritchie
PDF
PlanarRecon: Real-Time 3D Plane Detection and Reconstruction from Posed Monocular Videos Yiming Xie, Matheus Gadelha, Fengting Yang, Xiaowei Zhou, Huaizu Jiang
PDF
PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo Jiachen Liu, Pan Ji, Nitin Bansal, Changjiang Cai, Qingan Yan, Xiaolei Huang, Yi Xu
PDF
Playable Environments: Video Manipulation in Space and Time Willi Menapace, Stéphane Lathuilière, Aliaksandr Siarohin, Christian Theobalt, Sergey Tulyakov, Vladislav Golyanik, Elisa Ricci
PDF
Plenoxels: Radiance Fields Without Neural Networks Sara Fridovich-Keil, Alex Yu, Matthew Tancik, Qinhong Chen, Benjamin Recht, Angjoo Kanazawa
PDF
PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction Zeren Sun, Fumin Shen, Dan Huang, Qiong Wang, Xiangbo Shu, Yazhou Yao, Jinhui Tang
PDF
POCO: Point Convolution for Surface Reconstruction Alexandre Boulch, Renaud Marlet
PDF
Point Cloud Color Constancy Xiaoyan Xing, Yanlin Qian, Sibo Feng, Yuhan Dong, Jiří Matas
PDF
Point Cloud Pre-Training with Natural 3D Structures Ryosuke Yamada, Hirokatsu Kataoka, Naoya Chiba, Yukiyasu Domae, Tetsuya Ogata
PDF
Point Density-Aware Voxels for LiDAR 3D Object Detection Jordan S. K. Hu, Tianshu Kuai, Steven L. Waslander
PDF
Point-BERT: Pre-Training 3D Point Cloud Transformers with Masked Point Modeling Xumin Yu, Lulu Tang, Yongming Rao, Tiejun Huang, Jie Zhou, Jiwen Lu
PDF
Point-Level Region Contrast for Object Detection Pre-Training Yutong Bai, Xinlei Chen, Alexander Kirillov, Alan Yuille, Alexander C. Berg
PDF
Point-NeRF: Point-Based Neural Radiance Fields Qiangeng Xu, Zexiang Xu, Julien Philip, Sai Bi, Zhixin Shu, Kalyan Sunkavalli, Ulrich Neumann
PDF
Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation Yuenan Hou, Xinge Zhu, Yuexin Ma, Chen Change Loy, Yikang Li
PDF
Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders Mikaela Angelina Uy, Yen-Yu Chang, Minhyuk Sung, Purvi Goel, Joseph G. Lambourne, Tolga Birdal, Leonidas J. Guibas
PDF
Point2Seq: Detecting 3D Objects as Sequences Yujing Xue, Jiageng Mao, Minzhe Niu, Hang Xu, Michael Bi Mi, Wei Zhang, Xiaogang Wang, Xinchao Wang
PDF
PointCLIP: Point Cloud Understanding by CLIP Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li
PDF
Pointly-Supervised Instance Segmentation Bowen Cheng, Omkar Parkhi, Alexander Kirillov
PDF
PokeBNN: A Binary Pursuit of Lightweight Accuracy Yichi Zhang, Zhiru Zhang, Lukasz Lew
PDF
Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk
PDF
Polymorphic-GAN: Generating Aligned Samples Across Multiple Domains with Learned Morph Maps Seung Wook Kim, Karsten Kreis, Daiqing Li, Antonio Torralba, Sanja Fidler
PDF
PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images Stefano Zorzi, Shabab Bazrafkan, Stefan Habenschuss, Friedrich Fraundorfer
PDF
PONI: Potential Functions for ObjectGoal Navigation with Interaction-Free Learning Santhosh Kumar Ramakrishnan, Devendra Singh Chaplot, Ziad Al-Halah, Jitendra Malik, Kristen Grauman
PDF
Pooling Revisited: Your Receptive Field Is Suboptimal Dong-Hwan Jang, Sanghyeok Chu, Joonhyuk Kim, Bohyung Han
PDF
Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian Jihyun Lee, Minhyuk Sung, Hyunjin Kim, Tae-Kyun Kim
PDF
Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data Junfeng Lyu, Zhibo Wang, Feng Xu
PDF
PoseKernelLifter: Metric Lifting of 3D Human Pose Using Sound Zhijian Yang, Xiaoran Fan, Volkan Isler, Hyun Soo Park
PDF
PoseTrack21: A Dataset for Person Search, Multi-Object Tracking and Multi-Person Pose Tracking Andreas Döring, Di Chen, Shanshan Zhang, Bernt Schiele, Jürgen Gall
PDF
PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision Kehong Gong, Bingbing Li, Jianfeng Zhang, Tao Wang, Jing Huang, Michael Bi Mi, Jiashi Feng, Xinchao Wang
PDF
PPDL: Predicate Probability Distribution Based Loss for Unbiased Scene Graph Generation Wei Li, Haiwei Zhang, Qijie Bai, Guoqing Zhao, Ning Jiang, Xiaojie Yuan
PDF
Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack Ye Liu, Yaya Cheng, Lianli Gao, Xianglong Liu, Qilong Zhang, Jingkuan Song
PDF
Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain Lina Guo, Xinjie Shi, Dailan He, Yuanyuan Wang, Rui Ma, Hongwei Qin, Yan Wang
PDF
Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation Jiankun Li, Peisen Wang, Pengfei Xiong, Tao Cai, Ziwei Yan, Lei Yang, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu
PDF
Pre-Train, Self-Train, Distill: A Simple Recipe for Supersizing 3D Reconstruction Kalyan Vasudev Alwala, Abhinav Gupta, Shubham Tulsiani
PDF
Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc Van Gool, Errui Ding
PDF
Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives Xinke Li, Henghui Ding, Zekun Tong, Yuwei Wu, Yeow Meng Chee
PDF
Privacy Preserving Partial Localization Marcel Geppert, Viktor Larsson, Johannes L. Schönberger, Marc Pollefeys
PDF
Privacy-Preserving Online AutoML for Domain-Specific Face Detection Chenqian Yan, Yuge Zhang, Quanlu Zhang, Yaming Yang, Xinyang Jiang, Yuqing Yang, Baoyuan Wang
PDF
Proactive Image Manipulation Detection Vishal Asnani, Xi Yin, Tal Hassner, Sijia Liu, Xiaoming Liu
PDF
Probabilistic Representations for Video Contrastive Learning Jungin Park, Jiyoung Lee, Ig-Jae Kim, Kwanghoon Sohn
PDF
Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences Prune Truong, Martin Danelljan, Fisher Yu, Luc Van Gool
PDF
Probing Representation Forgetting in Supervised and Unsupervised Continual Learning MohammadReza Davari, Nader Asadi, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky
PDF
Programmatic Concept Learning for Human Motion Description and Synthesis Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu
PDF
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, Limin Wang
PDF
Progressive End-to-End Object Detection in Crowded Scenes Anlin Zheng, Yuang Zhang, Xiangyu Zhang, Xiaojuan Qi, Jian Sun
PDF
Progressive Minimal Path Method with Embedded CNN Wei Liao
PDF
Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction Tiezheng Ma, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li
PDF
Projective Manifold Gradient Layer for Deep Rotation Regression Jiayi Chen, Yingda Yin, Tolga Birdal, Baoquan Chen, Leonidas J. Guibas, He Wang
PDF
Prompt Distribution Learning Yuning Lu, Jianzhuang Liu, Yonggang Zhang, Yajing Liu, Xinmei Tian
PDF
Propagation Regularizer for Semi-Supervised Learning with Extremely Scarce Labeled Samples Noo-ri Kim, Jee-Hyong Lee
PDF
Proper Reuse of Image Classification Features Improves Object Detection Cristina Vasconcelos, Vighnesh Birodkar, Vincent Dumoulin
PDF
ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues Hengcan Shi, Munawar Hayat, Yicheng Wu, Jianfei Cai
PDF
Protecting Celebrities from DeepFake with Identity Consistency Transformer Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Ting Zhang, Weiming Zhang, Nenghai Yu, Dong Chen, Fang Wen, Baining Guo
PDF
Protecting Facial Privacy: Generating Adversarial Identity Masks via Style-Robust Makeup Transfer Shengshan Hu, Xiaogeng Liu, Yechao Zhang, Minghui Li, Leo Yu Zhang, Hai Jin, Libing Wu
PDF
Proto2Proto: Can You Recognize the Car, the Way I Do? Monish Keswani, Sriranjani Ramakrishnan, Nishant Reddy, Vineeth N Balasubramanian
PDF
Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding Haojun Jiang, Yuanze Lin, Dongchen Han, Shiji Song, Gao Huang
PDF
Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving Yi-Nan Chen, Hang Dai, Yong Ding
PDF
PSMNet: Position-Aware Stereo Merging Network for Room Layout Estimation Haiyan Wang, Will Hutchcroft, Yuguang Li, Zhiqiang Wan, Ivaylo Boyadzhiev, Yingli Tian, Sing Bing Kang
PDF
PSTR: End-to-End One-Step Person Search with Transformers Jiale Cao, Yanwei Pang, Rao Muhammad Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, Fahad Shahbaz Khan
PDF
PTTR: Relational 3D Point Cloud Object Tracking with Transformer Changqing Zhou, Zhipeng Luo, Yueru Luo, Tianrui Liu, Liang Pan, Zhongang Cai, Haiyu Zhao, Shijian Lu
PDF
PubTables-1m: Towards Comprehensive Table Extraction from Unstructured Documents Brandon Smock, Rohith Pesala, Robin Abraham
PDF
PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors Jérome Revaud, Vincent Leroy, Philippe Weinzaepfel, Boris Chidlovskii
PDF
Pushing the Envelope of Gradient Boosting Forests via Globally-Optimized Oblique Trees Magzhan Gabidolla, Miguel Á. Carreira-Perpiñán
PDF
Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference Shell Xu Hu, Da Li, Jan Stühmer, Minyoung Kim, Timothy M. Hospedales
PDF
Pushing the Performance Limit of Scene Text Recognizer Without Human Annotation Caiyuan Zheng, Hui Li, Seon-Min Rhee, Seungju Han, Jae-Joon Han, Peng Wang
PDF
Putting People in Their Place: Monocular Regression of 3D People in Depth Yu Sun, Wu Liu, Qian Bao, Yili Fu, Tao Mei, Michael J. Black
PDF
PyMiceTracking: An Open-Source Toolbox for Real-Time Behavioral Neuroscience Experiments Richardson Menezes, Aron de Miranda, Helton Maia
PDF
Pyramid Adversarial Training Improves ViT Performance Charles Herrmann, Kyle Sargent, Lu Jiang, Ramin Zabih, Huiwen Chang, Ce Liu, Dilip Krishnan, Deqing Sun
PDF
Pyramid Architecture for Multi-Scale Processing in Point Cloud Segmentation Dong Nie, Rui Lan, Ling Wang, Xiaofeng Ren
PDF
Pyramid Grafting Network for One-Stage High Resolution Saliency Detection Chenxi Xie, Changqun Xia, Mingcan Ma, Zhirui Zhao, Xiaowu Chen, Jia Li
PDF
QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation Xueqi Hu, Xinyue Zhou, Qiusheng Huang, Zhengyi Shi, Li Sun, Qingli Li
PDF
Quantifying Societal Bias Amplification in Image Captioning Yusuke Hirota, Yuta Nakashima, Noa Garcia
PDF
Quantization-Aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging Lingen Li, Lizhi Wang, Weitao Song, Lei Zhang, Zhiwei Xiong, Hua Huang
PDF
Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free Tianlong Chen, Zhenyu Zhang, Yihua Zhang, Shiyu Chang, Sijia Liu, Zhangyang Wang
PDF
Query and Attention Augmentation for Knowledge-Based Explainable Reasoning Yifeng Zhang, Ming Jiang, Qi Zhao
PDF
QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection Chenhongyi Yang, Zehao Huang, Naiyan Wang
PDF
R(Det)2: Randomized Decision Routing for Object Detection Yali Li, Shengjin Wang
PDF
RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising Michael Schelling, Pedro Hermosilla, Timo Ropinski
PDF
RAGO: Recurrent Graph Optimizer for Multiple Rotation Averaging Heng Li, Zhaopeng Cui, Shuaicheng Liu, Ping Tan
PDF
RAMA: A Rapid Multicut Algorithm on GPU Ahmed Abbas, Paul Swoboda
PDF
Ranking Distance Calibration for Cross-Domain Few-Shot Learning Pan Li, Shaogang Gong, Chengjie Wang, Yanwei Fu
PDF
Ranking-Based Siamese Visual Tracking Feng Tang, Qiang Ling
PDF
Raw High-Definition Radar for Multi-Task Learning Julien Rebut, Arthur Ouaknine, Waqas Malik, Patrick Pérez
PDF
Ray Priors Through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation Jian Zhang, Yuanqing Zhang, Huan Fu, Xiaowei Zhou, Bowen Cai, Jinchi Huang, Rongfei Jia, Binqiang Zhao, Xing Tang
PDF
Ray3D: Ray-Based 3D Human Pose Estimation for Monocular Absolute 3D Localization Yu Zhan, Fenghai Li, Renliang Weng, Wongun Choi
PDF
RayMVSNet: Learning Ray-Based 1d Implicit Fields for Accurate Multi-View Stereo Junhua Xi, Yifei Shi, Yijie Wang, Yulan Guo, Kai Xu
PDF
RBGNet: Ray-Based Grouping for 3D Object Detection Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang
PDF
RCL: Recurrent Continuous Localization for Temporal Action Detection Qiang Wang, Yanhao Zhang, Yun Zheng, Pan Pan
PDF
RCP: Recurrent Closest Point for Point Cloud Xiaodong Gu, Chengzhou Tang, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Ping Tan
PDF
Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation Akam Rahimi, Triantafyllos Afouras, Andrew Zisserman
PDF
Real-Time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders Maksim Makarenko, Arturo Burguete-Lopez, Qizhou Wang, Fedor Getman, Silvio Giancola, Bernard Ghanem, Andrea Fratalocchi
PDF
Real-Time Object Detection for Streaming Perception Jinrong Yang, Songtao Liu, Zeming Li, Xiaoping Li, Jian Sun
PDF
Real-Time, Accurate, and Consistent Video Semantic Segmentation via Unsupervised Adaptation and Cross-Unit Deployment on Mobile Device Hyojin Park, Alan Yessenbayev, Tushar Singhal, Navin Kumar Adhikari, Yizhe Zhang, Shubhankar Mangesh Borse, Hong Cai, Nilesh Prasad Pandey, Fei Yin, Frank Mayer, Balaji Calidas, Fatih Porikli
PDF
Recall@k Surrogate Loss with Large Batches and Similarity Mixup Yash Patel, Giorgos Tolias, Jiří Matas
PDF
RecDis-SNN: Rectifying Membrane Potential Distribution for Directly Training Spiking Neural Networks Yufei Guo, Xinyi Tong, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Zhe Ma, Xuhui Huang
PDF
Reconstructing Surfaces for Sparse Point Clouds with On-Surface Priors Baorui Ma, Yu-Shen Liu, Zhizhong Han
PDF
Recurrent Dynamic Embedding for Video Object Segmentation Mingxing Li, Li Hu, Zhiwei Xiong, Bang Zhang, Pan Pan, Dong Liu
PDF
Recurrent Glimpse-Based Decoder for Detection with Transformer Zhe Chen, Jing Zhang, Dacheng Tao
PDF
Recurrent Variational Network: A Deep Learning Inverse Problem Solver Applied to the Task of Accelerated MRI Reconstruction George Yiasemis, Jan-Jakob Sonke, Clarisa Sánchez, Jonas Teuwen
PDF
Recurring the Transformer for Video Action Recognition Jiewen Yang, Xingbo Dong, Liujun Liu, Chao Zhang, Jiajun Shen, Dahai Yu
PDF
Reduce Information Loss in Transformers for Pluralistic Image Inpainting Qiankun Liu, Zhentao Tan, Dongdong Chen, Qi Chu, Xiyang Dai, Yinpeng Chen, Mengchen Liu, Lu Yuan, Nenghai Yu
PDF
Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields Dor Verbin, Peter Hedman, Ben Mildenhall, Todd Zickler, Jonathan T. Barron, Pratul P. Srinivasan
PDF
Reference-Based Video Super-Resolution Using Multi-Camera Video Triplets Junyong Lee, Myeonghee Lee, Sunghyun Cho, Seungyong Lee
PDF
Reflash Dropout in Image Super-Resolution Xiangtao Kong, Xina Liu, Jinjin Gu, Yu Qiao, Chao Dong
PDF
Reflection and Rotation Symmetry Detection via Equivariant Learning Ahyun Seo, Byungjin Kim, Suha Kwak, Minsu Cho
PDF
Region-Aware Face Swapping Chao Xu, Jiangning Zhang, Miao Hua, Qian He, Zili Yi, Yong Liu
PDF
Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation Tianfei Zhou, Meijie Zhang, Fang Zhao, Jianwu Li
PDF
RegionCLIP: Region-Based Language-Image Pretraining Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao
PDF
Registering Explicit to Implicit: Towards High-Fidelity Garment Mesh Reconstruction from Single Images Heming Zhu, Lingteng Qiu, Yuda Qiu, Xiaoguang Han
PDF
RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs Michael Niemeyer, Jonathan T. Barron, Ben Mildenhall, Mehdi S. M. Sajjadi, Andreas Geiger, Noha Radwan
PDF
REGTR: End-to-End Point Cloud Correspondences with Transformers Zi Jian Yew, Gim Hee Lee
PDF
Reinforced Structured State-Evolution for Vision-Language Navigation Jinyu Chen, Chen Gao, Erli Meng, Qiong Zhang, Si Liu
PDF
Relative Pose from a Calibrated and an Uncalibrated Smartphone Image Yaqing Ding, Daniel Barath, Jian Yang, Zuzana Kukelova
PDF
Relieving Long-Tailed Instance Segmentation via Pairwise Class Balance Yin-Yin He, Peizhen Zhang, Xiu-Shen Wei, Xiangyu Zhang, Jian Sun
PDF
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition Jun Chen, Aniket Agarwal, Sherif Abdelkarim, Deyao Zhu, Mohamed Elhoseiny
PDF
Remember Intentions: Retrospective-Memory-Based Trajectory Prediction Chenxin Xu, Weibo Mao, Wenjun Zhang, Siheng Chen
PDF
Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer Wenjian Wang, Lijuan Duan, Yuxi Wang, Qing En, Junsong Fan, Zhaoxiang Zhang
PDF
RendNet: Unified 2D/3D Recognizer with Latent Space Rendering Ruoxi Shi, Xinyang Jiang, Caihua Shan, Yansen Wang, Dongsheng Li
PDF
Rep-Net: Efficient On-Device Learning via Feature Reprogramming Li Yang, Adnan Siraj Rakin, Deliang Fan
PDF
RePaint: Inpainting Using Denoising Diffusion Probabilistic Models Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc Van Gool
PDF
Replacing Labeled Real-Image Datasets with Auto-Generated Contours Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio Yokota
PDF
RepMLPNet: Hierarchical Vision MLP with Re-Parameterized Locality Xiaohan Ding, Honghao Chen, Xiangyu Zhang, Jungong Han, Guiguang Ding
PDF
Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting Min Shi, Hao Lu, Chen Feng, Chengxin Liu, Zhiguo Cao
PDF
Representation Compensation Networks for Continual Semantic Segmentation Chang-Bin Zhang, Jia-Wen Xiao, Xialei Liu, Ying-Cong Chen, Ming-Ming Cheng
PDF
Representing 3D Shapes with Probabilistic Directed Distance Fields Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan D. Jepson
PDF
ResSFL: A Resistance Transfer Framework for Defending Model Inversion Attack in Split Federated Learning Jingtao Li, Adnan Siraj Rakin, Xing Chen, Zhezhi He, Deliang Fan, Chaitali Chakrabarti
PDF
RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs Zhouxia Wang, Jiawei Zhang, Runjian Chen, Wenping Wang, Ping Luo
PDF
Restormer: Efficient Transformer for High-Resolution Image Restoration Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang
PDF
ReSTR: Convolution-Free Referring Image Segmentation Using Transformers Namyup Kim, Dongwon Kim, Cuiling Lan, Wenjun Zeng, Suha Kwak
PDF
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning Liangqiong Qu, Yuyin Zhou, Paul Pu Liang, Yingda Xia, Feifei Wang, Ehsan Adeli, Li Fei-Fei, Daniel Rubin
PDF
Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image Segmentation Jianfeng Wang, Thomas Lukasiewicz
PDF
Rethinking Controllable Variational Autoencoders Huajie Shao, Yifei Yang, Haohong Lin, Longzhong Lin, Yizhuo Chen, Qinmin Yang, Han Zhao
PDF
Rethinking Deep Face Restoration Yang Zhao, Yu-Chuan Su, Chun-Te Chu, Yandong Li, Marius Renn, Yukun Zhu, Changyou Chen, Xuhui Jia
PDF
Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation Rui Peng, Rongjie Wang, Zhenyu Wang, Yawen Lai, Ronggang Wang
PDF
Rethinking Efficient Lane Detection via Curve Modeling Zhengyang Feng, Shaohua Guo, Xin Tan, Ke Xu, Min Wang, Lizhuang Ma
PDF
Rethinking Image Cropping: Exploring Diverse Compositions from Global Views Gengyun Jia, Huaibo Huang, Chaoyou Fu, Ran He
PDF
Rethinking Minimal Sufficient Representation in Contrastive Learning Haoqing Wang, Xun Guo, Zhi-Hong Deng, Yan Lu
PDF
Rethinking Reconstruction Autoencoder-Based Out-of-Distribution Detection Yibo Zhou
PDF
Rethinking Semantic Segmentation: A Prototype View Tianfei Zhou, Wenguan Wang, Ender Konukoglu, Luc Van Gool
PDF
Rethinking Spatial Invariance of Convolutional Networks for Object Counting Zhi-Qi Cheng, Qi Dai, Hong Li, Jingkuan Song, Xiao Wu, Alexander G. Hauptmann
PDF
Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded Views Junbo Zhang, Kaisheng Ma
PDF
Rethinking Visual Geo-Localization for Large-Scale Applications Gabriele Berton, Carlo Masone, Barbara Caputo
PDF
Retrieval Augmented Classification for Long-Tail Visual Recognition Alexander Long, Wei Yin, Thalaiyasingam Ajanthan, Vu Nguyen, Pulak Purkait, Ravi Garg, Alan Blair, Chunhua Shen, Anton van den Hengel
PDF
Retrieval-Based Spatially Adaptive Normalization for Semantic Image Synthesis Yupeng Shi, Xiao Liu, Yuxiang Wei, Zhongqin Wu, Wangmeng Zuo
PDF
Reusing the Task-Specific Classifier as a Discriminator: Discriminator-Free Adversarial Domain Adaptation Lin Chen, Huaian Chen, Zhixiang Wei, Xin Jin, Xiao Tan, Yi Jin, Enhong Chen
PDF
Revealing Occlusions with 4D Neural Fields Basile Van Hoorick, Purva Tendulkar, Dídac Surís, Dennis Park, Simon Stent, Carl Vondrick
PDF
Reversible Vision Transformers Karttikeya Mangalam, Haoqi Fan, Yanghao Li, Chao-Yuan Wu, Bo Xiong, Christoph Feichtenhofer, Jitendra Malik
PDF
Revisiting AP Loss for Dense Object Detection: Adaptive Ranking Pair Selection Dongli Xu, Jinhong Deng, Wen Li
PDF
Revisiting Document Image Dewarping by Grid Regularization Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song Xia
PDF
Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective Jiawei Zhang, Xiang Wang, Xiao Bai, Chen Wang, Lei Huang, Yimin Chen, Lin Gu, Jun Zhou, Tatsuya Harada, Edwin R. Hancock
PDF
Revisiting Learnable Affines for Batch Norm in Few-Shot Transfer Learning Moslem Yazdanpanah, Aamer Abdul Rahman, Muawiz Chaudhary, Christian Desrosiers, Mohammad Havaei, Eugene Belilovsky, Samira Ebrahimi Kahou
PDF
Revisiting Near/Remote Sensing with Geospatial Attention Scott Workman, M. Usman Rafique, Hunter Blanton, Nathan Jacobs
PDF
Revisiting Random Channel Pruning for Neural Network Compression Yawei Li, Kamil Adamczewski, Wen Li, Shuhang Gu, Radu Timofte, Luc Van Gool
PDF
Revisiting Skeleton-Based Action Recognition Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai
PDF
Revisiting Temporal Alignment for Video Restoration Kun Zhou, Wenbo Li, Liying Lu, Xiaoguang Han, Jiangbo Lu
PDF
Revisiting the "Video" in Video-Language Understanding Shyamal Buch, Cristóbal Eyzaguirre, Adrien Gaidon, Jiajun Wu, Li Fei-Fei, Juan Carlos Niebles
PDF
Revisiting the Transferability of Supervised Pretraining: An MLP Perspective Yizhou Wang, Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Donglian Qi, Wanli Ouyang
PDF
Revisiting Weakly Supervised Pre-Training of Visual Perception Models Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross Girshick, Piotr Dollár, Laurens van der Maaten
PDF
REX: Reasoning-Aware and Grounded Explanation Shi Chen, Qi Zhao
PDF
RFNet: Unsupervised Network for Mutually Reinforcing Multi-Modal Image Registration and Fusion Han Xu, Jiayi Ma, Jiteng Yuan, Zhuliang Le, Wei Liu
PDF
RGB-Depth Fusion GAN for Indoor Depth Completion Haowen Wang, Mingyuan Wang, Zhengping Che, Zhiyuan Xu, Xiuquan Qiao, Mengshi Qi, Feifei Feng, Jian Tang
PDF
RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano
PDF
RIDDLE: LiDAR Data Compression with Range Image Deep Delta Encoding Xuanyu Zhou, Charles R. Qi, Yin Zhou, Dragomir Anguelov
PDF
RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior Ruibo Li, Chi Zhang, Guosheng Lin, Zhe Wang, Chunhua Shen
PDF
RigNeRF: Fully Controllable Neural 3D Portraits ShahRukh Athar, Zexiang Xu, Kalyan Sunkavalli, Eli Shechtman, Zhixin Shu
PDF
RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures Chengjie Niu, Manyi Li, Kai Xu, Hao Zhang
PDF
RIO: Rotation-Equivariance Supervised Learning of Robust Inertial Odometry Xiya Cao, Caifa Zhou, Dandan Zeng, Yongliang Wang
PDF
RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes Tak-Wai Hui
PDF
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li
PDF
Robust and Accurate Superquadric Recovery: A Probabilistic Approach Weixiao Liu, Yuwei Wu, Sipu Ruan, Gregory S. Chirikjian
PDF
Robust Combination of Distributed Gradients Under Adversarial Perturbations Kwang In Kim
PDF
Robust Contrastive Learning Against Noisy Views Ching-Yao Chuang, R Devon Hjelm, Xin Wang, Vibhav Vineet, Neel Joshi, Antonio Torralba, Stefanie Jegelka, Yale Song
PDF
Robust Cross-Modal Representation Learning with Progressive Self-Distillation Alex Andonian, Shixing Chen, Raffay Hamid
PDF
Robust Egocentric Photo-Realistic Facial Expression Transfer for Virtual Reality Amin Jourabloo, Fernando De la Torre, Jason Saragih, Shih-En Wei, Stephen Lombardi, Te-Li Wang, Danielle Belko, Autumn Trimble, Hernan Badino
PDF
Robust Equivariant Imaging: A Fully Unsupervised Framework for Learning to Image from Noisy and Partial Measurements Dongdong Chen, Julián Tachella, Mike E. Davies
PDF
Robust Federated Learning with Noisy and Heterogeneous Clients Xiuwen Fang, Mang Ye
PDF
Robust Fine-Tuning of Zero-Shot Models Mitchell Wortsman, Gabriel Ilharco, Jong Wook Kim, Mike Li, Simon Kornblith, Rebecca Roelofs, Raphael Gontijo Lopes, Hannaneh Hajishirzi, Ali Farhadi, Hongseok Namkoong, Ludwig Schmidt
PDF
Robust Image Forgery Detection over Online Social Network Shared Images Haiwei Wu, Jiantao Zhou, Jinyu Tian, Jun Liu
PDF
Robust Invertible Image Steganography Youmin Xu, Chong Mou, Yujie Hu, Jingfen Xie, Jian Zhang
PDF
Robust Optimization as Data Augmentation for Large-Scale Graphs Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein
PDF
Robust Outlier Detection by De-Biasing VAE Likelihoods Kushal Chauhan, Barath Mohan U, Pradeep Shenoy, Manish Gupta, Devarajan Sridharan
PDF
Robust Region Feature Synthesizer for Zero-Shot Object Detection Peiliang Huang, Junwei Han, De Cheng, Dingwen Zhang
PDF
Robust Structured Declarative Classifiers for 3D Point Clouds: Defending Adversarial Attacks with Implicit Gradients Kaidong Li, Ziming Zhang, Cuncong Zhong, Guanghui Wang
PDF
ROCA: Robust CAD Model Retrieval and Alignment from a Single Image Can Gümeli, Angela Dai, Matthias Nießner
PDF
Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task Xiaoqing Ye, Mao Shu, Hanyu Li, Yifeng Shi, Yingying Li, Guangjie Wang, Xiao Tan, Errui Ding
PDF
Rotationally Equivariant 3D Object Detection Hong-Xing Yu, Jiajun Wu, Li Yi
PDF
RSCFed: Random Sampling Consensus Federated Semi-Supervised Learning Xiaoxiao Liang, Yiqun Lin, Huazhu Fu, Lei Zhu, Xiaomeng Li
PDF
RSTT: Real-Time Spatial Temporal Transformer for Space-Time Video Super-Resolution Zhicheng Geng, Luming Liang, Tianyu Ding, Ilya Zharkov
PDF
RU-Net: Regularized Unrolling Network for Scene Graph Generation Xin Lin, Changxing Ding, Jing Zhang, Yibing Zhan, Dacheng Tao
PDF
Safe Self-Refinement for Transformer-Based Domain Adaptation Tao Sun, Cheng Lu, Tianshuo Zhang, Haibin Ling
PDF
Safe-Student for Safe Deep Semi-Supervised Learning with Unseen-Class Unlabeled Data Rundong He, Zhongyi Han, Xiankai Lu, Yilong Yin
PDF
Salient-to-Broad Transition for Video Person Re-Identification Shutao Bai, Bingpeng Ma, Hong Chang, Rui Huang, Xilin Chen
PDF
Salvage of Supervision in Weakly Supervised Object Detection Lin Sui, Chen-Lin Zhang, Jianxin Wu
PDF
SAR-Net: Shape Alignment and Recovery Network for Category-Level 6d Object Pose and Size Estimation Haitao Lin, Zichang Liu, Chilam Cheang, Yanwei Fu, Guodong Guo, Xiangyang Xue
PDF
SASIC: Stereo Image Compression with Latent Shifts and Stereo Attention Matthias Wödlinger, Jan Kotera, Jan Xu, Robert Sablatnig
PDF
SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao
PDF
Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels Yikai Wang, Xinwei Sun, Yanwei Fu
PDF
Scale-Equivalent Distillation for Semi-Supervised Object Detection Qiushan Guo, Yao Mu, Jianyu Chen, Tianqi Wang, Yizhou Yu, Ping Luo
PDF
ScaleNet: A Shallow Architecture for Scale Estimation Axel Barroso-Laguna, Yurun Tian, Krystian Mikolajczyk
PDF
Scaling up Vision-Language Pre-Training for Image Captioning Xiaowei Hu, Zhe Gan, Jianfeng Wang, Zhengyuan Yang, Zicheng Liu, Yumao Lu, Lijuan Wang
PDF
Scaling up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs Xiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding
PDF
Scaling Vision Transformers Xiaohua Zhai, Alexander Kolesnikov, Neil Houlsby, Lucas Beyer
PDF
Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal Mahmood
PDF
Scanline Homographies for Rolling-Shutter Plane Absolute Pose Fang Bai, Agniva Sengupta, Adrien Bartoli
PDF
ScanQA: 3D Question Answering for Spatial Scene Understanding Daichi Azuma, Taiki Miyanishi, Shuhei Kurita, Motoaki Kawanabe
PDF
Scene Consistency Representation Learning for Video Scene Segmentation Haoqian Wu, Keyu Chen, Yanan Luo, Ruizhi Qiao, Bo Ren, Haozhe Liu, Weicheng Xie, Linlin Shen
PDF
Scene Graph Expansion for Semantics-Guided Image Outpainting Chiao-An Yang, Cheng-Yo Tan, Wan-Cyuan Fan, Cheng-Fu Yang, Meng-Lin Wu, Yu-Chiang Frank Wang
PDF
Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations Mehdi S. M. Sajjadi, Henning Meyer, Etienne Pot, Urs Bergmann, Klaus Greff, Noha Radwan, Suhani Vora, Mario Lučić, Daniel Duckworth, Alexey Dosovitskiy, Jakob Uszkoreit, Thomas Funkhouser, Andrea Tagliasacchi
PDF
SceneSqueezer: Learning to Compress Scene for Camera Relocalization Luwei Yang, Rakesh Shrestha, Wenbo Li, Shuaicheng Liu, Guofeng Zhang, Zhaopeng Cui, Ping Tan
PDF
Scenic: A JAX Library for Computer Vision Research and Beyond Mostafa Dehghani, Alexey Gritsenko, Anurag Arnab, Matthias Minderer, Yi Tay
PDF
ScePT: Scene-Consistent, Policy-Based Trajectory Predictions for Planning Yuxiao Chen, Boris Ivanovic, Marco Pavone
PDF
Scribble-Supervised LiDAR Semantic Segmentation Ozan Unal, Dengxin Dai, Luc Van Gool
PDF
SCS-Co: Self-Consistent Style Contrastive Learning for Image Harmonization Yucheng Hang, Bin Xia, Wenming Yang, Qingmin Liao
PDF
Searching the Deployable Convolution Neural Networks for GPUs Linnan Wang, Chenhan Yu, Satish Salian, Slawomir Kierat, Szymon Migacz, Alex Fit Florea
PDF
SEEG: Semantic Energized Co-Speech Gesture Generation Yuanzhi Liang, Qianyu Feng, Linchao Zhu, Li Hu, Pan Pan, Yi Yang
PDF
SeeThroughNet: Resurrection of Auxiliary Loss by Preserving Class Probability Information Dasol Han, Jaewook Yoo, Dokwan Oh
PDF
Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks with Robust Patch Detection Jiang Liu, Alexander Levine, Chun Pong Lau, Rama Chellappa, Soheil Feizi
PDF
Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation Anirud Thyagharajan, Benjamin Ummenhofer, Prashant Laddha, Om Ji Omer, Sreenivas Subramoney
PDF
Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way Qi Jia, Shuilian Yao, Yu Liu, Xin Fan, Risheng Liu, Zhongxuan Luo
PDF
Selective-Supervised Contrastive Learning with Noisy Labels Shikun Li, Xiaobo Xia, Shiming Ge, Tongliang Liu
PDF
Self-Augmented Unpaired Image Dehazing via Density and Depth Decomposition Yang Yang, Chaoyue Wang, Risheng Liu, Lin Zhang, Xiaojie Guo, Dacheng Tao
PDF
Self-Distillation from the Last Mini-Batch for Consistency Regularization Yiqing Shen, Liwu Xu, Yuzhe Yang, Yaqian Li, Yandong Guo
PDF
Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation Wenbo Zhao, Xianming Liu, Zhiwei Zhong, Junjun Jiang, Wei Gao, Ge Li, Xiangyang Ji
PDF
Self-Supervised Bulk Motion Artifact Removal in Optical Coherence Tomography Angiography Jiaxiang Ren, Kicheon Park, Yingtian Pan, Haibin Ling
PDF
Self-Supervised Correlation Mining Network for Person Image Generation Zijian Wang, Xingqun Qi, Kun Yuan, Muyi Sun
PDF
Self-Supervised Deep Image Restoration via Adaptive Stochastic Gradient Langevin Dynamics Weixi Wang, Ji Li, Hui Ji
PDF
Self-Supervised Dense Consistency Regularization for Image-to-Image Translation Minsu Ko, Eunju Cha, Sungjoo Suh, Huijin Lee, Jae-Joon Han, Jinwoo Shin, Bohyung Han
PDF
Self-Supervised Equivariant Learning for Oriented Keypoint Detection Jongmin Lee, Byungjin Kim, Minsu Cho
PDF
Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels Hehe Fan, Xiaojun Chang, Wanyue Zhang, Yi Cheng, Ying Sun, Mohan Kankanhalli
PDF
Self-Supervised Image Representation Learning with Geometric Set Consistency Nenglun Chen, Lei Chu, Hao Pan, Yan Lu, Wenping Wang
PDF
Self-Supervised Image-Specific Prototype Exploration for Weakly Supervised Semantic Segmentation Qi Chen, Lingxiao Yang, Jian-Huang Lai, Xiaohua Xie
PDF
Self-Supervised Keypoint Discovery in Behavioral Videos Jennifer J. Sun, Serim Ryou, Roni H. Goldshmid, Brandon Weissbourd, John O. Dabiri, David J. Anderson, Ann Kennedy, Yisong Yue, Pietro Perona
PDF
Self-Supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection Liang Chen, Yong Zhang, Yibing Song, Lingqiao Liu, Jue Wang
PDF
Self-Supervised Learning of Object Parts for Semantic Segmentation Adrian Ziegler, Yuki M. Asano
PDF
Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks Peri Akiva, Matthew Purri, Matthew Leotta
PDF
Self-Supervised Models Are Continual Learners Enrico Fini, Victor G. Turrisi da Costa, Xavier Alameda-Pineda, Elisa Ricci, Karteek Alahari, Julien Mairal
PDF
Self-Supervised Neural Articulated Shape and Appearance Models Fangyin Wei, Rohan Chabra, Lingni Ma, Christoph Lassner, Michael Zollhöfer, Szymon Rusinkiewicz, Chris Sweeney, Richard Newcombe, Mira Slavcheva
PDF
Self-Supervised Object Detection from Audio-Visual Correspondence Triantafyllos Afouras, Yuki M. Asano, Francois Fagan, Andrea Vedaldi, Florian Metze
PDF
Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis Yucheng Tang, Dong Yang, Wenqi Li, Holger R. Roth, Bennett Landman, Daguang Xu, Vishwesh Nath, Ali Hatamizadeh
PDF
Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection Nicolae-Cătălin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah
PDF
Self-Supervised Spatial Reasoning on Multi-View Line Drawings Siyuan Xiang, Anbang Yang, Yanfei Xue, Yaoqing Yang, Chen Feng
PDF
Self-Supervised Super-Resolution for Multi-Exposure Push-Frame Satellites Ngoc Long Nguyen, Jérémy Anger, Axel Davy, Pablo Arias, Gabriele Facciolo
PDF
Self-Supervised Transformers for Unsupervised Object Discovery Using Normalized Cut Yangtao Wang, Xi Shen, Shell Xu Hu, Yuan Yuan, James L. Crowley, Dominique Vaufreydaz
PDF
Self-Supervised Video Transformer Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Michael S. Ryoo
PDF
Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning Kai Zhu, Wei Zhai, Yang Cao, Jiebo Luo, Zheng-Jun Zha
PDF
Self-Taught Metric Learning Without Labels Sungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak
PDF
SelfD: Self-Learning Large-Scale Driving Policies from the Web Jimuyang Zhang, Ruizhao Zhu, Eshed Ohn-Bar
PDF
SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video Boyi Jiang, Yang Hong, Hujun Bao, Juyong Zhang
PDF
SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation Ziyi Wang, Yongming Rao, Xumin Yu, Jie Zhou, Jiwen Lu
PDF
Semantic Segmentation by Early Region Proxy Yifan Zhang, Bo Pang, Cewu Lu
PDF
Semantic-Aligned Fusion Transformer for One-Shot Object Detection Yizhou Zhao, Xun Guo, Yan Lu
PDF
Semantic-Aware Auto-Encoders for Self-Supervised Representation Learning Guangrun Wang, Yansong Tang, Liang Lin, Philip H.S. Torr
PDF
Semantic-Aware Domain Generalized Segmentation Duo Peng, Yinjie Lei, Munawar Hayat, Yulan Guo, Wen Li
PDF
Semantic-Shape Adaptive Feature Modulation for Semantic Image Synthesis Zhengyao Lv, Xiaoming Li, Zhenxing Niu, Bing Cao, Wangmeng Zuo
PDF
SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen
PDF
Semi-Supervised Few-Shot Learning via Multi-Factor Clustering Jie Ling, Lei Liao, Meng Yang, Jia Shuai
PDF
Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels Jiwon Kim, Kwangrok Ryoo, Junyoung Seo, Gyuseong Lee, Daehwan Kim, Hansang Cho, Seungryong Kim
PDF
Semi-Supervised Object Detection via Multi-Instance Alignment with Global Class Prototypes Aoxue Li, Peng Yuan, Zhenguo Li
PDF
Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels Yuchao Wang, Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Guoqiang Jin, Liwei Wu, Rui Zhao, Xinyi Le
PDF
Semi-Supervised Semantic Segmentation with Error Localization Network Donghyeon Kwon, Suha Kwak
PDF
Semi-Supervised Video Paragraph Grounding with Contrastive Encoder Xun Jiang, Xing Xu, Jingran Zhang, Fumin Shen, Zuo Cao, Heng Tao Shen
PDF
Semi-Supervised Video Semantic Segmentation with Inter-Frame Feature Reconstruction Jiafan Zhuang, Zilei Wang, Yuan Gao
PDF
Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer Fushun Zhu, Shan Zhao, Peng Wang, Hao Wang, Hua Yan, Shuaicheng Liu
PDF
Semi-Weakly-Supervised Learning of Complex Actions from Instructional Task Videos Yuhan Shen, Ehsan Elhamifar
PDF
Semiconductor Defect Detection by Hybrid Classical-Quantum Deep Learning Yuan-Fu Yang, Min Sun
PDF
Sequential Voting with Relational Box Fields for Active Object Detection Qichen Fu, Xingyu Liu, Kris Kitani
PDF
Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order Consistency Zijia Lu, Ehsan Elhamifar
PDF
SGTR: End-to-End Scene Graph Generation with Transformer Rongjie Li, Songyang Zhang, Xuming He
PDF
Shadows Can Be Dangerous: Stealthy and Effective Physical-World Adversarial Attack by Natural Phenomenon Yiqi Zhong, Xianming Liu, Deming Zhai, Junjun Jiang, Xiangyang Ji
PDF
Shape from Polarization for Complex Scenes in the Wild Chenyang Lei, Chenyang Qi, Jiaxin Xie, Na Fan, Vladlen Koltun, Qifeng Chen
PDF
Shape from Thermal Radiation: Passive Ranging Using Multi-Spectral LWIR Measurements Yasuto Nagase, Takahiro Kushida, Kenichiro Tanaka, Takuya Funatomi, Yasuhiro Mukaigawa
PDF
Shape-Invariant 3D Adversarial Point Clouds Qidong Huang, Xiaoyi Dong, Dongdong Chen, Hang Zhou, Weiming Zhang, Nenghai Yu
PDF
ShapeFormer: Transformer-Based Shape Completion via Sparse Representation Xingguang Yan, Liqiang Lin, Niloy J. Mitra, Dani Lischinski, Daniel Cohen-Or, Hui Huang
PDF
Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search Han Xiao, Ziwei Wang, Zheng Zhu, Jie Zhou, Jiwen Lu
PDF
SharpContour: A Contour-Based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation Chenming Zhu, Xuanye Zhang, Yanran Li, Liangdong Qiu, Kai Han, Xiaoguang Han
PDF
SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc Van Gool, Bernt Schiele, Federico Tombari, Fisher Yu
PDF
Shifting More Attention to Visual Backbone: Query-Modulated Refinement Networks for End-to-End Visual Grounding Jiabo Ye, Junfeng Tian, Ming Yan, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin
PDF
Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning Ligong Han, Jian Ren, Hsin-Ying Lee, Francesco Barbieri, Kyle Olszewski, Shervin Minaee, Dimitris Metaxas, Sergey Tulyakov
PDF
Show, Deconfound and Tell: Image Captioning with Causal Inference Bing Liu, Dong Wang, Xu Yang, Yong Zhou, Rui Yao, Zhiwen Shao, Jiaqi Zhao
PDF
Shunted Self-Attention via Multi-Scale Token Aggregation Sucheng Ren, Daquan Zhou, Shengfeng He, Jiashi Feng, Xinchao Wang
PDF
Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning Xiangyu Li, Xu Yang, Kun Wei, Cheng Deng, Muli Yang
PDF
SIGMA: Semantic-Complete Graph Matching for Domain Adaptive Object Detection Wuyang Li, Xinyu Liu, Yixuan Yuan
PDF
Sign Language Video Retrieval with Free-Form Textual Queries Amanda Duarte, Samuel Albanie, Xavier Giró-i-Nieto, Gül Varol
PDF
Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production Ben Saunders, Necati Cihan Camgoz, Richard Bowden
PDF
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization Canjie Luo, Lianwen Jin, Jingdong Chen
PDF
SIMBAR: Single Image-Based Scene Relighting for Effective Data Augmentation for Automated Driving Vision Tasks Xianling Zhang, Nathan Tseng, Ameerah Syed, Rohan Bhasin, Nikita Jaipuria
PDF
SimMatch: Semi-Supervised Learning with Similarity Matching Mingkai Zheng, Shan You, Lang Huang, Fei Wang, Chen Qian, Chang Xu
PDF
SimMIM: A Simple Framework for Masked Image Modeling Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu
PDF
Simple but Effective: CLIP Embeddings for Embodied AI Apoorv Khandelwal, Luca Weihs, Roozbeh Mottaghi, Aniruddha Kembhavi
PDF
Simple Multi-Dataset Detection Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl
PDF
SimT: Handling Open-Set Noise for Domain Adaptive Semantic Segmentation Xiaoqing Guo, Jie Liu, Tongliang Liu, Yixuan Yuan
PDF
Simulated Adversarial Testing of Face Recognition Models Nataniel Ruiz, Adam Kortylewski, Weichao Qiu, Cihang Xie, Sarah Adel Bargal, Alan Yuille, Stan Sclaroff
PDF
SimVP: Simpler yet Better Video Prediction Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li
PDF
SimVQA: Exploring Simulated Environments for Visual Question Answering Paola Cascante-Bonilla, Hui Wu, Letao Wang, Rogerio S. Feris, Vicente Ordonez
PDF
Single-Domain Generalized Object Detection in Urban Scene via Cyclic-Disentangled Self-Distillation Aming Wu, Cheng Deng
PDF
Single-Photon Structured Light Varun Sundar, Sizhuo Ma, Aswin C. Sankaranarayanan, Mohit Gupta
PDF
Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data Nikolay Patakin, Anna Vorontsova, Mikhail Artemyev, Anton Konushin
PDF
Single-Stage Is Enough: Multi-Person Absolute 3D Pose Estimation Lei Jin, Chenyang Xu, Xiaojuan Wang, Yabo Xiao, Yandong Guo, Xuecheng Nie, Jian Zhao
PDF
SIOD: Single Instance Annotated per Category per Image for Object Detection Hanjun Li, Xingjia Pan, Ke Yan, Fan Tang, Wei-Shi Zheng
PDF
Sketch3T: Test-Time Training for Zero-Shot SBIR Aneeshan Sain, Ayan Kumar Bhunia, Vaishnav Potlapalli, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song
PDF
SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches Yu Zeng, Zhe Lin, Vishal M. Patel
PDF
Sketching Without Worrying: Noise-Tolerant Sketch-Based Image Retrieval Ayan Kumar Bhunia, Subhadeep Koley, Abdullah Faiz Ur Rahman Khilji, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song
PDF
SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters Albert Mosella-Montoro, Javier Ruiz-Hidalgo
PDF
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos Salar Hosseini Khorasgani, Yuxuan Chen, Florian Shkurti
PDF
Slimmable Domain Adaptation Rang Meng, Weijie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, Shiliang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang
PDF
Slot-VPS: Object-Centric Representation Learning for Video Panoptic Segmentation Yi Zhou, Hui Zhang, Hana Lee, Shuyang Sun, Pingjun Li, Yangguang Zhu, ByungIn Yoo, Xiaojuan Qi, Jae-Joon Han
PDF
SmartAdapt: Multi-Branch Object Detection Framework for Videos on Mobiles Ran Xu, Fangzhou Mu, Jayoung Lee, Preeti Mukherjee, Somali Chaterji, Saurabh Bagchi, Yin Li
PDF
SmartPortraits: Depth Powered Handheld Smartphone Dataset of Human Portraits for State Estimation, Reconstruction and Synthesis Anastasiia Kornilova, Marsel Faizullin, Konstantin Pakulev, Andrey Sadkov, Denis Kukushkin, Azat Akhmetyanov, Timur Akhtyamov, Hekmat Taherinejad, Gonzalo Ferrer
PDF
Smooth Maximum Unit: Smooth Activation Function for Deep Networks Using Smoothing Maximum Technique Koushik Biswas, Sandeep Kumar, Shilpak Banerjee, Ashish Kumar Pandey
PDF
Smooth-Swap: A Simple Enhancement for Face-Swapping with Smoothness Jiseob Kim, Jihoon Lee, Byoung-Tak Zhang
PDF
SMPL-A: Modeling Person-Specific Deformable Anatomy Hengtao Guo, Benjamin Planche, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu
PDF
SNR-Aware Low-Light Image Enhancement Xiaogang Xu, Ruixing Wang, Chi-Wing Fu, Jiaya Jia
PDF
SNUG: Self-Supervised Neural Dynamic Garments Igor Santesteban, Miguel A. Otaduy, Dan Casas
PDF
SoftCollage: A Differentiable Probabilistic Tree Generator for Image Collage Jiahao Yu, Li Chen, Mingrui Zhang, Mading Li
PDF
SoftGroup for 3D Instance Segmentation on Point Clouds Thang Vu, Kookhoi Kim, Tung M. Luu, Thanh Nguyen, Chang D. Yoo
PDF
SOMSI: Spherical Novel View Synthesis with Soft Occlusion Multi-Sphere Images Tewodros Habtegebrial, Christiano Gava, Marcel Rogge, Didier Stricker, Varun Jampani
PDF
Sound and Visual Representation Learning with Multiple Pretraining Tasks Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool
PDF
Sound-Guided Semantic Image Manipulation Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chanyoung Kim, Jinkyu Kim, Sangpil Kim
PDF
Source-Free Domain Adaptation via Distribution Estimation Ning Ding, Yixing Xu, Yehui Tang, Chao Xu, Yunhe Wang, Dacheng Tao
PDF
Source-Free Object Detection by Learning to Overlook Domain Style Shuaifeng Li, Mao Ye, Xiatian Zhu, Lihua Zhou, Lin Xiong
PDF
SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing Jing Shi, Ning Xu, Haitian Zheng, Alex Smith, Jiebo Luo, Chenliang Xu
PDF
SPAct: Self-Supervised Privacy Preservation for Action Recognition Ishan Rajendrakumar Dave, Chen Chen, Mubarak Shah
PDF
SPAMs: Structured Implicit Parametric Models Pablo Palafox, Nikolaos Sarafianos, Tony Tung, Angela Dai
PDF
Sparse and Complete Latent Organization for Geospatial Semantic Segmentation Fengyu Yang, Chenyang Ma
PDF
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion Xiaopei Wu, Liang Peng, Honghui Yang, Liang Xie, Chenxi Huang, Chengqi Deng, Haifeng Liu, Deng Cai
PDF
Sparse Instance Activation for Real-Time Instance Segmentation Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Wenqiang Zhang, Qian Zhang, Chang Huang, Zhaoxiang Zhang, Wenyu Liu
PDF
Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning Jiahao Xia, Weiwei Qu, Wenjian Huang, Jianguo Zhang, Xi Wang, Min Xu
PDF
Sparse Non-Local CRF Olga Veksler, Yuri Boykov
PDF
Sparse Object-Level Supervision for Instance Segmentation with Pixel Embeddings Adrian Wolny, Qin Yu, Constantin Pape, Anna Kreshuk
PDF
Sparse to Dense Dynamic 3D Facial Expression Generation Naima Otberdout, Claudio Ferrari, Mohamed Daoudi, Stefano Berretti, Alberto Del Bimbo
PDF
Spatial Commonsense Graph for Object Localisation in Partial Scenes Francesco Giuliari, Geri Skenderi, Marco Cristani, Yiming Wang, Alessio Del Bue
PDF
Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation Shuying Liu, Wenbin Wu, Jiaxian Wu, Yue Lin
PDF
Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning Mengshun Hu, Kui Jiang, Liang Liao, Jing Xiao, Junjun Jiang, Zheng Wang
PDF
Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing Gaurav Parmar, Yijun Li, Jingwan Lu, Richard Zhang, Jun-Yan Zhu, Krishna Kumar Singh
PDF
Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction Chongyang Zhong, Lei Hu, Zihao Zhang, Yongjing Ye, Shihong Xia
PDF
Spatio-Temporal Relation Modeling for Few-Shot Action Recognition Anirudh Thatipelli, Sanath Narayan, Salman Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Bernard Ghanem
PDF
Spectral Unsupervised Domain Adaptation for Visual Recognition Jingyi Zhang, Jiaxing Huang, Zichen Tian, Shijian Lu
PDF
Speech Driven Tongue Animation Salvador Medina, Denis Tome, Carsten Stoll, Mark Tiede, Kevin Munhall, Alexander G. Hauptmann, Iain Matthews
PDF
Speed up Object Detection on Gigapixel-Level Images with Patch Arrangement Jiahao Fan, Huabin Liu, Wenjie Yang, John See, Aixin Zhang, Weiyao Lin
PDF
SphereSR: 360deg Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation Youngho Yoon, Inchul Chung, Lin Wang, Kuk-Jin Yoon
PDF
SphericGAN: Semi-Supervised Hyper-Spherical Generative Adversarial Networks for Fine-Grained Image Synthesis Tianyi Chen, Yunfei Zhang, Xiaoyang Huo, Si Wu, Yong Xu, Hau San Wong
PDF
Spiking Transformers for Event-Based Single Object Tracking Jiqing Zhang, Bo Dong, Haiwei Zhang, Jianchuan Ding, Felix Heide, Baocai Yin, Xin Yang
PDF
Splicing ViT Features for Semantic Appearance Transfer Narek Tumanyan, Omer Bar-Tal, Shai Bagon, Tali Dekel
PDF
Split Hierarchical Variational Compression Tom Ryder, Chen Zhang, Ning Kang, Shifeng Zhang
PDF
SplitNets: Designing Neural Architectures for Efficient Distributed Computing on Head-Mounted Systems Xin Dong, Barbara De Salvo, Meng Li, Chiao Liu, Zhongnan Qu, H.T. Kung, Ziyun Li
PDF
SS3D: Sparsely-Supervised 3D Object Detection from Point Cloud Chuandong Liu, Chenqiang Gao, Fangcen Liu, Jiang Liu, Deyu Meng, Xinbo Gao
PDF
ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation Duolikun Danier, Fan Zhang, David Bull
PDF
ST++: Make Self-Training Work Better for Semi-Supervised Semantic Segmentation Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao
PDF
Stability-Driven Contact Reconstruction from Monocular Color Images Zimeng Zhao, Binghui Zuo, Wei Xie, Yangang Wang
PDF
Stable Long-Term Recurrent Video Super-Resolution Benjamin Naoto Chiche, Arnaud Woiselle, Joana Frontera-Pons, Jean-Luc Starck
PDF
Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation Xingning Dong, Tian Gan, Xuemeng Song, Jianlong Wu, Yuan Cheng, Liqiang Nie
PDF
Stand-Alone Inter-Frame Attention in Video Models Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Jiebo Luo, Tao Mei
PDF
STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes Peishan Cong, Xinge Zhu, Feng Qiao, Yiming Ren, Xidong Peng, Yuenan Hou, Lan Xu, Ruigang Yang, Dinesh Manocha, Yuexin Ma
PDF
Stereo Depth from Events Cameras: Concentrate and Focus on the Future Yeongwoo Nam, Mohammad Mostafavi, Kuk-Jin Yoon, Jonghyun Choi
PDF
Stereo Magnification with Multi-Layer Images Taras Khakhulin, Denis Korzhenkov, Pavel Solovev, Gleb Sterkin, Andrei-Timotei Ardelean, Victor Lempitsky
PDF
Stereoscopic Universal Perturbations Across Different Architectures and Datasets Zachary Berger, Parth Agrawal, Tian Yu Liu, Stefano Soatto, Alex Wong
PDF
Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models Feng Cheng, Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Li, Wei Xia
PDF
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion Tianpei Gu, Guangyi Chen, Junlong Li, Chunze Lin, Yongming Rao, Jie Zhou, Jiwen Lu
PDF
Stochastic Variance Reduced Ensemble Adversarial Attack for Boosting the Adversarial Transferability Yifeng Xiong, Jiadong Lin, Min Zhang, John E. Hopcroft, Kun He
PDF
Stratified Transformer for 3D Point Cloud Segmentation Xin Lai, Jianhui Liu, Li Jiang, Liwei Wang, Hengshuang Zhao, Shu Liu, Xiaojuan Qi, Jiaya Jia
PDF
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
PDF
Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation Deyi Ji, Haoran Wang, Mingyuan Tao, Jianqiang Huang, Xian-Sheng Hua, Hongtao Lu
PDF
Structure-Aware Flow Generation for Human Body Reshaping Jianqiang Ren, Yuan Yao, Biwen Lei, Miaomiao Cui, Xuansong Xie
PDF
Structure-Aware Motion Transfer with Deformable Anchor Model Jiale Tao, Biao Wang, Borun Xu, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
PDF
Structured Local Radiance Fields for Human Avatar Modeling Zerong Zheng, Han Huang, Tao Yu, Hongwen Zhang, Yandong Guo, Yebin Liu
PDF
Structured Sparse R-CNN for Direct Scene Graph Generation Yao Teng, Limin Wang
PDF
Style Neophile: Constantly Seeking Novel Styles for Domain Generalization Juwon Kang, Sohyun Lee, Namyup Kim, Suha Kwak
PDF
Style Transformer for Image Inversion and Editing Xueqi Hu, Qiusheng Huang, Zhengyi Shi, Siyuan Li, Changxin Gao, Li Sun, Qingli Li
PDF
Style-Based Global Appearance Flow for Virtual Try-on Sen He, Yi-Zhe Song, Tao Xiang
PDF
Style-ERD: Responsive and Coherent Online Motion Style Transfer Tianxin Tao, Xiaohang Zhan, Zhongquan Chen, Michiel van de Panne
PDF
Style-Structure Disentangled Features and Normalizing Flows for Diverse Icon Colorization Yuan-kui Li, Yun-Hsuan Lien, Yu-Shuen Wang
PDF
Styleformer: Transformer Based Generative Adversarial Networks with Style Vector Jeeseung Park, Younggeun Kim
PDF
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2 Ivan Skorokhodov, Sergey Tulyakov, Mohamed Elhoseiny
PDF
StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions Lukas Höllein, Justin Johnson, Matthias Nießner
PDF
StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation Roy Or-El, Xuan Luo, Mengyi Shan, Eli Shechtman, Jeong Joon Park, Ira Kemelmacher-Shlizerman
PDF
StyleSwin: Transformer-Based GAN for High-Resolution Image Generation Bowen Zhang, Shuyang Gu, Bo Zhang, Jianmin Bao, Dong Chen, Fang Wen, Yong Wang, Baining Guo
PDF
StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis Zhiheng Li, Martin Renqiang Min, Kai Li, Chenliang Xu
PDF
StylizedNeRF: Consistent 3D Scene Stylization as Stylized NeRF via 2D-3D Mutual Learning Yi-Hua Huang, Yue He, Yu-Jie Yuan, Yu-Kun Lai, Lin Gao
PDF
StyTr2: Image Style Transfer with Transformers Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu
PDF
Sub-Word Level Lip Reading with Visual Attention K R Prajwal, Triantafyllos Afouras, Andrew Zisserman
PDF
Subspace Adversarial Training Tao Li, Yingwen Wu, Sizhe Chen, Kun Fang, Xiaolin Huang
PDF
Super-Fibonacci Spirals: Fast, Low-Discrepancy Sampling of SO(3) Marc Alexa
PDF
Surface Reconstruction from Point Clouds by Learning Predictive Context Priors Baorui Ma, Yu-Shen Liu, Matthias Zwicker, Zhizhong Han
PDF
Surface Representation for Point Clouds Haoxi Ran, Jun Liu, Chengjie Wang
PDF
Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis Tianhan Xu, Yasuhiro Fujita, Eiichi Matsumoto
PDF
SurfEmb: Dense and Continuous Correspondence Distributions for Object Pose Estimation with Learnt Surface Embeddings Rasmus Laurvig Haugaard, Anders Glent Buch
PDF
Surpassing the Human Accuracy: Detecting Gallbladder Cancer from USG Images with Curriculum Learning Soumen Basu, Mayank Gupta, Pratyaksha Rana, Pankaj Gupta, Chetan Arora
PDF
SVIP: Sequence VerIfication for Procedures in Videos Yicheng Qian, Weixin Luo, Dongze Lian, Xu Tang, Peilin Zhao, Shenghua Gao
PDF
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering Vipul Gupta, Zhuowan Li, Adam Kortylewski, Chenyu Zhang, Yingwei Li, Alan Yuille
PDF
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization Zhihui Lin, Tianyu Yang, Maomao Li, Ziyu Wang, Chun Yuan, Wenhao Jiang, Wei Liu
PDF
Swin Transformer V2: Scaling up Capacity and Resolution Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo
PDF
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning Kevin Lin, Linjie Li, Chung-Ching Lin, Faisal Ahmed, Zhe Gan, Zicheng Liu, Yumao Lu, Lijuan Wang
PDF
SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin
PDF
Sylph: A Hypernetwork Framework for Incremental Few-Shot Object Detection Li Yin, Juan M. Perez-Rua, Kevin J. Liang
PDF
Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation Nathaniel Merrill, Yuliang Guo, Xingxing Zuo, Xinyu Huang, Stefan Leutenegger, Xi Peng, Liu Ren, Guoquan Huang
PDF
Symmetry-Aware Neural Architecture for Embodied Visual Exploration Shuang Liu, Takayuki Okatani
PDF
Syntax-Aware Network for Handwritten Mathematical Expression Recognition Ye Yuan, Xiao Liu, Wondimu Dikubab, Hui Liu, Zhilong Ji, Zhongqin Wu, Xiang Bai
PDF
Synthetic Aperture Imaging with Events and Frames Wei Liao, Xiang Zhang, Lei Yu, Shijie Lin, Wen Yang, Ning Qiao
PDF
Synthetic Generation of Face Videos with Plethysmograph Physiology Zhen Wang, Yunhao Ba, Pradyumna Chari, Oyku Deniz Bozkurt, Gianna Brown, Parth Patwa, Niranjan Vaddi, Laleh Jalilian, Achuta Kadambi
PDF
TableFormer: Table Structure Understanding with Transformers Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar
PDF
Talking Face Generation with Multilingual TTS Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim
PDF
Target-Aware Dual Adversarial Learning and a Multi-Scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection Jinyuan Liu, Xin Fan, Zhanbo Huang, Guanyao Wu, Risheng Liu, Wei Zhong, Zhongxuan Luo
PDF
Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection Jiaxi Wu, Jiaxin Chen, Mengzhe He, Yiru Wang, Bo Li, Bingqi Ma, Weihao Gan, Wei Wu, Yali Wang, Di Huang
PDF
Targeted Supervised Contrastive Learning for Long-Tailed Recognition Tianhong Li, Peng Cao, Yuan Yuan, Lijie Fan, Yuzhe Yang, Rogerio S. Feris, Piotr Indyk, Dina Katabi
PDF
Task Adaptive Parameter Sharing for Multi-Task Learning Matthew Wallingford, Hao Li, Alessandro Achille, Avinash Ravichandran, Charless Fowlkes, Rahul Bhotika, Stefano Soatto
PDF
Task Decoupled Framework for Reference-Based Super-Resolution Yixuan Huang, Xiaoyun Zhang, Yu Fu, Siheng Chen, Ya Zhang, Yan-Feng Wang, Dazhi He
PDF
Task Discrepancy Maximization for Fine-Grained Few-Shot Classification SuBeen Lee, WonJun Moon, Jae-Pil Heo
PDF
Task-Adaptive Negative Envision for Few-Shot Open-Set Recognition Shiyuan Huang, Jiawei Ma, Guangxing Han, Shih-Fu Chang
PDF
Task-Specific Inconsistency Alignment for Domain Adaptive Object Detection Liang Zhao, Limin Wang
PDF
Task2Sim: Towards Effective Pre-Training and Transfer from Synthetic Data Samarth Mishra, Rameswar Panda, Cheng Perng Phoo, Chun-Fu Chen, Leonid Karlinsky, Kate Saenko, Venkatesh Saligrama, Rogerio S. Feris
PDF
TCTrack: Temporal Contexts for Aerial Tracking Ziang Cao, Ziyuan Huang, Liang Pan, Shiwei Zhang, Ziwei Liu, Changhong Fu
PDF
TeachAugment: Data Augmentation Optimization Using Teacher Knowledge Teppei Suzuki
PDF
Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit
PDF
Temporal Alignment Networks for Long-Term Video Tengda Han, Weidi Xie, Andrew Zisserman
PDF
Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification Wei Wu, Jiawei Liu, Kecheng Zheng, Qibin Sun, Zheng-Jun Zha
PDF
Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations Aishik Konwer, Xuan Xu, Joseph Bae, Chao Chen, Prateek Prasanna
PDF
Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation Zhenguang Liu, Runyang Feng, Haoming Chen, Shuang Wu, Yixing Gao, Yunjun Gao, Xiang Wang
PDF
Temporally Efficient Vision Transformer for Video Instance Segmentation Shusheng Yang, Xinggang Wang, Yu Li, Yuxin Fang, Jiemin Fang, Wenyu Liu, Xun Zhao, Ying Shan
PDF
TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates You Xie, Huiqi Mao, Angela Yao, Nils Thuerey
PDF
Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation Zhaoyang Zeng, Yongsheng Luo, Zhenhua Liu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen
PDF
Text Spotting Transformers Xiang Zhang, Yongwen Su, Subarna Tripathi, Zhuowen Tu
PDF
Text to Image Generation with Semantic-Spatial Aware GAN Wentong Liao, Kai Hu, Michael Ying Yang, Bodo Rosenhahn
PDF
Text-to-Image Synthesis Based on Object-Guided Joint-Decoding Transformer Fuxiang Wu, Liu Liu, Fusheng Hao, Fengxiang He, Jun Cheng
PDF
Text2Mesh: Text-Driven Neural Stylization for Meshes Oscar Michel, Roi Bar-On, Richard Liu, Sagie Benaim, Rana Hanocka
PDF
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization Manuel Kolmet, Qunjie Zhou, Aljoša Ošep, Laura Leal-Taixé
PDF
Texture-Based Error Analysis for Image Super-Resolution Salma Abdel Magid, Zudi Lin, Donglai Wei, Yulun Zhang, Jinjin Gu, Hanspeter Pfister
PDF
The Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift Sara Beery, Guanhang Wu, Trevor Edwards, Filip Pavetic, Bo Majewski, Shreyasee Mukherjee, Stanley Chan, John Morgan, Vivek Rathod, Jonathan Huang
PDF
The DEVIL Is in the Details: A Diagnostic Evaluation Benchmark for Video Inpainting Ryan Szeto, Jason J. Corso
PDF
The Devil Is in the Details: Window-Based Attention for Image Compression Renjie Zou, Chunfeng Song, Zhaoxiang Zhang
PDF
The Devil Is in the Labels: Noisy Label Correction for Robust Scene Graph Generation Lin Li, Long Chen, Yifeng Huang, Zhimeng Zhang, Songyang Zhang, Jun Xiao
PDF
The Devil Is in the Margin: Margin-Based Label Smoothing for Network Calibration Bingyuan Liu, Ismail Ben Ayed, Adrian Galdran, Jose Dolz
PDF
The Devil Is in the Pose: Ambiguity-Free 3D Rotation-Invariant Learning via Pose-Aware Convolution Ronghan Chen, Yang Cong
PDF
The Flag Median and FlagIRLS Nathan Mankovich, Emily J. King, Chris Peterson, Michael Kirby
PDF
The Implicit Values of a Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement Ilya Chugunov, Yuxuan Zhang, Zhihao Xia, Xuaner Zhang, Jiawen Chen, Felix Heide
PDF
The Majority Can Help the Minority: Context-Rich Minority Oversampling for Long-Tailed Classification Seulki Park, Youngkyu Hong, Byeongho Heo, Sangdoo Yun, Jin Young Choi
PDF
The Neurally-Guided Shape Parser: Grammar-Based Labeling of 3D Shape Regions with Approximate Inference R. Kenny Jones, Aalia Habib, Rana Hanocka, Daniel Ritchie
PDF
The Norm Must Go on: Dynamic Unsupervised Domain Adaptation by Normalization M. Jehanzeb Mirza, Jakub Micorek, Horst Possegger, Horst Bischof
PDF
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy Tianlong Chen, Zhenyu Zhang, Yu Cheng, Ahmed Awadallah, Zhangyang Wang
PDF
The Probabilistic Normal Epipolar Constraint for Frame-to-Frame Rotation Optimization Under Uncertain Feature Positions Dominik Muhle, Lukas Koestler, Nikolaus Demmel, Florian Bernard, Daniel Cremers
PDF
The Two Dimensions of Worst-Case Training and Their Integrated Effect for Out-of-Domain Generalization Zeyi Huang, Haohan Wang, Dong Huang, Yong Jae Lee, Eric P. Xing
PDF
The Wanderings of Odysseus in 3D Scenes Yan Zhang, Siyu Tang
PDF
Thin-Plate Spline Motion Model for Image Animation Jian Zhao, Hui Zhang
PDF
Think Global, Act Local: Dual-Scale Graph Transformer for Vision-and-Language Navigation Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
PDF
Think Twice Before Detecting GAN-Generated Fake Images from Their Spectral Domain Imprints Chengdong Dong, Ajay Kumar, Eryun Liu
PDF
Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds Minhyun Lee, Dongseob Kim, Hyunjung Shim
PDF
Time Lens++: Event-Based Frame Interpolation with Parametric Non-Linear Flow and Multi-Scale Fusion Stepan Tulyakov, Alfredo Bochicchio, Daniel Gehrig, Stamatios Georgoulis, Yuanyou Li, Davide Scaramuzza
PDF
Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving Peixuan Li, Jieyu Jin
PDF
TimeReplayer: Unlocking the Potential of Event Cameras for Video Interpolation Weihua He, Kaichao You, Zhendong Qiao, Xu Jia, Ziyang Zhang, Wenhui Wang, Huchuan Lu, Yaoyuan Wang, Jianxing Liao
PDF
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization Adjoint with Moving Speed Shian Du, Yihong Luo, Wei Chen, Jian Xu, Delu Zeng
PDF
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation Wenqiang Zhang, Zilong Huang, Guozhong Luo, Tao Chen, Xinggang Wang, Wenyu Liu, Gang Yu, Chunhua Shen
PDF
Topologically-Aware Deformation Fields for Single-View 3D Reconstruction Shivam Duggal, Deepak Pathak
PDF
Topology Preserving Local Road Network Estimation from Single Onboard Camera Image Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc Van Gool
PDF
Topology-Preserving Shape Reconstruction and Registration via Neural Diffeomorphic Flow Shanlin Sun, Kun Han, Deying Kong, Hao Tang, Xiangyi Yan, Xiaohui Xie
PDF
Total Variation Optimization Layers for Computer Vision Raymond A. Yeh, Yuan-Ting Hu, Zhongzheng Ren, Alexander G. Schwing
PDF
Toward Fast, Flexible, and Robust Low-Light Image Enhancement Long Ma, Tengyu Ma, Risheng Liu, Xin Fan, Zhongxuan Luo
PDF
Toward Practical Monocular Indoor Depth Estimation Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su
PDF
Towards Accurate Facial Landmark Detection via Cascaded Transformers Hui Li, Zidong Guo, Seon-Min Rhee, Seungju Han, Jae-Joon Han
PDF
Towards an End-to-End Framework for Flow-Guided Video Inpainting Zhen Li, Cheng-Ze Lu, Jianhua Qin, Chun-Le Guo, Ming-Ming Cheng
PDF
Towards Better Plasticity-Stability Trade-Off in Incremental Learning: A Simple Linear Connector Guoliang Lin, Hanlu Chu, Hanjiang Lai
PDF
Towards Better Understanding Attribution Methods Sukrut Rao, Moritz Böhle, Bernt Schiele
PDF
Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence Zhihong Pan, Baopu Li, Dongliang He, Mingde Yao, Wenhao Wu, Tianwei Lin, Xin Li, Errui Ding
PDF
Towards Data-Free Model Stealing in a Hard Label Setting Sunandini Sanyal, Sravanti Addepalli, R. Venkatesh Babu
PDF
Towards Discovering the Effectiveness of Moderately Confident Samples for Semi-Supervised Learning Hui Tang, Kui Jia
PDF
Towards Discriminative Representation: Multi-View Trajectory Contrastive Learning for Online Multi-Object Tracking En Yu, Zhuoling Li, Shoudong Han
PDF
Towards Diverse and Natural Scene-Aware 3D Human Motion Synthesis Jingbo Wang, Yu Rong, Jingyuan Liu, Sijie Yan, Dahua Lin, Bo Dai
PDF
Towards Driving-Oriented Metric for Lane Detection Models Takami Sato, Qi Alfred Chen
PDF
Towards Efficient and Scalable Sharpness-Aware Minimization Yong Liu, Siqi Mai, Xiangning Chen, Cho-Jui Hsieh, Yang You
PDF
Towards Efficient Data Free Black-Box Adversarial Attack Jie Zhang, Bo Li, Jianghe Xu, Shuang Wu, Shouhong Ding, Lei Zhang, Chao Wu
PDF
Towards End-to-End Unified Scene Text Detection and Layout Analysis Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis
PDF
Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic Segmentation Binhui Xie, Longhui Yuan, Shuang Li, Chi Harold Liu, Xinjing Cheng
PDF
Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem
PDF
Towards Implicit Text-Guided 3D Shape Generation Zhengzhe Liu, Yi Wang, Xiaojuan Qi, Chi-Wing Fu
PDF
Towards Language-Free Training for Text-to-Image Generation Yufan Zhou, Ruiyi Zhang, Changyou Chen, Chunyuan Li, Chris Tensmeyer, Tong Yu, Jiuxiang Gu, Jinhui Xu, Tong Sun
PDF
Towards Layer-Wise Image Vectorization Xu Ma, Yuqian Zhou, Xingqian Xu, Bin Sun, Valerii Filev, Nikita Orlov, Yun Fu, Humphrey Shi
PDF
Towards Low-Cost and Efficient Malaria Detection Waqas Sultani, Wajahat Nawaz, Syed Javed, Muhammad Sohail Danish, Asma Saadia, Mohsen Ali
PDF
Towards Multi-Domain Single Image Dehazing via Test-Time Training Huan Liu, Zijun Wu, Liangyan Li, Sadaf Salehkalaibar, Jun Chen, Keyan Wang
PDF
Towards Multimodal Depth Estimation from Light Fields Titus Leistner, Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother
PDF
Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation Jing Li, Junsong Fan, Zhaoxiang Zhang
PDF
Towards Practical Certifiable Patch Defense with Vision Transformer Zhaoyu Chen, Bo Li, Jianghe Xu, Shuang Wu, Shouhong Ding, Wenqiang Zhang
PDF
Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks Xiangyu Qi, Tinghao Xie, Ruizhe Pan, Jifeng Zhu, Yong Yang, Kai Bu
PDF
Towards Principled Disentanglement for Domain Generalization Hanlin Zhang, Yi-Fan Zhang, Weiyang Liu, Adrian Weller, Bernhard Schölkopf, Eric P. Xing
PDF
Towards Real-World Navigation with Deep Differentiable Planners Shu Ishida, João F. Henriques
PDF
Towards Robust Adaptive Object Detection Under Noisy Annotations Xinyu Liu, Wuyang Li, Qiushi Yang, Baopu Li, Yixuan Yuan
PDF
Towards Robust and Adaptive Motion Forecasting: A Causal Representation Perspective Yuejiang Liu, Riccardo Cadei, Jonas Schweizer, Sherwin Bahmani, Alexandre Alahi
PDF
Towards Robust and Reproducible Active Learning Using Neural Networks Prateek Munjal, Nasir Hayat, Munawar Hayat, Jamshid Sourati, Shadab Khan
PDF
Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond Yi Yu, Wenhan Yang, Yap-Peng Tan, Alex C. Kot
PDF
Towards Robust Vision Transformer Xiaofeng Mao, Gege Qi, Yuefeng Chen, Xiaodan Li, Ranjie Duan, Shaokai Ye, Yuan He, Hui Xue
PDF
Towards Semi-Supervised Deep Facial Expression Recognition with an Adaptive Confidence Margin Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao
PDF
Towards Total Recall in Industrial Anomaly Detection Karsten Roth, Latha Pemula, Joaquin Zepeda, Bernhard Schölkopf, Thomas Brox, Peter Gehler
PDF
Towards Understanding Adversarial Robustness of Optical Flow Networks Simon Schrodi, Tonmoy Saikia, Thomas Brox
PDF
Towards Unsupervised Domain Generalization Xingxuan Zhang, Linjun Zhou, Renzhe Xu, Peng Cui, Zheyan Shen, Haoxin Liu
PDF
Towards Weakly-Supervised Text Spotting Using a Multi-Task Transformer Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona
PDF
TrackFormer: Multi-Object Tracking with Transformers Tim Meinhardt, Alexander Kirillov, Laura Leal-Taixé, Christoph Feichtenhofer
PDF
Tracking People by Predicting 3D Appearance, Location and Pose Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik
PDF
Training High-Performance Low-Latency Spiking Neural Networks by Differentiation on Spike Representation Qingyan Meng, Mingqing Xiao, Shen Yan, Yisen Wang, Zhouchen Lin, Zhi-Quan Luo
PDF
Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer Weixiang Hong, Jiangwei Lao, Wang Ren, Jian Wang, Jingdong Chen, Wei Chu
PDF
Training Quantised Neural Networks with STE Variants: The Additive Noise Annealing Algorithm Matteo Spallanzani, Gian Paolo Leonardi, Luca Benini
PDF
Training-Free Transformer Architecture Search Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Xing Sun, Yonghong Tian, Jie Chen, Rongrong Ji
PDF
Trajectory Optimization for Physics-Based Reconstruction of 3D Human Pose from Monocular Video Erik Gärtner, Mykhaylo Andriluka, Hongyi Xu, Cristian Sminchisescu
PDF
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing Yanbo Xu, Yueqin Yin, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, Wayne Wu
PDF
Transferability Estimation Using Bhattacharyya Class Separability Michal Pándy, Andrea Agostinelli, Jasper Uijlings, Vittorio Ferrari, Thomas Mensink
PDF
Transferability Metrics for Selecting Source Model Ensembles Andrea Agostinelli, Jasper Uijlings, Thomas Mensink, Vittorio Ferrari
PDF
Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan
PDF
TransforMatcher: Match-to-Match Attention for Semantic Correspondence Seungwook Kim, Juhong Min, Minsu Cho
PDF
Transformer Based Line Segment Classifier with Image Context for Real-Time Vanishing Point Detection in Manhattan World Xin Tong, Xianghua Ying, Yongjie Shi, Ruibin Wang, Jinfa Yang
PDF
Transformer Tracking with Cyclic Shifting Window Attention Zikai Song, Junqing Yu, Yi-Ping Phoebe Chen, Wei Yang
PDF
Transformer-Empowered Multi-Scale Contextual Matching and Aggregation for Multi-Contrast MRI Super-Resolution Guangyuan Li, Jun Lv, Yapeng Tian, Qi Dou, Chengyan Wang, Chenliang Xu, Jing Qin
PDF
Transforming Model Prediction for Tracking Christoph Mayer, Martin Danelljan, Goutam Bhat, Matthieu Paul, Danda Pani Paudel, Fisher Yu, Luc Van Gool
PDF
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers Xuyang Bai, Zeyu Hu, Xinge Zhu, Qingqiu Huang, Yilun Chen, Hongbo Fu, Chiew-Lan Tai
PDF
TransGeo: Transformer Is All You Need for Cross-View Image Geo-Localization Sijie Zhu, Mubarak Shah, Chen Chen
PDF
TransMix: Attend to Mix for Vision Transformers Jie-Neng Chen, Shuyang Sun, Ju He, Philip H.S. Torr, Alan Yuille, Song Bai
PDF
TransMVSNet: Global Context-Aware Multi-View Stereo Network with Transformers Yikang Ding, Wentao Yuan, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu
PDF
TransRAC: Encoding Multi-Scale Temporal Correlation with Transformers for Repetitive Action Counting Huazhang Hu, Sixun Dong, Yiqun Zhao, Dongze Lian, Zhengxin Li, Shenghua Gao
PDF
TransRank: Self-Supervised Video Representation Learning via Ranking-Based Transformation Recognition Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin
PDF
TransVPR: Transformer-Based Place Recognition with Multi-Level Attention Aggregation Ruotong Wang, Yanqing Shen, Weiliang Zuo, Sanping Zhou, Nanning Zheng
PDF
TransWeather: Transformer-Based Restoration of Images Degraded by Adverse Weather Conditions Jeya Maria Jose Valanarasu, Rajeev Yasarla, Vishal M. Patel
PDF
Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation Zhiyuan Liang, Tiancai Wang, Xiangyu Zhang, Jian Sun, Jianbing Shen
PDF
Trustworthy Long-Tailed Classification Bolian Li, Zongbo Han, Haining Li, Huazhu Fu, Changqing Zhang
PDF
TubeDETR: Spatio-Temporal Video Grounding with Transformers Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid
PDF
TubeFormer-DeepLab: Video Mask Transformer Dahun Kim, Jun Xie, Huiyu Wang, Siyuan Qiao, Qihang Yu, Hong-Seok Kim, Hartwig Adam, In So Kweon, Liang-Chieh Chen
PDF
TubeR: Tubelet Transformer for Video Action Detection Jiaojiao Zhao, Yanyi Zhang, Xinyu Li, Hao Chen, Bing Shuai, Mingze Xu, Chunhui Liu, Kaustav Kundu, Yuanjun Xiong, Davide Modolo, Ivan Marsic, Cees G. M. Snoek, Joseph Tighe
PDF
TVConv: Efficient Translation Variant Convolution for Layout-Aware Visual Processing Jierun Chen, Tianlang He, Weipeng Zhuo, Li Ma, Sangtae Ha, S.-H. Gary Chan
PDF
TWIST: Two-Way Inter-Label Self-Training for Semi-Supervised 3D Instance Segmentation Ruihang Chu, Xiaoqing Ye, Zhengzhe Liu, Xiao Tan, Xiaojuan Qi, Chi-Wing Fu, Jiaya Jia
PDF
Two Coupled Rejection Metrics Can Tell Adversarial Examples Apart Tianyu Pang, Huishuai Zhang, Di He, Yinpeng Dong, Hang Su, Wei Chen, Jun Zhu, Tie-Yan Liu
PDF
UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah
PDF
UBoCo: Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection Hyolim Kang, Jinwoo Kim, Taehyun Kim, Seon Joo Kim
PDF
UCC: Uncertainty Guided Cross-Head Co-Training for Semi-Supervised Semantic Segmentation Jiashuo Fan, Bin Gao, Huan Jin, Lihui Jiang
PDF
UDA-COPE: Unsupervised Domain Adaptation for Category-Level Object Pose Estimation Taeyeop Lee, Byeong-Uk Lee, Inkyu Shin, Jaesung Choe, Ukcheol Shin, In So Kweon, Kuk-Jin Yoon
PDF
Uformer: A General U-Shaped Transformer for Image Restoration Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, Houqiang Li
PDF
UKPGAN: A General Self-Supervised Keypoint Detector Yang You, Wenhai Liu, Yanjie Ze, Yong-Lu Li, Weiming Wang, Cewu Lu
PDF
UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection Ye Liu, Siyuan Li, Yang Wu, Chang-Wen Chen, Ying Shan, Xiaohu Qie
PDF
Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu
PDF
Unbiased Teacher V2: Semi-Supervised Object Detection for Anchor-Free and Anchor-Based Detectors Yen-Cheng Liu, Chih-Yao Ma, Zsolt Kira
PDF
Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation Jogendra Nath Kundu, Siddharth Seth, Pradyumna Ym, Varun Jampani, Anirban Chakraborty, R. Venkatesh Babu
PDF
Uncertainty-Aware Deep Multi-View Photometric Stereo Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc Van Gool
PDF
Uncertainty-Guided Probabilistic Transformer for Complex Action Recognition Hongji Guo, Hanjing Wang, Qiang Ji
PDF
Understanding 3D Object Articulation in Internet Videos Shengyi Qian, Linyi Jin, Chris Rockwell, Siyi Chen, David F. Fouhey
PDF
Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training Theodoros Tsiligkaridis, Jay Roberts
PDF
Understanding Uncertainty Maps in Vision with Statistical Testing Jurijs Nazarovs, Zhichun Huang, Songwong Tasneeyapant, Rudrasis Chakraborty, Vikas Singh
PDF
Undoing the Damage of Label Shift for Cross-Domain Semantic Segmentation Yahao Liu, Jinhong Deng, Jiale Tao, Tong Chu, Lixin Duan, Wen Li
PDF
Uni-Perceiver: Pre-Training Unified Architecture for Generic Perception for Zero-Shot and Few-Shot Tasks Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Hongsheng Li, Xiaohua Wang, Jifeng Dai
PDF
Uni6D: A Unified CNN Framework Without Projection Breakdown for 6d Pose Estimation Xiaoke Jiang, Donghai Li, Hao Chen, Ye Zheng, Rui Zhao, Liwei Wu
PDF
UniCon: Combating Label Noise Through Uniform Selection and Contrastive Learning Nazmul Karim, Mamshad Nayeem Rizve, Nazanin Rahnavard, Ajmal Mian, Mubarak Shah
PDF
UniCoRN: A Unified Conditional Image Repainting Network Jimeng Sun, Shuchen Weng, Zheng Chang, Si Li, Boxin Shi
PDF
Unified Contrastive Learning in Image-Text-Label Space Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao
PDF
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression Xiaosu Zhu, Jingkuan Song, Lianli Gao, Feng Zheng, Heng Tao Shen
PDF
Unified Transformer Tracker for Object Tracking Fan Ma, Mike Zheng Shou, Linchao Zhu, Haoqi Fan, Yilei Xu, Yi Yang, Zhicheng Yan
PDF
Uniform Subdivision of Omnidirectional Camera Space for Efficient Spherical Stereo Matching Donghun Kang, Hyeonjoong Jang, Jungeon Lee, Chong-Min Kyung, Min H. Kim
PDF
Unifying Motion Deblurring and Frame Interpolation with Events Xiang Zhang, Lei Yu
PDF
Unifying Panoptic Segmentation for Autonomous Driving Oliver Zendel, Matthias Schörghuber, Bernhard Rainer, Markus Murschitz, Csaba Beleznai
PDF
Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression Qiang Li, Jingjing Wang, Zhaoliang Yao, Yachun Li, Pengju Yang, Jingwei Yan, Chunmao Wang, Shiliang Pu
PDF
UNIST: Unpaired Neural Implicit Shape Translation Network Qimin Chen, Johannes Merz, Aditya Sanghi, Hooman Shayani, Ali Mahdavi-Amiri, Hao Zhang
PDF
Universal Photometric Stereo Network Using Global Lighting Contexts Satoshi Ikehata
PDF
UniVIP: A Unified Framework for Self-Supervised Visual Pre-Training Zhaowen Li, Yousong Zhu, Fan Yang, Wei Li, Chaoyang Zhao, Yingying Chen, Zhiyang Chen, Jiahao Xie, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang
PDF
Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild Xuefeng Du, Xin Wang, Gabriel Gozum, Yixuan Li
PDF
Unleashing Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao
PDF
Unpaired Cartoon Image Synthesis via Gated Cycle Mapping Yifang Men, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie, Xian-Sheng Hua
PDF
Unpaired Deep Image Deraining Using Dual Contrastive Learning Xiang Chen, Jinshan Pan, Kui Jiang, Yufeng Li, Yufeng Huang, Caihua Kong, Longgang Dai, Zhentao Fan
PDF
Unseen Classes at a Later Time? No Problem Hari Chandana Kuchibhotla, Sumitra S Malagi, Shivam Chandhok, Vineeth N Balasubramanian
PDF
Unsupervised Action Segmentation by Joint Representation Learning and Online Clustering Sateesh Kumar, Sanjay Haresh, Awais Ahmed, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran
PDF
Unsupervised Deraining: Where Contrastive Learning Meets Self-Similarity Yuntong Ye, Changfeng Yu, Yi Chang, Lin Zhu, Xi-Le Zhao, Luxin Yan, Yonghong Tian
PDF
Unsupervised Domain Adaptation for Nighttime Aerial Tracking Junjie Ye, Changhong Fu, Guangze Zheng, Danda Pani Paudel, Guang Chen
PDF
Unsupervised Domain Generalization by Learning a Bridge Across Domains Sivan Harary, Eli Schwartz, Assaf Arbelle, Peter Staar, Shady Abu-Hussein, Elad Amrani, Roei Herzig, Amit Alfassy, Raja Giryes, Hilde Kuehne, Dina Katabi, Kate Saenko, Rogerio S. Feris, Leonid Karlinsky
PDF
Unsupervised Hierarchical Semantic Segmentation with Multiview Cosegmentation and Clustering Transformers Tsung-Wei Ke, Jyh-Jing Hwang, Yunhui Guo, Xudong Wang, Stella X. Yu
PDF
Unsupervised Homography Estimation with Coplanarity-Aware GAN Mingbo Hong, Yuhang Lu, Nianjin Ye, Chunyu Lin, Qijun Zhao, Shuaicheng Liu
PDF
Unsupervised Image-to-Image Translation with Generative Prior Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy
PDF
Unsupervised Learning of Accurate Siamese Tracking Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang
PDF
Unsupervised Learning of Debiased Representations with Pseudo-Attributes Seonguk Seo, Joon-Young Lee, Bohyung Han
PDF
Unsupervised Pre-Training for Temporal Action Localization Tasks Can Zhang, Tianyu Yang, Junwu Weng, Meng Cao, Jue Wang, Yuexian Zou
PDF
Unsupervised Representation Learning for Binary Networks by Joint Classifier Learning Dahyun Kim, Jonghyun Choi
PDF
Unsupervised Vision-and-Language Pre-Training via Retrieval-Based Multi-Granular Alignment Mingyang Zhou, Licheng Yu, Amanpreet Singh, Mengjiao Wang, Zhou Yu, Ning Zhang
PDF
Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships Chao Lou, Wenjuan Han, Yuhuan Lin, Zilong Zheng
PDF
Unsupervised Visual Representation Learning by Online Constrained K-Means Qi Qian, Yuanhong Xu, Juhua Hu, Hao Li, Rong Jin
PDF
UnweaveNet: Unweaving Activity Stories Will Price, Carl Vondrick, Dima Damen
PDF
Upright-Net: Learning Upright Orientation for 3D Point Cloud Xufang Pang, Feng Li, Ning Ding, Xiaopin Zhong
PDF
Urban Radiance Fields Konstantinos Rematas, Andrew Liu, Pratul P. Srinivasan, Jonathan T. Barron, Andrea Tagliasacchi, Thomas Funkhouser, Vittorio Ferrari
PDF
URetinex-Net: Retinex-Based Deep Unfolding Network for Low-Light Image Enhancement Wenhui Wu, Jian Weng, Pingping Zhang, Xu Wang, Wenhan Yang, Jianmin Jiang
PDF
Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework Shu Zhang, Ran Xu, Caiming Xiong, Chetan Ramaiah
PDF
Using 3D Topological Connectivity for Ghost Particle Reduction in Flow Reconstruction Christina Tsalicoglou, Thomas Rösgen
PDF
UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog Cheng Chen, Zhenshan Tan, Qingrong Cheng, Xin Jiang, Qun Liu, Yudong Zhu, Xiaodong Gu
PDF
V-Doc: Visual Questions Answers with Documents Yihao Ding, Zhe Huang, Runlin Wang, YanHang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han
PDF
V2C: Visual Voice Cloning Qi Chen, Mingkui Tan, Yuankai Qi, Jiaqiu Zhou, Yuanqing Li, Qi Wu
PDF
VALHALLA: Visual Hallucination for Machine Translation Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu Chen, Rogerio S. Feris, David Cox, Nuno Vasconcelos
PDF
vCLIMB: A Novel Video Class Incremental Learning Benchmark Andrés Villa, Kumail Alhamoud, Victor Escorcia, Fabian Caba, Juan León Alcázar, Bernard Ghanem
PDF
Vector Quantized Diffusion Model for Text-to-Image Synthesis Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo
PDF
Vehicle Trajectory Prediction Works, but Not Everywhere Mohammadhossein Bahari, Saeed Saadatnejad, Ahmad Rahimi, Mohammad Shaverdikondori, Amir Hossein Shahidzadeh, Seyed-Mohsen Moosavi-Dezfooli, Alexandre Alahi
PDF
Versatile Multi-Modal Pre-Training for Human-Centric Perception Fangzhou Hong, Liang Pan, Zhongang Cai, Ziwei Liu
PDF
VGSE: Visually-Grounded Semantic Embeddings for Zero-Shot Learning Wenjia Xu, Yongqin Xian, Jiuniu Wang, Bernt Schiele, Zeynep Akata
PDF
Video Demoireing with Relation-Based Temporal Consistency Peng Dai, Xin Yu, Lan Ma, Baoheng Zhang, Jia Li, Wenbo Li, Jiajun Shen, Xiaojuan Qi
PDF
Video Frame Interpolation Transformer Zhihao Shi, Xiangyu Xu, Xiaohong Liu, Jun Chen, Ming-Hsuan Yang
PDF
Video Frame Interpolation with Transformer Liying Lu, Ruizheng Wu, Huaijia Lin, Jiangbo Lu, Jiaya Jia
PDF
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy
PDF
Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu Zhou, Yimin Yang, Chunxia Xiao
PDF
Video Swin Transformer Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, Han Hu
PDF
Video-Text Representation Learning via Differentiable Weak Temporal Alignment Dohwan Ko, Joonmyung Choi, Juyeon Ko, Shinyeong Noh, Kyoung-Woon On, Eun-Sol Kim, Hyunwoo J. Kim
PDF
VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong Wang
PDF
ViM: Out-of-Distribution with Virtual-Logit Matching Haoqi Wang, Zhizhong Li, Litong Feng, Wayne Zhang
PDF
Virtual Correspondence: Humans as a Cue for Extreme-View Geometry Wei-Chiu Ma, Anqi Joyce Yang, Shenlong Wang, Raquel Urtasun, Antonio Torralba
PDF
Virtual Elastic Objects Hsiao-yu Chen, Edith Tretschk, Tuur Stuyck, Petr Kadlecek, Ladislav Kavan, Etienne Vouga, Christoph Lassner
PDF
VisCUIT: Visual Auditor for Bias in CNN Image Classifier Seongmin Lee, Zijie J. Wang, Judy Hoffman, Duen Horng Chau
PDF
Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiang Ruan
PDF
Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space Arnav Chavan, Zhiqiang Shen, Zhuang Liu, Zechun Liu, Kwang-Ting Cheng, Eric P. Xing
PDF
Vision Transformer with Deformable Attention Zhuofan Xia, Xuran Pan, Shiji Song, Li Erran Li, Gao Huang
PDF
Vision-Language Pre-Training for Boosting Scene Text Detectors Sibo Song, Jianqiang Wan, Zhibo Yang, Jun Tang, Wenqing Cheng, Xiang Bai, Cong Yao
PDF
Vision-Language Pre-Training with Triple Contrastive Learning Jinyu Yang, Jiali Duan, Son Tran, Yi Xu, Sampath Chanda, Liqun Chen, Belinda Zeng, Trishul Chilimbi, Junzhou Huang
PDF
VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation Su Ho Han, Sukjun Hwang, Seoung Wug Oh, Yeonchool Park, Hyunwoo Kim, Min-Jung Kim, Seon Joo Kim
PDF
VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention Shengheng Deng, Zhihao Liang, Lin Sun, Kui Jia
PDF
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang
PDF
Visual Abductive Reasoning Chen Liang, Wenguan Wang, Tianfei Zhou, Yi Yang
PDF
Visual Acoustic Matching Changan Chen, Ruohan Gao, Paul Calamia, Kristen Grauman
PDF
Visual Vibration Tomography: Estimating Interior Material Properties from Monocular Video Berthy T. Feng, Alexander C. Ogren, Chiara Daraio, Katherine L. Bouman
PDF
VisualGPT: Data-Efficient Adaptation of Pretrained Language Models for Image Captioning Jun Chen, Han Guo, Kai Yi, Boyang Li, Mohamed Elhoseiny
PDF
VisualHow: Multimodal Problem Solving Jinhui Yang, Xianyu Chen, Ming Jiang, Shi Chen, Louis Wang, Qi Zhao
PDF
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks Yi-Lin Sung, Jaemin Cho, Mohit Bansal
PDF
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal
PDF
Volumetric Bundle Adjustment for Online Photorealistic Scene Capture Ronald Clark
PDF
Vox2Cortex: Fast Explicit Reconstruction of Cortical Surfaces from 3D MRI Scans with Geometric Deep Neural Networks Fabian Bongratz, Anne-Marie Rickmann, Sebastian Pölsterl, Christian Wachinger
PDF
Voxel Field Fusion for 3D Object Detection Yanwei Li, Xiaojuan Qi, Yukang Chen, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia
PDF
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds Chenhang He, Ruihuang Li, Shuai Li, Lei Zhang
PDF
VRDFormer: End-to-End Video Visual Relation Detection with Transformers Sipeng Zheng, Shizhe Chen, Qin Jin
PDF
WALT: Watch and Learn 2D Amodal Representation from Time-Lapse Imagery N. Dinesh Reddy, Robert Tamburo, Srinivasa G. Narasimhan
PDF
WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation Yingzhi Tang, Yue Qian, Qijian Zhang, Yiming Zeng, Junhui Hou, Xuefei Zhe
PDF
Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects Atsuhiro Noguchi, Umar Iqbal, Jonathan Tremblay, Tatsuya Harada, Orazio Gallo
PDF
Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation Linfeng Zhang, Xin Chen, Xiaobing Tu, Pengfei Wan, Ning Xu, Kaisheng Ma
PDF
Weakly but Deeply Supervised Occlusion-Reasoned Parametric Road Layouts Buyu Liu, Bingbing Zhuang, Manmohan Chandraker
PDF
Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory Sangmin Lee, Hyung-Il Kim, Yong Man Ro
PDF
Weakly Supervised High-Fidelity Clothing Model Generation Ruili Feng, Cheng Ma, Chengji Shen, Xin Gao, Zhenjiang Liu, Xiaobo Li, Kairi Ou, Deli Zhao, Zheng-Jun Zha
PDF
Weakly Supervised Object Localization as Domain Adaption Lei Zhu, Qi She, Qian Chen, Yunfei You, Boyu Wang, Yanye Lu
PDF
Weakly Supervised Rotation-Invariant Aerial Object Detection Network Xiaoxu Feng, Xiwen Yao, Gong Cheng, Junwei Han
PDF
Weakly Supervised Segmentation on Outdoor 4D Point Clouds with Temporal Matching and Spatial Graph Propagation Hanyu Shi, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
PDF
Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast Ye Du, Zehua Fu, Qingjie Liu, Yunhong Wang
PDF
Weakly Supervised Semantic Segmentation Using Out-of-Distribution Data Jungbeom Lee, Seong Joon Oh, Sangdoo Yun, Junsuk Choe, Eunji Kim, Sungroh Yoon
PDF
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation Linjiang Huang, Liang Wang, Hongsheng Li
PDF
Weakly Supervised Temporal Sentence Grounding with Gaussian-Based Contrastive Proposal Learning Minghang Zheng, Yanjie Huang, Qingchao Chen, Yuxin Peng, Yang Liu
PDF
Weakly-Supervised Action Transition Learning for Stochastic Human Motion Prediction Wei Mao, Miaomiao Liu, Mathieu Salzmann
PDF
Weakly-Supervised Generation and Grounding of Visual Descriptions with Conditional Generative Models Effrosyni Mavroudi, René Vidal
PDF
Weakly-Supervised Metric Learning with Cross-Module Communications for the Classification of Anterior Chamber Angle Images Jingqi Huang, Yue Ning, Dong Nie, Linan Guan, Xiping Jia
PDF
Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos Reza Ghoddoosian, Isht Dwivedi, Nakul Agarwal, Chiho Choi, Behzad Dariush
PDF
WebQA: Multihop and Multimodal QA Yingshan Chang, Mridu Narang, Hisami Suzuki, Guihong Cao, Jianfeng Gao, Yonatan Bisk
PDF
What Do Navigation Agents Learn About Their Environment? Kshitij Dwivedi, Gemma Roig, Aniruddha Kembhavi, Roozbeh Mottaghi
PDF
What Makes Transfer Learning Work for Medical Images: Feature Reuse & Other Factors Christos Matsoukas, Johan Fredin Haslum, Moein Sorkhei, Magnus Söderberg, Kevin Smith
PDF
What Matters for Meta-Learning Vision Regression Tasks? Ning Gao, Hanna Ziesche, Ngo Anh Vien, Michael Volpp, Gerhard Neumann
PDF
What to Look at and Where: Semantic and Spatial Refined Transformer for Detecting Human-Object Interactions A S M Iftekhar, Hao Chen, Kaustav Kundu, Xinyu Li, Joseph Tighe, Davide Modolo
PDF
What's in Your Hands? 3D Reconstruction of Generic Objects in Hands Yufei Ye, Abhinav Gupta, Shubham Tulsiani
PDF
When Does Contrastive Visual Representation Learning Work? Elijah Cole, Xuan Yang, Kimberly Wilber, Oisin Mac Aodha, Serge Belongie
PDF
When to Prune? a Policy Towards Early Structural Pruning Maying Shen, Pavlo Molchanov, Hongxu Yin, Jose M. Alvarez
PDF
Which Images to Label for Few-Shot Medical Landmark Detection? Quan Quan, Qingsong Yao, Jun Li, S. Kevin Zhou
PDF
Which Model to Transfer? Finding the Needle in the Growing Haystack Cedric Renggli, André Susano Pinto, Luka Rimanic, Joan Puigcerver, Carlos Riquelme, Ce Zhang, Mario Lučić
PDF
Whose Hands Are These? Hand Detection and Hand-Body Association in the Wild Supreeth Narasimhaswamy, Thanh Nguyen, Mingzhen Huang, Minh Hoai
PDF
Whose Track Is It Anyway? Improving Robustness to Tracking Errors with Affinity-Based Trajectory Prediction Xinshuo Weng, Boris Ivanovic, Kris Kitani, Marco Pavone
PDF
Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis Jiajing Chen, Burak Kakillioglu, Huantao Ren, Senem Velipasalar
PDF
WildNet: Learning Domain Generalized Semantic Segmentation from the Wild Suhyeon Lee, Hongje Seong, Seongwon Lee, Euntai Kim
PDF
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross
PDF
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks Wenwen Pan, Haonan Shi, Zhou Zhao, Jieming Zhu, Xiuqiang He, Zhigeng Pan, Lianli Gao, Jun Yu, Fei Wu, Qi Tian
PDF
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval Satya Krishna Gorti, Noël Vouitsis, Junwei Ma, Keyvan Golestan, Maksims Volkovs, Animesh Garg, Guangwei Yu
PDF
X-Trans2Cap: Cross-Modal Knowledge Transfer Using Transformer for 3D Dense Captioning Zhihao Yuan, Xu Yan, Yinghong Liao, Yao Guo, Guanbin Li, Shuguang Cui, Zhen Li
PDF
XMP-Font: Self-Supervised Cross-Modality Pre-Training for Few-Shot Font Generation Wei Liu, Fangyue Liu, Fei Ding, Qian He, Zili Yi
PDF
XYDeblur: Divide and Conquer for Single Image Deblurring Seo-Won Ji, Jeongmin Lee, Seung-Wook Kim, Jun-Pyo Hong, Seung-Jin Baek, Seung-Won Jung, Sung-Jea Ko
PDF
XYLayoutLM: Towards Layout-Aware Multimodal Networks for Visually-Rich Document Understanding Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang
PDF
YouMVOS: An Actor-Centric Multi-Shot Video Object Segmentation Dataset Donglai Wei, Siddhant Kharbanda, Sarthak Arora, Roshan Roy, Nishant Jain, Akash Palrecha, Tanav Shah, Shray Mathur, Ritik Mathur, Abhijay Kemkar, Anirudh Chakravarthy, Zudi Lin, Won-Dong Jang, Yansong Tang, Song Bai, James Tompkin, Philip H.S. Torr, Hanspeter Pfister
PDF
ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari
PDF
Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation Ziad Al-Halah, Santhosh Kumar Ramakrishnan, Kristen Grauman
PDF
Zero-Query Transfer Attacks on Context-Aware Object Detectors Zikui Cai, Shantanu Rane, Alejandro E. Brito, Chengyu Song, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury, M. Salman Asif
PDF
Zero-Shot Text-Guided Object Generation with Dream Fields Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole
PDF
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic Yoad Tewel, Yoav Shalev, Idan Schwartz, Lior Wolf
PDF
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes Dina Bashkirova, Mohamed Abdelfattah, Ziliang Zhu, James Akl, Fadi Alladkani, Ping Hu, Vitaly Ablavsky, Berk Calli, Sarah Adel Bargal, Kate Saenko
PDF
Zoom in and Out: A Mixed-Scale Triplet Network for Camouflaged Object Detection Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang, Lihe Zhang, Huchuan Lu
PDF
ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds Georg Bökman, Fredrik Kahl, Axel Flinth
PDF