ICCV 2021
1612 papers
3D Building Reconstruction from Monocular Remote Sensing Images
Weijia Li, Lingxuan Meng, Jinwang Wang, Conghui He, Gui-Song Xia, Dahua Lin 3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng, Sijie Zhu, Matias Mendieta, Taojiannan Yang, Chen Chen, Zhengming Ding 3D Local Convolutional Neural Networks for Gait Recognition
Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua 3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics
Huan Fu, Bowen Cai, Lin Gao, Ling-Xiao Zhang, Jiaming Wang, Cao Li, Qixun Zeng, Chengyue Sun, Rongfei Jia, Binqiang Zhao, Hao Zhang 3DeepCT: Learning Volumetric Scattering Tomography of Clouds
Yael Sde-Chen, Yoav Y. Schechner, Vadim Holodovsky, Eshkol Eytan 3DIAS: 3D Shape Reconstruction with Implicit Algebraic Surfaces
Mohsen Yavartanoo, Jaeyoung Chung, Reyhaneh Neshatavar, Kyoung Mu Lee 4D Cloud Scattering Tomography
Roi Ronen, Yoav Y. Schechner, Eshkol Eytan 4D-Net for Learned Multi-Modal Alignment
Aj Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova 4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface
Yang Li, Hikari Takehara, Takafumi Taketomi, Bo Zheng, Matthias Nießner A Backdoor Attack Against 3D Point Cloud Classifiers
Zhen Xiang, David J. Miller, Siheng Chen, Xi Li, George Kesidis A Broad Study on the Transferability of Visual Representations with Contrastive Learning
Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Richard Radke, Rogerio Feris A Dark Flash Normal Camera
Zhihao Xia, Jason Lawrence, Supreeth Achar A General Recurrent Tracking Framework Without Real Data
Shuai Wang, Hao Sheng, Yang Zhang, Yubin Wu, Zhang Xiong A Light Stage on Every Desk
Soumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz A Multi-Mode Modulator for Multi-Domain Few-Shot Classification
Yanbin Liu, Juho Lee, Linchao Zhu, Ling Chen, Humphrey Shi, Yi Yang A New Journey from SDRTV to HDRTV
Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong A Robust Loss for Point Cloud Registration
Zhi Deng, Yuxin Yao, Bailin Deng, Juyong Zhang A Simple Feature Augmentation for Domain Generalization
Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales A Style and Semantic Memory Mechanism for Domain Generalization
Yang Chen, Yu Wang, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder
Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, Xiaohui Shen, Ding Liu, Nadia Magnenat Thalmann A Unified Objective for Novel Class Discovery
Enrico Fini, Enver Sangineto, Stéphane Lathuilière, Zhun Zhong, Moin Nabi, Elisa Ricci Achieving On-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David Kaeli, Bin Ren, Xue Lin, Yanzhi Wang Active Learning for Deep Object Detection via Probabilistic Modeling
Jiwoong Choi, Ismail Elezi, Hyuk-Jae Lee, Clement Farabet, Jose M. Alvarez Active Universal Domain Adaptation
Xinhong Ma, Junyu Gao, Changsheng Xu AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis
Yudong Guo, Keyu Chen, Sen Liang, Yong-Jin Liu, Hujun Bao, Juyong Zhang AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer
Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding AdaFit: Rethinking Learning-Based Normal Estimation on Point Clouds
Runsong Zhu, Yuan Liu, Zhen Dong, Yuan Wang, Tengping Jiang, Wenping Wang, Bisheng Yang AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
Rameswar Panda, Chun-Fu Chen, Quanfu Fan, Ximeng Sun, Kate Saenko, Aude Oliva, Rogerio Feris Adaptive Confidence Thresholding for Monocular Depth Estimation
Hyesong Choi, Hunsang Lee, Sunkyung Kim, Sunok Kim, Seungryong Kim, Kwanghoon Sohn, Dongbo Min Adaptive Curriculum Learning
Yajing Kong, Liu Liu, Jun Wang, Dacheng Tao Adaptive Focus for Efficient Video Recognition
Yulin Wang, Zhaoxi Chen, Haojun Jiang, Shiji Song, Yizeng Han, Gao Huang Adaptive Graph Convolution for Point Cloud Analysis
Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference
Juncheng Li, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang Adaptive Label Noise Cleaning with Meta-Supervision for Deep Face Recognition
Yaobin Zhang, Weihong Deng, Yaoyao Zhong, Jiani Hu, Xian Li, Dongyue Zhao, Dongchao Wen Adaptive Surface Normal Constraint for Depth Estimation
Xiaoxiao Long, Cheng Lin, Lingjie Liu, Wei Li, Christian Theobalt, Ruigang Yang, Wenping Wang AdvDrop: Adversarial Attack to DNNs by Dropping Information
Ranjie Duan, Yuefeng Chen, Dantong Niu, Yun Yang, A. K. Qin, Yuan He Adversarial Attack on Deep Cross-Modal Hamming Retrieval
Chao Li, Shangqian Gao, Cheng Deng, Wei Liu, Heng Huang Adversarial Attacks Are Reversible with Natural Supervision
Chengzhi Mao, Mia Chiquier, Hao Wang, Junfeng Yang, Carl Vondrick Adversarial Attacks on Multi-Agent Communication
James Tu, Tsunhsuan Wang, Jingkang Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun Adversarial Example Detection Using Latent Neighborhood Graph
Ahmed Abusnaina, Yuhang Wu, Sunpreet Arora, Yizhen Wang, Fei Wang, Hao Yang, David Mohaisen Adversarial Robustness for Unsupervised Domain Adaptation
Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, Ping Luo, Sung-Ho Bae, Zhenguo Li Adversarial Unsupervised Domain Adaptation with Conditional and Label Shift: Infer, Align and Iterate
Xiaofeng Liu, Zhenhua Guo, Site Li, Fangxu Xing, Jane You, C.-C. Jay Kuo, Georges El Fakhri, Jonghye Woo AESOP: Abstract Encoding of Stories, Objects, and Pictures
Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia Aggregation with Feature Detection
Shuyang Sun, Xiaoyu Yue, Xiaojuan Qi, Wanli Ouyang, Victor Adrian Prisacariu, Philip H.S. Torr Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models
Jie Li, Rongrong Ji, Peixian Chen, Baochang Zhang, Xiaopeng Hong, Ruixin Zhang, Shaoxin Li, Jilin Li, Feiyue Huang, Yongjian Wu Airbert: In-Domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid ALADIN: All Layer Adaptive Instance Normalization for Fine-Grained Style Similarity
Dan Ruta, Saeid Motiian, Baldo Faieta, Zhe Lin, Hailin Jin, Alex Filipkowski, Andrew Gilbert, John Collomosse Aligning Subtitles in Sign Language Videos
Hannah Bull, Triantafyllos Afouras, Gül Varol, Samuel Albanie, Liliane Momeni, Andrew Zisserman Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning
James Smith, Yen-Chang Hsu, Jonathan Balloch, Yilin Shen, Hongxia Jin, Zsolt Kira An Asynchronous Kalman Filter for Hybrid Event Cameras
Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony An Elastica Geodesic Approach with Convexity Shape Prior
Da Chen, Laurent D. Cohen, Jean-Marie Mirebeau, Xue-Cheng Tai Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies
Sida Peng, Junting Dong, Qianqian Wang, Shangzhan Zhang, Qing Shuai, Xiaowei Zhou, Hujun Bao Anonymizing Egocentric Videos
Daksh Thapar, Aditya Nigam, Chetan Arora Anticipative Video Transformer
Rohit Girdhar, Kristen Grauman ARCH++: Animation-Ready Clothed Human Reconstruction Revisited
Tong He, Yuanlu Xu, Shunsuke Saito, Stefano Soatto, Tony Tung Architecture Disentanglement for Deep Neural Networks
Jie Hu, Liujuan Cao, Tong Tong, Qixiang Ye, Shengchuan Zhang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji ASCNet: Self-Supervised Video Representation Learning with Appearance-Speed Consistency
Deng Huang, Wenhao Wu, Weiwen Hu, Xu Liu, Dongliang He, Zhihua Wu, Xiangmiao Wu, Mingkui Tan, Errui Ding Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query
Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun Asymmetric Loss for Multi-Label Classification
Tal Ridnik, Emanuel Ben-Baruch, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor Audio-Visual Floorplan Reconstruction
Senthil Purushwalkam, Sebastià Vicenc Amengual Garí, Vamsi Krishna Ithapu, Carl Schissler, Philip Robinson, Abhinav Gupta, Kristen Grauman Augmented Lagrangian Adversarial Attacks
Jérôme Rony, Eric Granger, Marco Pedersoli, Ismail Ben Ayed AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection
Zongdai Liu, Dingfu Zhou, Feixiang Lu, Jin Fang, Liangjun Zhang AutoSpace: Neural Architecture Search with Less Human Interference
Daquan Zhou, Xiaojie Jin, Xiaochen Lian, Linjie Yang, Yujing Xue, Qibin Hou, Jiashi Feng BabelCalib: A Universal Approach to Calibrating Central Cameras
Yaroslava Lochman, Kostiantyn Liepieshov, Jianhui Chen, Michal Perdoch, Christopher Zach, James Pritts Baking Neural Radiance Fields for Real-Time View Synthesis
Peter Hedman, Pratul P. Srinivasan, Ben Mildenhall, Jonathan T. Barron, Paul Debevec BARF: Bundle-Adjusting Neural Radiance Fields
Chen-Hsuan Lin, Wei-Chiu Ma, Antonio Torralba, Simon Lucey Benchmarking Ultra-High-Definition Image Super-Resolution
Kaihao Zhang, Dongxu Li, Wenhan Luo, Wenqi Ren, Björn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang Better Aggregation in Test-Time Augmentation
Divya Shanmugam, Davis Blalock, Guha Balakrishnan, John Guttag Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations
Pau Rodríguez, Massimo Caccia, Alexandre Lacoste, Lee Zamparo, Issam Laradji, Laurent Charlin, David Vazquez Bias Loss for Mobile Neural Networks
Lusine Abrahamyan, Valentin Ziatchin, Yiming Chen, Nikos Deligiannis Big Self-Supervised Models Advance Medical Image Classification
Shekoofeh Azizi, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi BioFors: A Large Biomedical Image Forensics Dataset
Ekraam Sabir, Soumyaroop Nandi, Wael Abd-Almageed, Prem Natarajan Black-Box Detection of Backdoor Attacks with Limited Information and Data
Yinpeng Dong, Xiao Yang, Zhijie Deng, Tianyu Pang, Zihao Xiao, Hang Su, Jun Zhu BlockPlanner: City Block Generation with Vectorized Graph Representation
Linning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin BN-NAS: Neural Architecture Search with Batch Normalization
Boyu Chen, Peixia Li, Baopu Li, Chen Lin, Chuming Li, Ming Sun, Junjie Yan, Wanli Ouyang Bootstrap Your Own Correspondences
Mohamed El Banani, Justin Johnson Boundary-Sensitive Pre-Training for Temporal Localization in Videos
Mengmeng Xu, Juan-Manuel Pérez-Rúa, Victor Escorcia, Brais Martínez, Xiatian Zhu, Li Zhang, Bernard Ghanem, Tao Xiang Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds
Chaoda Zheng, Xu Yan, Jiantao Gao, Weibing Zhao, Wei Zhang, Zhen Li, Shuguang Cui Bridging Unsupervised and Supervised Depth from Focus via All-in-Focus Supervision
Ning-Hsu Wang, Ren Wang, Yu-Lun Liu, Yu-Hao Huang, Yu-Lin Chang, Chia-Ping Chen, Kevin Jou Bringing Events into Video Deblurring with Non-Consecutively Blurry Frames
Wei Shang, Dongwei Ren, Dongqing Zou, Jimmy S. Ren, Ping Luo, Wangmeng Zuo Broaden Your Views for Self-Supervised Video Learning
Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Pătrăucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron van den Oord, Andrew Zisserman Building-GAN: Graph-Conditioned Architectural Volumetric Design Generation
Kai-Hung Chang, Chin-Yi Cheng, Jieliang Luo, Shingo Murata, Mehdi Nourbakhsh, Yoshito Tsuji BuildingNet: Learning to Label 3D Buildings
Pratheba Selvaraju, Mohamed Nabail, Marios Loizou, Maria Maslioukova, Melinos Averkiou, Andreas Andreou, Siddhartha Chaudhuri, Evangelos Kalogerakis BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification
Cheng Yan, Guansong Pang, Lei Wang, Jile Jiao, Xuetao Feng, Chunhua Shen, Jingjing Li Calibrated and Partially Calibrated Semi-Generalized Homographies
Snehal Bhayani, Torsten Sattler, Daniel Barath, Patrik Beliansky, Janne Heikkilä, Zuzana Kukelova Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
Zhuowan Li, Elias Stengel-Eskin, Yixiao Zhang, Cihang Xie, Quan Hung Tran, Benjamin Van Durme, Alan Yuille CANet: A Context-Aware Network for Shadow Removal
Zipei Chen, Chengjiang Long, Ling Zhang, Chunxia Xiao CAPTRA: CAtegory-Level Pose Tracking for Rigid and Articulated Objects from Point Clouds
Yijia Weng, He Wang, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas Cascade Image Matting with Deformable Graph Refinement
Zijian Yu, Xuhui Li, Huijuan Huang, Wen Zheng, Li Chen CaT: Weakly Supervised Object Detection with Category Transfer
Tianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya Zhang, Yan-Feng Wang Causal Attention for Unbiased Visual Recognition
Tan Wang, Chang Zhou, Qianru Sun, Hanwang Zhang CDNet: Centripetal Direction Network for Nuclear Instance Segmentation
Hongliang He, Zhongyi Huang, Yao Ding, Guoli Song, Lin Wang, Qian Ren, Pengxu Wei, Zhiqiang Gao, Jie Chen CDS: Cross-Domain Self-Supervised Pre-Training
Donghyun Kim, Kuniaki Saito, Tae-Hyun Oh, Bryan A. Plummer, Stan Sclaroff, Kate Saenko Channel-Wise Knowledge Distillation for Dense Prediction
Changyong Shu, Yifan Liu, Jianfei Gao, Zheng Yan, Chunhua Shen Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation
Mikhail Usvyatsov, Anastasia Makarova, Rafael Ballester-Ripoll, Maxim Rakhuba, Andreas Krause, Konrad Schindler Class Semantics-Based Attention for Action Detection
Deepak Sridhar, Niamul Quader, Srikanth Muralidharan, Yaoxin Li, Peng Dai, Juwei Lu CLEAR: Clean-up Sample-Targeted Backdoor in Neural Networks
Liuwan Zhu, Rui Ning, Chunsheng Xin, Chonggang Wang, Hongyi Wu Click to Move: Controlling Video Generation with Sparse Motion
Pierfrancesco Ardino, Marco De Nadai, Bruno Lepri, Elisa Ricci, Stéphane Lathuilière Co-Scale Conv-Attentional Image Transformers
Weijian Xu, Yifan Xu, Tyler Chang, Zhuowen Tu Co2L: Contrastive Continual Learning
Hyuntak Cha, Jaeho Lee, Jinwoo Shin CODEs: Chamfer Out-of-Distribution Examples Against Overconfidence Issue
Keke Tang, Dingruibo Miao, Weilong Peng, Jianpeng Wu, Yawen Shi, Zhaoquan Gu, Zhihong Tian, Wenping Wang Collaging Class-Specific GANs for Semantic Image Synthesis
Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh COMISR: Compression-Informed Video Super-Resolution
Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar Common Objects in 3D: Large-Scale Learning and Evaluation of Real-Life 3D Category Reconstruction
Jeremy Reizenstein, Roman Shapovalov, Philipp Henzler, Luca Sbordone, Patrick Labatut, David Novotny Compressing Visual-Linguistic Model via Knowledge Distillation
Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lijuan Wang, Yezhou Yang, Zicheng Liu Concept Generalization in Visual Representation Learning
Mert Bulent Sariyildiz, Yannis Kalantidis, Diane Larlus, Karteek Alahari Conditional DETR for Fast Training Convergence
Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang Conditional Diffusion for Interactive Segmentation
Xi Chen, Zhiyan Zhao, Feiwu Yu, Yilei Zhang, Manni Duan Conditional Variational Capsule Network for Open Set Recognition
Yunrui Guo, Guglielmo Camporese, Wenjing Yang, Alessandro Sperduti, Lamberto Ballan Confidence Calibration for Domain Generalization Under Covariate Shift
Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio Conformer: Local Features Coupling Global Representations for Visual Recognition
Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye Consistency-Aware Graph Network for Human Interaction Understanding
Zhenhua Wang, Jiajun Meng, Dongyan Guo, Jianhua Zhang, Javen Qinfeng Shi, Shengyong Chen Contact-Aware Retargeting of Skinned Motion
Ruben Villegas, Duygu Ceylan, Aaron Hertzmann, Jimei Yang, Jun Saito Context Reasoning Attention Network for Image Super-Resolution
Yulun Zhang, Donglai Wei, Can Qin, Huan Wang, Hanspeter Pfister, Yun Fu Context-Aware Scene Graph Generation with Seq2Seq Transformers
Yichao Lu, Himanshu Rai, Jason Chang, Boris Knyazev, Guangwei Yu, Shashank Shekhar, Graham W. Taylor, Maksims Volkovs Context-Sensitive Temporal Feature Learning for Gait Recognition
Xiaohu Huang, Duowang Zhu, Hao Wang, Xinggang Wang, Bo Yang, Botao He, Wenyu Liu, Bin Feng Contextually Plausible and Diverse 3D Human Motion Prediction
Sadegh Aliakbarian, Fatemeh Saleh, Lars Petersson, Stephen Gould, Mathieu Salzmann Continual Learning for Image-Based Camera Localization
Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala Contrast and Classify: Training Robust VQA Models
Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal Contrast and Order Representations for Video Self-Supervised Learning
Kai Hu, Jie Shao, Yuan Liu, Bhiksha Raj, Marios Savvides, Zhiqiang Shen Contrasting Contrastive Self-Supervised Representation Learning Pipelines
Klemen Kotar, Gabriel Ilharco, Ludwig Schmidt, Kiana Ehsani, Roozbeh Mottaghi Contrastive Learning for Label Efficient Semantic Segmentation
Xiangyun Zhao, Raviteja Vemulapalli, Philip Andrew Mansfield, Boqing Gong, Bradley Green, Lior Shapira, Ying Wu Contrastive Multimodal Fusion with TupleInfoNCE
Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong, Thomas Funkhouser, Li Yi Cortical Surface Shape Analysis Based on Alexandrov Polyhedra
Min Zhang, Yang Guo, Na Lei, Zhou Zhao, Jianfeng Wu, Xiaoyin Xu, Yalin Wang, Xianfeng Gu COTR: Correspondence Transformer for Matching Across Images
Wei Jiang, Eduard Trulls, Jan Hosang, Andrea Tagliasacchi, Kwang Moo Yi CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point Clouds
Eric-Tuan Lê, Minhyuk Sung, Duygu Ceylan, Radomir Mech, Tamy Boubekeur, Niloy J. Mitra CrackFormer: Transformer Network for Fine-Grained Crack Detection
Huajun Liu, Xiangyu Miao, Christoph Mertz, Chengzhong Xu, Hui Kong Cross-Camera Convolutional Color Constancy
Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, Francois Bleibel Cross-Category Video Highlight Detection via Set-Based Learning
Minghao Xu, Hang Wang, Bingbing Ni, Riheng Zhu, Zhenbang Sun, Changhu Wang Cross-Descriptor Visual Localization and Mapping
Mihai Dusmanu, Ondrej Miksik, Johannes L. Schönberger, Marc Pollefeys CrossDet: Crossline Representation for Object Detection
Heqian Qiu, Hongliang Li, Qingbo Wu, Jianhua Cui, Zichen Song, Lanxiao Wang, Minjian Zhang CrossNorm and SelfNorm for Generalization Under Distribution Shifts
Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris N. Metaxas Crossover Learning for Fast Online Video Instance Segmentation
Shusheng Yang, Yuxin Fang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu Crowd Counting with Partial Annotations in an Image
Yanyu Xu, Ziming Zhong, Dongze Lian, Jing Li, Zhengxin Li, Xinxing Xu, Shenghua Gao CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization
Ara Jafarzadeh, Manuel López Antequera, Pau Gargallo, Yubin Kuang, Carl Toft, Fredrik Kahl, Torsten Sattler CSG-Stump: A Learning Friendly CSG-like Representation for Interpretable Shape Parsing
Daxuan Ren, Jianmin Zheng, Jianfei Cai, Jiatong Li, Haiyong Jiang, Zhongang Cai, Junzhe Zhang, Liang Pan, Mingyuan Zhang, Haiyu Zhao, Shuai Yi CTRL-C: Camera Calibration TRansformer with Line-Classification
Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim CvT: Introducing Convolutions to Vision Transformers
Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang DAE-GAN: Dynamic Aspect-Aware GAN for Text-to-Image Synthesis
Shulan Ruan, Yong Zhang, Kun Zhang, Yanbo Fan, Fan Tang, Qi Liu, Enhong Chen DAM: Discrepancy Alignment Metric for Face Recognition
Jiaheng Liu, Yudong Wu, Yichao Wu, Chuming Li, Xiaolin Hu, Ding Liang, Mengyu Wang De-Rendering Stylized Texts
Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi DecentLaM: Decentralized Momentum SGD for Large-Batch Deep Training
Kun Yuan, Yiming Chen, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin Deep 3D Mask Volume for View Synthesis of Dynamic Scenes
Kai-En Lin, Lei Xiao, Feng Liu, Guowei Yang, Ravi Ramamoorthi Deep Blind Video Super-Resolution
Jinshan Pan, Haoran Bai, Jiangxin Dong, Jiawei Zhang, Jinhui Tang Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation
Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim Deep Edge-Aware Interactive Colorization Against Color-Bleeding Effects
Eungyeup Kim, Sanghyeon Lee, Jeonghoon Park, Somi Choi, Choonghyun Seo, Jaegul Choo Deep Halftoning with Reversible Binary Pattern
Menghan Xia, Wenbo Hu, Xueting Liu, Tien-Tsin Wong Deep Hough Voting for Robust Global Registration
Junha Lee, Seungwook Kim, Minsu Cho, Jaesik Park Deep Hybrid Self-Prior for Full 3D Mesh Generation
Xingkui Wei, Zhengqing Chen, Yanwei Fu, Zhaopeng Cui, Yinda Zhang Deep Implicit Surface Point Prediction Networks
Rahul Venkatesh, Tejan Karmali, Sarthak Sharma, Aurobrata Ghosh, R. Venkatesh Babu, László A. Jeni, Maneesh Singh Deep Metric Learning for Open World Semantic Segmentation
Jun Cen, Peng Yun, Junhao Cai, Michael Yu Wang, Ming Liu Deep Permutation Equivariant Structure from Motion
Dror Moran, Hodaya Koslowsky, Yoni Kasten, Haggai Maron, Meirav Galun, Ronen Basri Deep Relational Metric Learning
Wenzhao Zheng, Borui Zhang, Jiwen Lu, Jie Zhou Deep Reparametrization of Multi-Frame Super-Resolution and Denoising
Goutam Bhat, Martin Danelljan, Fisher Yu, Luc Van Gool, Radu Timofte Deep Survival Analysis with Longitudinal X-Rays for COVID-19
Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih Deep Virtual Markers for Articulated 3D Shapes
Hyomin Kim, Jungeon Kim, Jaewon Kam, Jaesik Park, Seungyong Lee DeepPRO: Deep Partial Point Cloud Registration of Objects
Donghoon Lee, Onur C. Hamsici, Steven Feng, Prachee Sharma, Thorsten Gernoth Defending Against Universal Adversarial Patches by Clipping Feature Norms
Cheng Yu, Jiansheng Chen, Youze Xue, Yuyang Liu, Weitao Wan, Jiayu Bao, Huimin Ma Defocus mAP Estimation and Deblurring from a Single Dual-Pixel Image
Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, Rahul Garg DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection
Limeng Qiao, Yuxuan Zhao, Zhiyuan Li, Xi Qiu, Jianan Wu, Chi Zhang Dense Interaction Learning for Video-Based Person Re-Identification
Tianyu He, Xin Jin, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua DepthTrack: Unveiling the Power of RGBD Tracking
Song Yan, Jinyu Yang, Jani Käpylä, Feng Zheng, Aleš Leonardis, Joni-Kristian Kämäräinen Describing and Localizing Multiple Changes with Transformers
Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh DetCo: Unsupervised Contrastive Learning for Object Detection
Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Peize Sun, Zhenguo Li, Ping Luo Detecting Invisible People
Tarasha Khurana, Achal Dave, Deva Ramanan Detection and Continual Learning of Novel Face Presentation Attacks
Mohammad Rostami, Leonidas Spinoulas, Mohamed Hussein, Joe Mathai, Wael Abd-Almageed Detector-Free Weakly Supervised Grounding by Separation
Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen, Alex Bronstein, Kate Saenko, Shimon Ullman, Raja Giryes, Rogerio Feris, Leonid Karlinsky DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities
Elias Eulig, Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Kilian Rambach, William Beluch, Xiahan Shi, Volker Fischer Differentiable Convolution Search for Point Cloud Processing
Xing Nie, Yongcheng Liu, Shaohong Chen, Jianlong Chang, Chunlei Huo, Gaofeng Meng, Qi Tian, Weiming Hu, Chunhong Pan Differentiable Dynamic Wirings for Neural Networks
Kun Yuan, Quanquan Li, Shaopeng Guo, Dapeng Chen, Aojun Zhou, Fengwei Yu, Ziwei Liu Differentiable Surface Rendering via Non-Differentiable Sampling
Forrester Cole, Kyle Genova, Avneesh Sud, Daniel Vlasic, Zhoutong Zhang Digging into Uncertainty in Self-Supervised Multi-View Stereo
Hongbin Xu, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao Direct Differentiable Augmentation Search
Aoming Liu, Zehao Huang, Zhiwu Huang, Naiyan Wang DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision
Shiyi Lan, Zhiding Yu, Christopher Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar Discovering 3D Parts from Image Collections
Chun-Han Yao, Wei-Chih Hung, Varun Jampani, Ming-Hsuan Yang Discriminative Region-Based Multi-Label Zero-Shot Learning
Sanath Narayan, Akshita Gupta, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah Disentangled High Quality Salient Object Detection
Lv Tang, Bo Li, Yijie Zhong, Shouhong Ding, Mofei Song Disentangled Lifespan Face Synthesis
Sen He, Wentong Liao, Michael Ying Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang Dissecting Image Crops
Basile Van Hoorick, Carl Vondrick Distance-Aware Quantization
Dohyung Kim, Junghyup Lee, Bumsub Ham Distillation-Guided Image Inpainting
Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan Distilling Global and Local Logits with Densely Connected Relations
Youmin Kim, Jinbae Park, YounHo Jang, Muhammad Ali, Tae-Hyun Oh, Sung-Ho Bae Distilling Holistic Knowledge with Graph Neural Networks
Sheng Zhou, Yucheng Wang, Defang Chen, Jiawei Chen, Xin Wang, Can Wang, Jiajun Bu Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces
Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort DisUnknown: Distilling Unknown Factors for Disentanglement Learning
Sitao Xiang, Yuming Gu, Pengda Xiang, Menglei Chai, Hao Li, Yajie Zhao, Mingming He Diverse Image Style Transfer via Invertible Cross-Space Mapping
Haibo Chen, Lei Zhao, Huiming Zhang, Zhizhong Wang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu Divide and Conquer for Single-Frame Temporal Action Localization
Chen Ju, Peisen Zhao, Siheng Chen, Ya Zhang, Yanfeng Wang, Qi Tian DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes
Dongki Jung, Jaehoon Choi, Yonghan Lee, Deokhwa Kim, Changick Kim, Dinesh Manocha, Donghwan Lee Do Image Classifiers Generalize Across Time?
Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features
Min Yang, Dongliang He, Miao Fan, Baorong Shi, Xuetong Xue, Fu Li, Errui Ding, Jizhou Huang Domain Generalization via Gradient Surgery
Lucas Mansilla, Rodrigo Echeveste, Diego H. Milone, Enzo Ferrante Domain-Aware Universal Style Transfer
Kibeom Hong, Seogkyu Jeon, Huan Yang, Jianlong Fu, Hyeran Byun Domain-Invariant Disentangled Network for Generalizable Object Detection
Chuang Lin, Zehuan Yuan, Sicheng Zhao, Peize Sun, Changhu Wang, Jianfei Cai Dual Contrastive Loss and Attention for GANs
Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry S. Davis, Mario Fritz Dual Path Learning for Domain Adaptation of Semantic Segmentation
Yiting Cheng, Fangyun Wei, Jianmin Bao, Dong Chen, Fang Wen, Wenqiang Zhang Dual Projection Generative Adversarial Networks for Conditional Image Generation
Ligong Han, Martin Renqiang Min, Anastasis Stathopoulos, Yu Tian, Ruijiang Gao, Asim Kadav, Dimitris N. Metaxas Dual-Camera Super-Resolution with Aligned Attention Modules
Tengfei Wang, Jiaxin Xie, Wenxiu Sun, Qiong Yan, Qifeng Chen Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection
Miao Zhang, Jie Liu, Yifei Wang, Yongri Piao, Shunyu Yao, Wei Ji, Jingjing Li, Huchuan Lu, Zhongxuan Luo Dynamic DETR: End-to-End Object Detection with Dynamic Attention
Xiyang Dai, Yinpeng Chen, Jianwei Yang, Pengchuan Zhang, Lu Yuan, Lei Zhang Dynamic Dual Gating Neural Networks
Fanrong Li, Gang Li, Xiangyu He, Jian Cheng Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-Resolution
Salma Abdel Magid, Yulun Zhang, Donglai Wei, Won-Dong Jang, Zudi Lin, Yun Fu, Hanspeter Pfister Dynamic Network Quantization for Efficient Video Inference
Ximeng Sun, Rameswar Panda, Chun-Fu Chen, Aude Oliva, Rogerio Feris, Kate Saenko Dynamic View Synthesis from Dynamic Monocular Video
Chen Gao, Ayush Saraf, Johannes Kopf, Jia-Bin Huang Dynamical Pose Estimation
Heng Yang, Chris Doran, Jean-Jacques Slotine E-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks
Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz EC-DARTS: Inducing Equalized and Consistent Optimization into DARTS
Qinqin Zhou, Xiawu Zheng, Liujuan Cao, Bineng Zhong, Teng Xi, Gang Zhang, Errui Ding, Mingliang Xu, Rongrong Ji Editing Conditional Radiance Fields
Steven Liu, Xiuming Zhang, Zhoutong Zhang, Richard Zhang, Jun-Yan Zhu, Bryan Russell Effectively Leveraging Attributes for Visual Similarity
Samarth Mishra, Zhongping Zhang, Yuan Shen, Ranjitha Kumar, Venkatesh Saligrama, Bryan A. Plummer Efficient Action Recognition via Dynamic Knowledge Propagation
Hanul Kim, Mihir Jain, Jun-Tae Lee, Sungrack Yun, Fatih Porikli Efficient and Differentiable Shadow Computation for Inverse Problems
Linjie Lyu, Marc Habermann, Lingjie Liu, B R Mallikarjun, Ayush Tewari, Christian Theobalt Efficient Visual Pretraining with Contrastive Detection
Olivier J. Hénaff, Skanda Koppula, Jean-Baptiste Alayrac, Aaron van den Oord, Oriol Vinyals, João Carreira EgoRenderer: Rendering Human Avatars from Egocentric Camera Images
Tao Hu, Kripasindhu Sarkar, Lingjie Liu, Matthias Zwicker, Christian Theobalt ELF-VC: Efficient Learned Flexible-Rate Video Coding
Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair, Craig Lytle, Lubomir Bourdev ELSD: Efficient Line Segment Detector and Descriptor
Haotian Zhang, Yicheng Luo, Fangbo Qin, Yijia He, Xiao Liu EM-POSE: 3D Human Pose Estimation from Sparse Electromagnetic Trackers
Manuel Kaufmann, Yi Zhao, Chengcheng Tang, Lingling Tao, Christopher Twigg, Jie Song, Robert Wang, Otmar Hilliges Embed Me if You Can: A Geometric Perceptron
Pavlo Melnyk, Michael Felsberg, Mårten Wadenbäck Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin End-to-End Dense Video Captioning with Parallel Decoding
Teng Wang, Ruimao Zhang, Zhichao Lu, Feng Zheng, Ran Cheng, Ping Luo End-to-End Piece-Wise Unwarping of Document Images
Sagnik Das, Kunwar Yashraj Singh, Jon Wu, Erhan Bas, Vijay Mahadevan, Rahul Bhotika, Dimitris Samaras End-to-End Semi-Supervised Object Detection with Soft Teacher
Mengde Xu, Zheng Zhang, Han Hu, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu End-to-End Unsupervised Document Image Blind Denoising
Mehrdad J. Gangeh, Marcin Plata, Hamid R. Motahari Nezhad, Nigel P Duffy End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool Enhanced Boundary Learning for Glass-like Object Segmentation
Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, LuBin Weng Ensemble Attention Distillation for Privacy-Preserving Federated Learning
Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David Doermann, Arun Innanje Estimating Egocentric 3D Human Pose in Global Space
Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Christian Theobalt EventHands: Real-Time Neural 3D Hand Pose Estimation from an Event Stream
Viktor Rudnev, Vladislav Golyanik, Jiayi Wang, Hans-Peter Seidel, Franziska Mueller, Mohamed Elgharib, Christian Theobalt EventHPE: Event-Based 3D Human Pose and Shape Estimation
Shihao Zou, Chuan Guo, Xinxin Zuo, Sen Wang, Pengyu Wang, Xiaoqin Hu, Shoushun Chen, Minglun Gong, Li Cheng Evolving Search Space for Neural Architecture Search
Yuanzheng Ci, Chen Lin, Ming Sun, Boyu Chen, Hongwen Zhang, Wanli Ouyang Explaining in Style: Training a GAN to Explain a Classifier in StyleSpace
Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri Explanations for Occluded Images
Hana Chockler, Daniel Kroening, Youcheng Sun Exploiting Explanations for Model Inversion Attacks
Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, Brian Lim Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes
Mingjun Yin, Shasha Li, Zikui Cai, Chengyu Song, M. Salman Asif, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy Exploiting Sample Correlation for Crowd Counting with Multi-Expert Network
Xinyan Liu, Guorong Li, Zhenjun Han, Weigang Zhang, Yifan Yang, Qingming Huang, Nicu Sebe Exploring Cross-Image Pixel Contrast for Semantic Segmentation
Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc Van Gool Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection
Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei Zhang, Zhenguo Li, Luc Van Gool Exploring Inter-Channel Correlation for Diversity-Preserved Knowledge Distillation
Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang Exploring Long Tail Visual Relationship Recognition with Large Vocabulary
Sherif Abdelkarim, Aniket Agarwal, Panos Achlioptas, Jun Chen, Jiaji Huang, Boyang Li, Kenneth Church, Mohamed Elhoseiny Exploring Relational Context for Multi-Task Dense Prediction
David Brüggemann, Menelaos Kanakis, Anton Obukhov, Stamatios Georgoulis, Luc Van Gool Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation
Jinyu Yang, Chunyuan Li, Weizhi An, Hehuan Ma, Yuzhi Guo, Yu Rong, Peilin Zhao, Junzhou Huang Exploring Visual Engagement Signals for Representation Learning
Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim Extreme Structure from Motion for Indoor Panoramas Without Visual Overlaps
Mohammad Amin Shabani, Weilian Song, Makoto Odamaki, Hirochika Fujiki, Yasutaka Furukawa Face Image Retrieval with Attribute Manipulation
Alireza Zaeemzadeh, Shabnam Ghadar, Baldo Faieta, Zhe Lin, Nazanin Rahnavard, Mubarak Shah, Ratheesh Kalarot FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning
Chenxu Zhang, Yifan Zhao, Yifei Huang, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo Factorizing Perception and Policy for Interactive Instruction Following
Kunal Pratap Singh, Suvaansh Bhambri, Byeonghwi Kim, Roozbeh Mottaghi, Jonghyun Choi Fake It till You Make It: Face Analysis in the Wild Using Synthetic Data Alone
Erroll Wood, Tadas Baltrušaitis, Charlie Hewitt, Sebastian Dziadzio, Thomas J. Cashman, Jamie Shotton Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning
Qi Sun, Chen Bai, Tinghuan Chen, Hao Geng, Xinyun Zhang, Yang Bai, Bei Yu Fast Convergence of DETR with Spatially Modulated Co-Attention
Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li Faster Multi-Object Segmentation Using Parallel Quadratic Pseudo-Boolean Optimization
Niels Jeppesen, Patrick M. Jensen, Anders N. Christensen, Anders B. Dahl, Vedrana A. Dahl FastNeRF: High-Fidelity Neural Rendering at 200FPS
Stephan J. Garbin, Marek Kowalski, Matthew Johnson, Jamie Shotton, Julien Valentin Feature Importance-Aware Transferable Adversarial Attacks
Zhibo Wang, Hengchang Guo, Zhifei Zhang, Wenxin Liu, Zhan Qin, Kui Ren Few-Shot Visual Relationship Co-Localization
Revant Teotia, Vaibhav Mishra, Mayank Maheshwari, Anand Mishra Field Convolutions for Surface CNNs
Thomas W. Mitchel, Vladimir G. Kim, Michael Kazhdan Field-Guide-Inspired Zero-Shot Learning
Utkarsh Mall, Bharath Hariharan, Kavita Bala FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall Finding Representative Interpretations on Convolutional Neural Networks
Peter Cho-Ho Lam, Lingyang Chu, Maxim Torgonskiy, Jian Pei, Yong Zhang, Lanjun Wang Flow-Guided Video Inpainting with Scene Templates
Dong Lao, Peihao Zhu, Peter Wonka, Ganesh Sundaramoorthi FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters
Yuwei Cheng, Jiannan Zhu, Mengxin Jiang, Jie Fu, Changsong Pang, Peidong Wang, Kris Sankaran, Olawale Onabola, Yimin Liu, Dianbo Liu, Yoshua Bengio FMODetect: Robust Detection of Fast Moving Objects
Denys Rozumnyi, Jiří Matas, Filip Šroubek, Marc Pollefeys, Martin R. Oswald FOVEA: Foveated Image Magnification for Autonomous Navigation
Chittesh Thavamani, Mengtian Li, Nicolas Cebron, Deva Ramanan Free-Form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud
Mingtao Feng, Zhen Li, Qi Li, Liang Zhang, XiangDong Zhang, Guangming Zhu, Hui Zhang, Yaonan Wang, Ajmal Mian FREE: Feature Refinement for Generalized Zero-Shot Learning
Shiming Chen, Wenjie Wang, Beihao Xia, Qinmu Peng, Xinge You, Feng Zheng, Ling Shao From General to Specific: Informative Scene Graph Generation via Balance Adjustment
Yuyu Guo, Lianli Gao, Xuanhan Wang, Yuxuan Hu, Xing Xu, Xu Lu, Heng Tao Shen, Jingkuan Song Full-Duplex Strategy for Video Object Segmentation
Ge-Peng Ji, Keren Fu, Zhe Wu, Deng-Ping Fan, Jianbing Shen, Ling Shao Full-Velocity Radar Returns by Radar-Camera Fusion
Yunfei Long, Daniel Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li Fusion Moves for Graph Matching
Lisa Hutschenreiter, Stefan Haller, Lorenz Feineis, Carsten Rother, Dagmar Kainmüller, Bogdan Savchynskyy Gait Recognition in the Wild: A Benchmark
Zheng Zhu, Xianda Guo, Tian Yang, Junjie Huang, Jiankang Deng, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou GAN-Control: Explicitly Controllable GANs
Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Gérard Medioni Gated3D: Monocular 3D Object Detection from Temporal Illumination Cues
Frank Julca-Aguilar, Jason Taylor, Mario Bijelic, Fahim Mannan, Ethan Tseng, Felix Heide Generalize Then Adapt: Source-Free Domain Adaptive Semantic Segmentation
Jogendra Nath Kundu, Akshay Kulkarni, Amit Singh, Varun Jampani, R. Venkatesh Babu Generalized Shuffled Linear Regression
Feiran Li, Kent Fujiwara, Fumio Okura, Yasuyuki Matsushita Generalized Source-Free Domain Adaptation
Shiqi Yang, Yaxing Wang, Joost van de Weijer, Luis Herranz, Shangling Jui Generative Compositional Augmentations for Scene Graph Prediction
Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky Generative Layout Modeling Using Constraint Graphs
Wamiq Para, Paul Guerrero, Tom Kelly, Leonidas J. Guibas, Peter Wonka Generic Event Boundary Detection: A Benchmark for Event Segmentation
Mike Zheng Shou, Stan Weixian Lei, Weiyao Wang, Deepti Ghadiyaram, Matt Feiszli Geography-Aware Self-Supervised Learning
Kumar Ayush, Burak Uzkent, Chenlin Meng, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon Geometric Granularity Aware Pixel-to-Mesh
Yue Shi, Bingbing Ni, Jinxian Liu, Dingyi Rong, Ye Qian, Wenjun Zhang Geometry Uncertainty Projection Network for Monocular 3D Object Detection
Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Junjie Yan, Wanli Ouyang Geometry-Based Distance Decomposition for Monocular 3D Object Detection
Xuepeng Shi, Qi Ye, Xiaozhi Chen, Chuangrong Chen, Zhixiang Chen, Tae-Kyun Kim GLiT: Neural Architecture Search for Global and Local Image Transformer
Boyu Chen, Peixia Li, Chuming Li, Baopu Li, Lei Bai, Chen Lin, Ming Sun, Junjie Yan, Wanli Ouyang GNeRF: GAN-Based Neural Radiance Field Without Posed Camera
Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, Jingyi Yu Going Deeper with Image Transformers
Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou GP-S3Net: Graph-Based Panoptic Sparse Semantic Segmentation Network
Ryan Razani, Ran Cheng, Enxu Li, Ehsan Taghavi, Yuan Ren, Liu Bingbing Grafit: Learning Fine-Grained Image Representations with Coarse Labels
Hugo Touvron, Alexandre Sablayrolles, Matthijs Douze, Matthieu Cord, Hervé Jégou Graph Constrained Data Representation Learning for Human Motion Segmentation
Mariella Dimiccoli, Lluís Garrido, Guillem Rodriguez-Corominas, Herwig Wendt Graph Contrastive Clustering
Huasong Zhong, Jianlong Wu, Chong Chen, Jianqiang Huang, Minghua Deng, Liqiang Nie, Zhouchen Lin, Xian-Sheng Hua Graph-BAS3Net: Boundary-Aware Semi-Supervised Segmentation Network with Bilateral Graph Convolution
Huimin Huang, Lanfen Lin, Yue Zhang, Yingying Xu, Jing Zheng, XiongWei Mao, Xiaohan Qian, Zhiyi Peng, Jianying Zhou, Yen-Wei Chen, Ruofeng Tong Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang Graph-Based Asynchronous Event Processing for Rapid Object Recognition
Yijin Li, Han Zhou, Bangbang Yang, Ye Zhang, Zhaopeng Cui, Hujun Bao, Guofeng Zhang Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Chenxi Wang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Jin Gao, Cewu Lu Gravity-Aware Monocular 3D Human-Object Reconstruction
Rishabh Dabral, Soshi Shimada, Arjun Jain, Christian Theobalt, Vladislav Golyanik GridToPix: Training Embodied Agents with Minimal Supervision
Unnat Jain, Iou-Jen Liu, Svetlana Lazebnik, Aniruddha Kembhavi, Luca Weihs, Alexander G. Schwing Group-Free 3D Object Detection via Transformers
Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer
Shuaicheng Li, Qianggang Cao, Lingbo Liu, Kunlin Yang, Shinan Liu, Jun Hou, Shuai Yi H2O: A Benchmark for Visual Human-Human Object Handover Analysis
Ruolin Ye, Wenqiang Xu, Zhendong Xue, Tutian Tang, Yanfeng Wang, Cewu Lu H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction
Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime Garcia, Xavier Giró-i-Nieto, Francesc Moreno-Noguer HAA500: Human-Centric Atomic Action Dataset with Curated Videos
Jihoon Chung, Cheng-hsin Wuu, Hsuan-ru Yang, Yu-Wing Tai, Chi-Keung Tang Hand Image Understanding via Deep Multi-Task Learning
Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu Handwriting Transformers
Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Mubarak Shah HeadGAN: One-Shot Neural Head Synthesis and Editing
Michail Christos Doukas, Stefanos Zafeiriou, Viktoriia Sharmanska Hierarchical Aggregation for 3D Instance Segmentation
Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang Hierarchical Memory Matching Network for Video Object Segmentation
Hongje Seong, Seoung Wug Oh, Joon-Young Lee, Seongwon Lee, Suhyeon Lee, Euntai Kim Hierarchical Object-to-Zone Graph for Object Navigation
Sixian Zhang, Xinhang Song, Yubing Bai, Weijie Li, Yakui Chu, Shuqiang Jiang High Quality Disparity Remapping with Two-Stage Warping
Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo High-Performance Discriminative Tracking with Transformers
Bin Yu, Ming Tang, Linyu Zheng, Guibo Zhu, Jinqiao Wang, Hao Feng, Xuetao Feng, Hanqing Lu High-Resolution Optical Flow from 1d Attention and Correlation
Haofei Xu, Jiaolong Yang, Jianfei Cai, Juyong Zhang, Xin Tong HighlightMe: Detecting Highlights from Human-Centric Videos
Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli, Viswanathan Swaminathan, Dinesh Manocha HiNet: Deep Image Hiding by Invertible Network
Junpeng Jing, Xin Deng, Mai Xu, Jianyi Wang, Zhenyu Guan HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval
Song Liu, Haoqi Fan, Shengsheng Qian, Yiru Chen, Wenkui Ding, Zhongyuan Wang How Shift Equivariance Impacts Metric Learning for Instance Segmentation
Josef Lorenz Rumberger, Xiaoyan Yu, Peter Hirsch, Melanie Dohmen, Vanessa Emanuela Guarino, Ashkan Mokarian, Lisa Mais, Jan Funke, Dagmar Kainmüller How to Train Neural Networks for Flare Removal
Yicheng Wu, Qiurui He, Tianfan Xue, Rahul Garg, Jiawen Chen, Ashok Veeraraghavan, Jonathan T. Barron HPNet: Deep Primitive Segmentation Using Hybrid Representations
Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, Qixing Huang Human Detection and Segmentation via Multi-View Consensus
Isinsu Katircioglu, Helge Rhodin, Jörg Spörri, Mathieu Salzmann, Pascal Fua Human Pose Regression with Residual Log-Likelihood Estimation
Jiefeng Li, Siyuan Bian, Ailing Zeng, Can Wang, Bo Pang, Wentao Liu, Cewu Lu HuMoR: 3D Human Motion Model for Robust Pose Estimation
Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas J. Guibas Hybrid Neural Fusion for Full-Frame Video Stabilization
Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Mike Roberts, Jason Ramapuram, Anurag Ranjan, Atulit Kumar, Miguel Angel Bautista, Nathan Paczan, Russ Webb, Joshua M. Susskind ICON: Learning Regular Maps Through Inverse Consistency
Hastings Greer, Roland Kwitt, François-Xavier Vialard, Marc Niethammer ID-Reveal: Identity-Aware DeepFake Video Detection
Davide Cozzolino, Andreas Rössler, Justus Thies, Matthias Nießner, Luisa Verdoliva IDARTS: Interactive Differentiable Architecture Search
Song Xue, Runqi Wang, Baochang Zhang, Tian Wang, Guodong Guo, David Doermann IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID
Yongxing Dai, Jun Liu, Yifan Sun, Zekun Tong, Chi Zhang, Ling-Yu Duan ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon Image Harmonization with Transformer
Zonghui Guo, Dongsheng Guo, Haiyong Zheng, Zhaorui Gu, Bing Zheng, Junyu Dong Image Synthesis from Layout with Locality-Aware Mask Adaption
Zejian Li, Jingyu Wu, Immanuel Koh, Yongchuan Tang, Lingyun Sun Image Synthesis via Semantic Composition
Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
Nikhil Singh, Jeff Mentch, Jerry Ng, Matthew Beveridge, Iddo Drori iMAP: Implicit Mapping and Positioning in Real-Time
Edgar Sucar, Shikun Liu, Joseph Ortiz, Andrew J. Davison Impact of Aliasing on Generalization in Deep Convolutional Networks
Cristina Vasconcelos, Hugo Larochelle, Vincent Dumoulin, Rob Romijnders, Nicolas Le Roux, Ross Goroshin Improve Unsupervised Pretraining for Few-Label Transfer
Suichan Li, Dongdong Chen, Yinpeng Chen, Lu Yuan, Lei Zhang, Qi Chu, Bin Liu, Nenghai Yu Improving 3D Object Detection with Channel-Wise Transformer
Hualian Sheng, Sijia Cai, Yuan Liu, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao Improving Neural Network Efficiency via Post-Training Quantization with Adaptive Floating-Point
Fangxin Liu, Wenbo Zhao, Zhezhi He, Yanzhi Wang, Zongwu Wang, Changzhi Dai, Xiaoyao Liang, Li Jiang In Defense of Scene Graphs for Image Captioning
Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen iNAS: Integral NAS for Device-Aware Salient Object Detection
Yu-Chao Gu, Shang-Hua Gao, Xu-Sheng Cao, Peng Du, Shao-Ping Lu, Ming-Ming Cheng Incorporating Convolution Designs into Visual Transformers
Kun Yuan, Shaopeng Guo, Ziwei Liu, Aojun Zhou, Fengwei Yu, Wei Wu Inference of Black Hole Fluid-Dynamics from Sparse Interferometric Measurements
Aviad Levis, Daeyoung Lee, Joel A. Tropp, Charles F. Gammie, Katherine L. Bouman Inferring High-Resolution Traffic Accident Risk Maps Based on Satellite Imagery and GPS Trajectories
Songtao He, Mohammad Amin Sadeghi, Sanjay Chawla, Mohammad Alizadeh, Hari Balakrishnan, Samuel Madden Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image
Andrew Liu, Richard Tucker, Varun Jampani, Ameesh Makadia, Noah Snavely, Angjoo Kanazawa Influence Selection for Active Learning
Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He Instances as Queries
Yuxin Fang, Shusheng Yang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image
Baowen Zhang, Yangang Wang, Xiaoming Deng, Yinda Zhang, Ping Tan, Cuixia Ma, Hongan Wang Interpretable Visual Reasoning via Induced Symbolic Space
Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong, Wen-mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents
Shivansh Patel, Saim Wani, Unnat Jain, Alexander G. Schwing, Svetlana Lazebnik, Manolis Savva, Angel X. Chang Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer
Haoyu Chen, Hao Tang, Henglin Shi, Wei Peng, Nicu Sebe, Guoying Zhao Invisible Backdoor Attack with Sample-Specific Triggers
Yuezun Li, Yiming Li, Baoyuan Wu, Longkang Li, Ran He, Siwei Lyu Is Pseudo-LiDAR Needed for Monocular 3D Object Detection?
Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon ISD: Self-Supervised Learning by Iterative Similarity Distillation
Ajinkya Tejankar, Soroush Abbasi Koohpayegani, Vipin Pillai, Paolo Favaro, Hamed Pirsiavash Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition
Ayan Kumar Bhunia, Aneeshan Sain, Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Yi-Zhe Song Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid Keep CALM and Improve Visual Feature Attribution
Jae Myung Kim, Junsuk Choe, Zeynep Akata, Seong Joon Oh Kernel Methods in Hyperbolic Spaces
Pengfei Fang, Mehrtash Harandi, Lars Petersson Keypoint Communities
Duncan Zauss, Sven Kreiss, Alexandre Alahi KoDF: A Large-Scale Korean DeepFake Detection Dataset
Patrick Kwon, Jaeseong You, Gyuhyeon Nam, Sungwoo Park, Gyeongsu Chae Labels4Free: Unsupervised Segmentation Using StyleGAN
Rameen Abdal, Peihao Zhu, Niloy J. Mitra, Peter Wonka LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation
Inkyu Shin, Dong-Jin Kim, Jae Won Cho, Sanghyun Woo, Kwanyong Park, In So Kweon LaLaLoc: Latent Layout Localisation in Dynamic, Unvisited Environments
Henry Howard-Jenkins, Jose-Raul Ruiz-Sarmiento, Victor Adrian Prisacariu Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism
Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu LapsCore: Language-Guided Person Search via Color Reasoning
Yushuang Wu, Zizheng Yan, Xiaoguang Han, Guanbin Li, Changqing Zou, Shuguang Cui Large Scale Interactive Motion Forecasting for Autonomous Driving: The Waymo Open Motion Dataset
Scott Ettinger, Shuyang Cheng, Benjamin Caine, Chenxi Liu, Hang Zhao, Sabeek Pradhan, Yuning Chai, Ben Sapp, Charles R. Qi, Yin Zhou, Zoey Yang, Aurélien Chouard, Pei Sun, Jiquan Ngiam, Vijay Vasudevan, Alexander McCauley, Jonathon Shlens, Dragomir Anguelov Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm Under Mixed Illumination
Dongyoung Kim, Jinwoo Kim, Seonghyeon Nam, Dongwoo Lee, Yeonkyung Lee, Nahyup Kang, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han, Seon Joo Kim Latent Transformations via NeuralODEs for GAN-Based Image Editing
Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko LayoutTransformer: Layout Generation and Completion with Self-Attention
Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry S. Davis, Vijay Mahadevan, Abhinav Shrivastava Learn-to-Race: A Multimodal Control Environment for Autonomous Racing
James Herman, Jonathan Francis, Siddha Ganju, Bingqing Chen, Anirudh Koul, Abhinav Gupta, Alexey Skabelkin, Ivan Zhukov, Max Kumskoy, Eric Nyberg Learned Spatial Representations for Few-Shot Talking-Head Synthesis
Moustafa Meshry, Saksham Suri, Larry S. Davis, Abhinav Shrivastava Learning a Single Network for Scale-Arbitrary Super-Resolution
Longguang Wang, Yingqian Wang, Zaiping Lin, Jungang Yang, Wei An, Yulan Guo Learning Compatible Embeddings
Qiang Meng, Chixiang Zhang, Xiaoqiang Xu, Feng Zhou Learning Cross-Modal Contrastive Features for Video Domain Adaptation
Donghyun Kim, Yi-Hsuan Tsai, Bingbing Zhuang, Xiang Yu, Stan Sclaroff, Kate Saenko, Manmohan Chandraker Learning Dual Priors for JPEG Compression Artifacts Removal
Xueyang Fu, Xi Wang, Aiping Liu, Junwei Han, Zheng-Jun Zha Learning Efficient Photometric Feature Transform for Multi-View Stereo
Kaizhang Kang, Cihui Xie, Ruisheng Zhu, Xiaohe Ma, Ping Tan, Hongzhi Wu, Kun Zhou Learning Hierarchical Graph Neural Networks for Image Clustering
Yifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto Learning Motion Priors for 4D Human Body Capture in 3D Scenes
Siwei Zhang, Yan Zhang, Federica Bogo, Marc Pollefeys, Siyu Tang Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering
Bangbang Yang, Yinda Zhang, Yinghao Xu, Yijin Li, Han Zhou, Hujun Bao, Guofeng Zhang, Zhaopeng Cui Learning of Visual Relations: The Devil Is in the Tails
Alakh Desai, Tz-Ying Wu, Subarna Tripathi, Nuno Vasconcelos Learning Rare Category Classifiers on a Tight Labeling Budget
Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision
Zhilu Zhang, Haolin Wang, Ming Liu, Ruohao Wang, Jiawei Zhang, Wangmeng Zuo Learning Realistic Human Reposing Using Cyclic Self-Supervision with 3D Shape, Pose, and Appearance Consistency
Soubhik Sanyal, Alex Vorobiov, Timo Bolkart, Matthew Loper, Betty Mohler, Larry S. Davis, Javier Romero, Michael J. Black Learning Self-Consistency for Deepfake Detection
Tianchen Zhao, Xiang Xu, Mingze Xu, Hui Ding, Yuanjun Xiong, Wei Xia Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation
Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu Learning Spatio-Temporal Transformer for Visual Tracking
Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu Learning to Adversarially Blur Visual Object Tracking
Qing Guo, Ziyi Cheng, Felix Juefei-Xu, Lei Ma, Xiaofei Xie, Yang Liu, Jianjun Zhao Learning to Cut by Watching Movies
Alejandro Pardo, Fabian Caba, Juan Léon Alcázar, Ali K. Thabet, Bernard Ghanem Learning to Diversify for Single Domain Generalization
Zijian Wang, Yadan Luo, Ruihong Qiu, Zi Huang, Mahsa Baktashmotlagh Learning to Drive from a World on Rails
Dian Chen, Vladlen Koltun, Philipp Krähenbühl Learning to Estimate Hidden Motions with Global Motion Aggregation
Shihao Jiang, Dylan Campbell, Yao Lu, Hongdong Li, Richard Hartley Learning to Hallucinate Examples from Extrinsic and Intrinsic Supervision
Liangke Gui, Adrien Bardes, Ruslan Salakhutdinov, Alexander Hauptmann, Martial Hebert, Yu-Xiong Wang Learning to Know Where to See: A Visibility-Aware Approach for Occluded Person Re-Identification
Jinrui Yang, Jiawei Zhang, Fufu Yu, Xinyang Jiang, Mengdan Zhang, Xing Sun, Ying-Cong Chen, Wei-Shi Zheng Learning to Match Features with Seeded Graph Matching Network
Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data
Abdullah Abuolaim, Mauricio Delbracio, Damien Kelly, Michael S. Brown, Peyman Milanfar Learning to Stylize Novel Views
Hsin-Ping Huang, Hung-Yu Tseng, Saurabh Saini, Maneesh Singh, Ming-Hsuan Yang Learning to Track Objects from Unlabeled Videos
Jilai Zheng, Chao Ma, Houwen Peng, Xiaokang Yang Learning to Track with Object Permanence
Pavel Tokmakov, Jie Li, Wolfram Burgard, Adrien Gaidon Learning Unsupervised Metaformer for Anomaly Detection
Jhih-Ciang Wu, Ding-Jie Chen, Chiou-Shann Fuh, Tyng-Luh Liu Learning with Noisy Labels via Sparse Regularization
Xiong Zhou, Xianming Liu, Chenyang Wang, Deming Zhai, Junjun Jiang, Xiangyang Ji Learning with Privileged Tasks
Yuru Song, Zan Lou, Shan You, Erkun Yang, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang LeViT: A Vision Transformer in ConvNet's Clothing for Faster Inference
Benjamin Graham, Alaaeldin El-Nouby, Hugo Touvron, Pierre Stock, Armand Joulin, Hervé Jégou, Matthijs Douze Likelihood-Based Diverse Sampling for Trajectory Forecasting
Yecheng Jason Ma, Jeevana Priya Inala, Dinesh Jayaraman, Osbert Bastani Lipschitz Continuity Guided Knowledge Distillation
Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan Localized Simple Multiple Kernel K-Means
Xinwang Liu, Sihang Zhou, Li Liu, Chang Tang, Siwei Wang, Jiyuan Liu, Yi Zhang Location-Aware Single Image Reflection Removal
Zheng Dong, Ke Xu, Yin Yang, Hujun Bao, Weiwei Xu, Rynson W.H. Lau LOKI: Long Term and Key Intentions for Trajectory Prediction
Harshayu Girase, Haiming Gang, Srikanth Malla, Jiachen Li, Akira Kanehara, Karttikeya Mangalam, Chiho Choi Long-Term Temporally Consistent Unpaired Video Translation from Simulated Surgical 3D Data
Dominik Rivoir, Micha Pfeiffer, Reuben Docea, Fiona Kolbinger, Carina Riediger, Jürgen Weitz, Stefanie Speidel Looking Here or There? Gaze Following in 360-Degree Images
Yunhao Li, Wei Shen, Zhongpai Gao, Yucheng Zhu, Guangtao Zhai, Guodong Guo LookOut: Diverse Multi-Future Prediction and Planning for Self-Driving
Alexander Cui, Sergio Casas, Abbas Sadat, Renjie Liao, Raquel Urtasun LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning
Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories
Fait Poms, Vishnu Sarukkai, Ravi Teja Mullapudi, Nimit S. Sohoni, William R. Mark, Deva Ramanan, Kayvon Fatahalian LSD-StructureNet: Modeling Levels of Structural Detail in 3D Part Hierarchies
Dominic Roberts, Ara Danielyan, Hang Chu, Mani Golparvar-Fard, David Forsyth M3D-VTON: A Monocular-to-3D Virtual Try-on Network
Fuwei Zhao, Zhenyu Xie, Michael Kampffmeyer, Haoye Dong, Songfang Han, Tianxiang Zheng, Tao Zhang, Xiaodan Liang MAAS: Multi-Modal Assignation for Active Speaker Detection
Juan Léon Alcázar, Fabian Caba, Ali K. Thabet, Bernard Ghanem Making Higher Order MOT Scalable: An Efficient Approximate Solver for Lifted Disjoint Paths
Andrea Hornakova, Timo Kaiser, Paul Swoboda, Michal Rolinek, Bodo Rosenhahn, Roberto Henschel Manifold Alignment for Semantically Aligned Style Transfer
Jing Huo, Shiyin Jin, Wenbin Li, Jing Wu, Yu-Kun Lai, Yinghuan Shi, Yang Gao Matching in the Dark: A Dataset for Matching Image Pairs of Low-Light Scenes
Wenzheng Song, Masanori Suganuma, Xing Liu, Noriyuki Shimobayashi, Daisuke Maruta, Takayuki Okatani MBA-VO: Motion Blur Aware Visual Odometry
Peidong Liu, Xingxing Zuo, Viktor Larsson, Marc Pollefeys MDETR - Modulated Detection for End-to-End Multi-Modal Understanding
Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion ME-PCN: Point Completion Conditioned on Mask Emptiness
Bingchen Gong, Yinyu Nie, Yiqun Lin, Xiaoguang Han, Yizhou Yu Mean Shift for Self-Supervised Learning
Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash Memory-Augmented Dynamic Neural Relational Inference
Dong Gong, Frederic Z. Zhang, Javen Qinfeng Shi, Anton van den Hengel Mesh Graphormer
Kevin Lin, Lijuan Wang, Zicheng Liu MeshTalk: 3D Face Animation from Speech Using Cross-Modality Disentanglement
Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando de la Torre, Yaser Sheikh Meta Gradient Adversarial Attack
Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue, Shiguang Shan Meta Navigator: Search for a Good Adaptation Policy for Few-Shot Learning
Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning
Sungyong Baik, Janghoon Choi, Heewon Kim, Dohee Cho, Jaesik Min, Kyoung Mu Lee MicroNet: Improving Image Recognition with Extremely Low FLOPs
Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Lei Zhang, Nuno Vasconcelos MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis
Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee Mining Contextual Information Beyond Image for Semantic Segmentation
Zhenchao Jin, Tao Gong, Dongdong Yu, Qi Chu, Jian Wang, Changhu Wang, Jie Shao Mining Latent Classes for Few-Shot Segmentation
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields
Jonathan T. Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing
Yuhang Li, Feng Zhu, Ruihao Gong, Mingzhu Shen, Xin Dong, Fengwei Yu, Shaoqing Lu, Shi Gu Modulated Periodic Activations for Generalizable Local Functional Representations
Ishit Mehta, Michaël Gharbi, Connelly Barnes, Eli Shechtman, Ravi Ramamoorthi, Manmohan Chandraker Monocular, One-Stage, Regression of Multiple 3D People
Yu Sun, Qian Bao, Wu Liu, Yili Fu, Michael J. Black, Tao Mei MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao Motion Deblurring with Real Events
Fang Xu, Lei Yu, Bishan Wang, Wen Yang, Gui-Song Xia, Xu Jia, Zhendong Qiao, Jianzhuang Liu Motion Prediction Using Trajectory Cues
Zhenguang Liu, Pengxiang Su, Shuang Wu, Xuanjing Shen, Haipeng Chen, Yanbin Hao, Meng Wang Motion-Focused Contrastive Learning of Video Representations
Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?
Matteo Fabbri, Guillem Brasó, Gianluca Maugeri, Orcun Cetintas, Riccardo Gasparini, Aljoša Ošep, Simone Calderara, Laura Leal-Taixé, Rita Cucchiara Move2Hear: Active Audio-Visual Source Separation
Sagnik Majumder, Ziad Al-Halah, Kristen Grauman MT-ORL: Multi-Task Occlusion Relationship Learning
Panhe Feng, Qi She, Lei Zhu, Jiaxin Li, Lin Zhang, Zijian Feng, Changhu Wang, Chunpeng Li, Xuejing Kang, Anlong Ming Multi-Anchor Active Domain Adaptation for Semantic Segmentation
Munan Ning, Donghuan Lu, Dong Wei, Cheng Bian, Chenglang Yuan, Shuang Yu, Kai Ma, Yefeng Zheng Multi-Class Cell Detection Using Spatial Context Representation
Shahira Abousamra, David Belinsky, John Van Arnam, Felicia Allard, Eric Yee, Rajarsi Gupta, Tahsin Kurc, Dimitris Samaras, Joel Saltz, Chao Chen Multi-Echo LiDAR for 3D Object Detection
Yunze Man, Xinshuo Weng, Prasanna Kumar Sivakumar, Matthew O'Toole, Kris M. Kitani Multi-Modal Multi-Action Video Recognition
Zhensheng Shi, Ju Liang, Qianqian Li, Haiyong Zheng, Zhaorui Gu, Junyu Dong, Bing Zheng Multi-Scale Matching Networks for Semantic Correspondence
Dongyang Zhao, Ziyang Song, Zhenghao Ji, Gangming Zhao, Weifeng Ge, Yizhou Yu Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring
Senyou Deng, Wenqi Ren, Yanyang Yan, Tao Wang, Fenglong Song, Xiaochun Cao Multi-Task Self-Training for Learning General Representations
Golnaz Ghiasi, Barret Zoph, Ekin D. Cubuk, Quoc V. Le, Tsung-Yi Lin Multi-View 3D Reconstruction with Transformers
Dan Wang, Xinrui Cui, Xun Chen, Zhengxia Zou, Tianyang Shi, Septimiu Salcudean, Z. Jane Wang, Rabab Ward Multi-View Radar Semantic Segmentation
Arthur Ouaknine, Alasdair Newson, Patrick Pérez, Florence Tupin, Julien Rebut Multimodal Clustering Networks for Self-Supervised Learning from Unlabeled Videos
Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie Boggust, Rameswar Panda, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, Michael Picheny, Shih-Fu Chang Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images
Richard J. Chen, Ming Y. Lu, Wei-Hung Weng, Tiffany Y. Chen, Drew F.K. Williamson, Trevor Manz, Maha Shady, Faisal Mahmood Multimodal Knowledge Expansion
Zihui Xue, Sucheng Ren, Zhengqi Gao, Hang Zhao Multiresolution Deep Implicit Functions for 3D Shape Representation
Zhang Chen, Yinda Zhang, Kyle Genova, Sean Fanello, Sofien Bouaziz, Christian Häne, Ruofei Du, Cem Keskin, Thomas Funkhouser, Danhang Tang Multiscale Vision Transformers
Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, Christoph Feichtenhofer Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection
Ziteng Cui, Guo-Jun Qi, Lin Gu, Shaodi You, Zenghui Zhang, Tatsuya Harada MUSIQ: Multi-Scale Image Quality Transformer
Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang Mutual-Complementing Framework for Nuclei Detection and Segmentation in Pathology Image
Zunlei Feng, Zhonghua Wang, Xinchao Wang, Yining Mao, Thomas Li, Jie Lei, Yuexuan Wang, Mingli Song MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo
Anpei Chen, Zexiang Xu, Fuqiang Zhao, Xiaoshuai Zhang, Fanbo Xiang, Jingyi Yu, Hao Su NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization
Haoyue Bai, Fengwei Zhou, Lanqing Hong, Nanyang Ye, S.-H. Gary Chan, Zhenguo Li NASOA: Towards Faster Task-Oriented Online Fine-Tuning with a Zoo of Models
Hang Xu, Ning Kang, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li Naturalistic Physical Adversarial Patch for Object Detectors
Yu-Chih-Tuan Hu, Bo-Han Kung, Daniel Stanley Tan, Jun-Cheng Chen, Kai-Lung Hua, Wen-Huang Cheng NeRD: Neural Reflectance Decomposition from Image Collections
Mark Boss, Raphael Braun, Varun Jampani, Jonathan T. Barron, Ce Liu, Hendrik P.A. Lensch Nerfies: Deformable Neural Radiance Fields
Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Steven M. Seitz, Ricardo Martin-Brualla Neural Articulated Radiance Field
Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada Neural Photofit: Gaze-Based Mental Image Reconstruction
Florian Strohm, Ekta Sood, Sven Mayer, Philipp Müller, Mihai Bâce, Andreas Bulling Neural Radiance Flow for 4D View Synthesis and Video Processing
Yilun Du, Yinan Zhang, Hong-Xing Yu, Joshua B. Tenenbaum, Jiajun Wu Neural Strokes: Stylized Line Drawing of 3D Shapes
Difan Liu, Matthew Fisher, Aaron Hertzmann, Evangelos Kalogerakis NGC: A Unified Framework for Learning with Open-World Noisy Data
Zhi-Fan Wu, Tong Wei, Jianwen Jiang, Chaojie Mao, Mingqian Tang, Yu-Feng Li Normalized Human Pose Features for Human Action Video Alignment
Jingyuan Liu, Mingyi Shi, Qifeng Chen, Hongbo Fu, Chiew-Lan Tai NPMs: Neural Parametric Models for 3D Deformable Shapes
Pablo Palafox, Aljaž Božič, Justus Thies, Matthias Nießner, Angela Dai OadTR: Online Action Detection with Transformers
Xiang Wang, Shiwei Zhang, Zhiwu Qing, Yuanjie Shao, Zhengrong Zuo, Changxin Gao, Nong Sang Object Tracking by Jointly Exploiting Frame and Event Domain
Jiqing Zhang, Xin Yang, Yingkai Fu, Xiaopeng Wei, Baocai Yin, Bo Dong Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID
Peixian Chen, Wenfeng Liu, Pingyang Dai, Jianzhuang Liu, Qixiang Ye, Mingliang Xu, Qi’an Chen, Rongrong Ji ODAM: Object Detection, Association, and Mapping Using Posed RGB Video
Kejie Li, Daniel DeTone, Yu Fan Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard Newcombe Omni-GAN: On the Secrets of cGANs and Beyond
Peng Zhou, Lingxi Xie, Bingbing Ni, Cong Geng, Qi Tian Omniscient Video Super-Resolution
Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, Jiayi Ma On Compositions of Transformations in Contrastive Self-Supervised Learning
Mandela Patrick, Yuki M. Asano, Polina Kuznetsova, Ruth Fong, João F. Henriques, Geoffrey Zweig, Andrea Vedaldi On Feature Decorrelation in Self-Supervised Learning
Tianyu Hua, Wenxiao Wang, Zihui Xue, Sucheng Ren, Yue Wang, Hang Zhao On Generating Transferable Targeted Perturbations
Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Fatih Porikli On the Hidden Treasure of Dialog in Video Question Answering
Deniz Engin, François Schnitzler, Ngoc Q. K. Duong, Yannis Avrithis Once Quantization-Aware Training: High Performance Extremely Low-Bit Architecture Search
Mingzhu Shen, Feng Liang, Ruihao Gong, Yuhang Li, Chuming Li, Chen Lin, Fengwei Yu, Junjie Yan, Wanli Ouyang One-Pass Multi-View Clustering for Large-Scale Data
Jiyuan Liu, Xinwang Liu, Yuexiang Yang, Li Liu, Siqi Wang, Weixuan Liang, Jiangyong Shi Online Knowledge Distillation for Efficient Pose Estimation
Zheng Li, Jingwen Ye, Mingli Song, Ying Huang, Zhigeng Pan ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
Daniela Massiceti, Luisa Zintgraf, John Bronskill, Lida Theodorou, Matthew Tobias Harris, Edward Cutrell, Cecily Morrison, Katja Hofmann, Simone Stumpf Oriented R-CNN for Object Detection
Xingxing Xie, Gong Cheng, Jiabao Wang, Xiwen Yao, Junwei Han Orthogonal Projection Loss
Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman Khan, Fahad Shahbaz Khan Orthographic-Perspective Epipolar Geometry
Viktor Larsson, Marc Pollefeys, Magnus Oskarsson Overfitting the Data: Compact Neural Video Delivery via Content-Aware Feature Modulation
Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen, Chuang Zhang, Ming Wu P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching
Bing Wang, Changhao Chen, Zhaopeng Cui, Jie Qin, Chris Xiaoxuan Lu, Zhengdi Yu, Peijun Zhao, Zhen Dong, Fan Zhu, Niki Trigoni, Andrew Markham Paint Transformer: Feed Forward Neural Painting with Stroke Prediction
Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang Painting from Part
Dongsheng Guo, Haoru Zhao, Yunhao Cheng, Haiyong Zheng, Zhaorui Gu, Bing Zheng Panoptic Narrative Grounding
Cristina González, Nicolás Ayobi, Isabela Hernández, José Hernández, Jordi Pont-Tuset, Pablo Arbeláez Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation
Yunhang Shen, Liujuan Cao, Zhiwei Chen, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji Parallel Multi-Resolution Fusion Network for Image Inpainting
Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang Parametric Contrastive Learning
Jiequan Cui, Zhisheng Zhong, Shu Liu, Bei Yu, Jiaya Jia PARE: Part Attention Regressor for 3D Human Body Estimation
Muhammed Kocabas, Chun-Hao P. Huang, Otmar Hilliges, Michael J. Black Parsing Table Structures in the Wild
Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia Partner-Assisted Learning for Few-Shot Image Classification
Jiawei Ma, Hanchen Xie, Guangxing Han, Shih-Fu Chang, Aram Galstyan, Wael Abd-Almageed Pathdreamer: A World Model for Indoor Navigation
Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation
Zhuangwei Zhuang, Rong Li, Kui Jia, Qicheng Wang, Yuanqing Li, Mingkui Tan Personalized Image Semantic Segmentation
Yu Zhang, Chang-Bin Zhang, Peng-Tao Jiang, Ming-Ming Cheng, Feng Mao Physics-Based Human Motion Estimation and Synthesis from Videos
Kevin Xie, Tingwu Wang, Umar Iqbal, Yunrong Guo, Sanja Fidler, Florian Shkurti Physics-Enhanced Machine Learning for Virtual Fluorescence Microscopy
Colin L. Cooke, Fanjie Kong, Amey Chaware, Kevin C. Zhou, Kanghyun Kim, Rong Xu, D. Michael Ando, Samuel J. Yang, Pavan Chandra Konda, Roarke Horstmeyer PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation
Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
Yuanyi Zhong, Bodi Yuan, Hong Wu, Zhiqiang Yuan, Jian Peng, Yu-Xiong Wang Pixel Difference Networks for Efficient Edge Detection
Zhuo Su, Wenzhe Liu, Zitong Yu, Dewen Hu, Qing Liao, Qi Tian, Matti Pietikäinen, Li Liu Pixel-Perfect Structure-from-Motion with Featuremetric Refinement
Philipp Lindenberger, Paul-Edouard Sarlin, Viktor Larsson, Marc Pollefeys Planar Surface Reconstruction from Sparse Views
Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey PlenOctrees for Real-Time Rendering of Neural Radiance Fields
Alex Yu, Ruilong Li, Matthew Tancik, Hao Li, Ren Ng, Angjoo Kanazawa Point Cloud Augmentation with Weighted Local Transformations
Sihyeon Kim, Sanghyeok Lee, Dasol Hwang, Jaewon Lee, Seong Jae Hwang, Hyunwoo J. Kim Point Transformer
Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip H.S. Torr, Vladlen Koltun Point-Based Modeling of Human Clothing
Ilya Zakharkin, Kirill Mazur, Artur Grigorev, Victor Lempitsky Point-Set Distances for Learning Representations of 3D Point Clouds
Trung Nguyen, Quang-Hieu Pham, Tam Le, Tung Pham, Nhat Ho, Binh-Son Hua PointBA: Towards Backdoor Attacks in 3D Point Cloud
Xinke Li, Zhirui Chen, Yue Zhao, Zekun Tong, Yabang Zhao, Andrew Lim, Joey Tianyi Zhou Polarimetric Helmholtz Stereopsis
Yuqi Ding, Yu Ji, Mingyuan Zhou, Sing Bing Kang, Jinwei Ye Poly-NL: Linear Complexity Non-Local Layers with 3rd Order Polynomials
Francesca Babiloni, Ioannis Marras, Filippos Kokkinos, Jiankang Deng, Grigorios Chrysos, Stefanos Zafeiriou PR-Net: Preference Reasoning for Personalized Video Highlight Detection
Runnan Chen, Penghao Zhou, Wenzhe Wang, Nenglun Chen, Pai Peng, Xing Sun, Wenping Wang Practical Relative Order Attack in Deep Ranking
Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Yinghui Xu, Nanning Zheng, Gang Hua Predicting with Confidence on Unseen Distributions
Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell, Ludwig Schmidt Predictive Feature Learning for Future Segmentation Prediction
Zihang Lin, Jiangxin Sun, Jian-Fang Hu, Qizhi Yu, Jian-Huang Lai, Wei-Shi Zheng Pri3D: Can 3D Priors Help 2D Representation Learning?
Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner Probabilistic Modeling for Human Mesh Recovery
Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining
Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang Progressive Correspondence Pruning by Consensus Learning
Chen Zhao, Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li, Mathieu Salzmann Provably Approximated Point Cloud Registration
Ibrahim Jubran, Alaa Maalouf, Ron Kimmel, Dan Feldman PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop
Hongwen Zhang, Yating Tian, Xinchi Zhou, Wanli Ouyang, Yebin Liu, Limin Wang, Zhenan Sun Pyramid Architecture Search for Real-Time Image Deblurring
Xiaobin Hu, Wenqi Ren, Kaicheng Yu, Kaihao Zhang, Xiaochun Cao, Wei Liu, Bjoern Menze Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions
Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao Q-Match: Iterative Shape Matching via Quantum Annealing
Marcel Seelbach Benkner, Zorah Lähner, Vladislav Golyanik, Christof Wunderlich, Christian Theobalt, Michael Moeller RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
Jiachen Li, Fan Yang, Hengbo Ma, Srikanth Malla, Masayoshi Tomizuka, Chiho Choi Ranking Models in Unlabeled New Environments
Xiaoxiao Sun, Yunzhong Hou, Weijian Deng, Hongdong Li, Liang Zheng RDI-Net: Relational Dynamic Inference Networks
Huanyu Wang, Songyuan Li, Shihao Su, Zequn Qin, Xi Li Real-Time Image Enhancer via Learnable Spatial-Aware 3D Lookup Tables
Tao Wang, Yong Li, Jingyang Peng, Yipeng Ma, Xian Wang, Fenglong Song, Youliang Yan Real-Time Instance Segmentation with Discriminative Orientation Maps
Wentao Du, Zhiyu Xiang, Shuya Chen, Chengyu Qiao, Yiman Chen, Tingming Bai Real-Time Video Inference on Edge Devices via Adaptive Model Streaming
Mehrdad Khani, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh RECALL: Replay-Based Continual Learning in Semantic Segmentation
Andrea Maracani, Umberto Michieli, Marco Toldo, Pietro Zanuttigh Reconstructing Hand-Object Interactions in the Wild
Zhe Cao, Ilija Radosavovic, Angjoo Kanazawa, Jitendra Malik ReCU: Reviving the Dead Weights in Binary Neural Networks
Zihan Xu, Mingbao Lin, Jianzhuang Liu, Jie Chen, Ling Shao, Yue Gao, Yonghong Tian, Rongrong Ji ReDAL: Region-Based and Diversity-Aware Active Learning for Point Cloud Semantic Segmentation
Tsung-Han Wu, Yueh-Cheng Liu, Yu-Kai Huang, Hsin-Ying Lee, Hung-Ting Su, Ping-Chia Huang, Winston H. Hsu Refining Activation Downsampling with SoftPool
Alexandros Stergiou, Ronald Poppe, Grigorios Kalliatakis Region Similarity Representation Learning
Tete Xiao, Colorado J Reed, Xiaolong Wang, Kurt Keutzer, Trevor Darrell Relational Embedding for Few-Shot Classification
Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho Removing Adversarial Noise in Class Activation Feature Space
Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu RePOSE: Fast 6d Object Pose Refinement via Deep Texture Rendering
Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani Representative Color Transform for Image Enhancement
Hanul Kim, Su-Min Choi, Chang-Su Kim, Yeong Jun Koh ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting
Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding Rethinking 360deg Image Visual Attention Modelling with Unsupervised Learning.
Yasser Abdelaziz Dahou Djilali, Tarun Krishna, Kevin McGuinness, Noel E. O’Connor Rethinking Coarse-to-Fine Approach in Single Image Deblurring
Sung-Jin Cho, Seo-Won Ji, Jun-Pyo Hong, Seung-Won Jung, Sung-Jea Ko Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework
Qingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu Rethinking Deep Image Prior for Denoising
Yeonsik Jo, Se Young Chun, Jonghyun Choi Rethinking Spatial Dimensions of Vision Transformers
Byeongho Heo, Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Junsuk Choe, Seong Joon Oh Rethinking the Truly Unsupervised Image-to-Image Translation
Kyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung Shim RetrievalFuse: Neural 3D Scene Reconstruction with a Database
Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai Revealing the Reciprocal Relations Between Self-Supervised Stereo and Monocular Depth Estimation
Zhi Chen, Xiaoqing Ye, Wei Yang, Zhenbo Xu, Xiao Tan, Zhikang Zou, Errui Ding, Xinming Zhang, Liusheng Huang Revisiting Stereo Depth Estimation from a Sequence-to-Sequence Perspective with Transformers
Zhaoshuo Li, Xingtong Liu, Nathan Drenkow, Andy Ding, Francis X. Creighton, Russell H. Taylor, Mathias Unberath RFNet: Recurrent Forward Network for Dense Point Cloud Completion
Tianxin Huang, Hao Zou, Jinhao Cui, Xuemeng Yang, Mengmeng Wang, Xiangrui Zhao, Jiangning Zhang, Yi Yuan, Yifan Xu, Yong Liu RGB-D Saliency Detection via Cascaded Mutual Information Minimization
Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao Road Anomaly Detection by Partial Image Reconstruction with Segmentation Coupling
Tomas Vojir, Tomáš Šipka, Rahaf Aljundi, Nikolay Chumerin, Daniel Olmeda Reino, Jiri Matas Robust 2D/3D Vehicle Parsing in Arbitrary Camera Views for CVIS
Hui Miao, Feixiang Lu, Zongdai Liu, Liangjun Zhang, Dinesh Manocha, Bin Zhou Robust Object Detection via Instance-Level Temporal Cycle Confusion
Xin Wang, Thomas E. Huang, Benlin Liu, Fisher Yu, Xiaolong Wang, Joseph E. Gonzalez, Trevor Darrell RobustNav: Towards Benchmarking Robustness in Embodied Navigation
Prithvijit Chattopadhyay, Judy Hoffman, Roozbeh Mottaghi, Aniruddha Kembhavi Robustness and Generalization via Generative Adversarial Training
Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge Belongie, Ser-Nam Lim Robustness Certification for Point Cloud Models
Tobias Lorenz, Anian Ruoss, Mislav Balunović, Gagandeep Singh, Martin Vechev Robustness via Cross-Domain Ensembles
Teresa Yeo, Oğuzhan Fatih Kar, Amir Zamir SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam
Yonggan Fu, Yang Zhang, Yue Wang, Zhihan Lu, Vivek Boominathan, Ashok Veeraraghavan, Yingyan Lin Saliency-Associated Object Tracking
Zikun Zhou, Wenjie Pei, Xin Li, Hongpeng Wang, Feng Zheng, Zhenyu He Salient Object Ranking with Position-Preserved Attention
Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He Sat2Vid: Street-View Panoramic Video Synthesis from a Single Satellite Image
Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Rongjun Qin, Marc Pollefeys, Martin R. Oswald Scalable Vision Transformers with Hierarchical Pooling
Zizheng Pan, Bohan Zhuang, Jing Liu, Haoyu He, Jianfei Cai Scaling Semantic Segmentation Beyond 1k Classes on a Single GPU
Shipra Jain, Danda Pani Paudel, Martin Danelljan, Luc Van Gool Scene Context-Aware Salient Object Detection
Avishek Siris, Jianbo Jiao, Gary K.L. Tam, Xianghua Xie, Rynson W.H. Lau Scene Synthesis via Uncertainty-Driven Attribute Synchronization
Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, Qixing Huang SCOUTER: Slot Attention-Based Classifier for Explainable Image Recognition
Liangzhi Li, Bowen Wang, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara Scribble-Supervised Semantic Segmentation Inference
Jingshan Xu, Chuanwei Zhou, Zhen Cui, Chunyan Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang Searching for Controllable Image Restoration Networks
Heewon Kim, Sungyong Baik, Myungsub Choi, Janghoon Choi, Kyoung Mu Lee Searching for Two-Stream Models in Multivariate Space for Video Recognition
Xinyu Gong, Heng Wang, Mike Zheng Shou, Matt Feiszli, Zhangyang Wang, Zhicheng Yan Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data
Oscar Mañas, Alexandre Lacoste, Xavier Giró-i-Nieto, David Vazquez, Pau Rodríguez Seeking Similarities over Differences: Similarity-Based Domain Alignment for Adaptive Object Detection
Farzaneh Rezaeianaran, Rakshith Shetty, Rahaf Aljundi, Daniel Olmeda Reino, Shanshan Zhang, Bernt Schiele Segmentation-Grounded Scene Graph Generation
Siddhesh Khandelwal, Mohammed Suhail, Leonid Sigal Segmenter: Transformer for Semantic Segmentation
Robin Strudel, Ricardo Garcia, Ivan Laptev, Cordelia Schmid Self-Born Wiring for Neural Trees
Ying Chen, Feng Mao, Jie Song, Xinchao Wang, Huiqiong Wang, Mingli Song Self-Calibrating Neural Radiance Fields
Yoonwoo Jeong, Seokjun Ahn, Christopher Choy, Anima Anandkumar, Minsu Cho, Jaesik Park Self-Conditioned Probabilistic Learning of Video Rescaling
Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation
Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao Self-Regulation for Semantic Segmentation
Dong Zhang, Hanwang Zhang, Jinhui Tang, Xian-Sheng Hua, Qianru Sun Self-Supervised Object Detection via Generative Image Synthesis
Siva Karthik Mustikovela, Shalini De Mello, Aayush Prakash, Umar Iqbal, Sifei Liu, Thu Nguyen-Phuoc, Carsten Rother, Jan Kautz Self-Supervised Real-to-Sim Scene Generation
Aayush Prakash, Shoubhik Debnath, Jean-Francois Lafleche, Eric Cameracci, Gavriel State, Stan Birchfield, Marc T. Law Self-Supervised Vessel Segmentation via Adversarial Learning
Yuxin Ma, Yang Hua, Hanming Deng, Tao Song, Hao Wang, Zhengui Xue, Heng Cao, Ruhui Ma, Haibing Guan Self-Supervised Video Object Segmentation by Motion Grouping
Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, Weidi Xie SeLFVi: Self-Supervised Light-Field Video Reconstruction from Stereo Video
Prasan Shedligeri, Florian Schiffers, Sushobhan Ghosh, Oliver Cossairt, Kaushik Mitra Semantic Concentration for Domain Adaptation
Shuang Li, Mixue Xie, Fangrui Lv, Chi Harold Liu, Jian Liang, Chen Qin, Wei Li Semantic Diversity Learning for Zero-Shot Multi-Label Classification
Avi Ben-Cohen, Nadav Zamir, Emanuel Ben-Baruch, Itamar Friedman, Lihi Zelnik-Manor Semantically Coherent Out-of-Distribution Detection
Jingkang Yang, Haoqi Wang, Litong Feng, Xiaopeng Yan, Huabin Zheng, Wayne Zhang, Ziwei Liu Semantics Disentangling for Generalized Zero-Shot Learning
Zhi Chen, Yadan Luo, Ruihong Qiu, Sen Wang, Zi Huang, Jingjing Li, Zheng Zhang SemIE: Semantically-Aware Image Extrapolation
Bholeshwar Khurana, Soumya Ranjan Dash, Abhishek Bhatia, Aniruddha Mahapatra, Hrituraj Singh, Kuldeep Kulkarni Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
Hongjun Chen, Jinbao Wang, Hong Cai Chen, Xiantong Zhen, Feng Zheng, Rongrong Ji, Ling Shao Sensor-Guided Optical Flow
Matteo Poggi, Filippo Aleotti, Stefano Mattoccia Separable Flow: Learning Motion Cost Volumes for Optical Flow Estimation
Feihu Zhang, Oliver J. Woodford, Victor Adrian Prisacariu, Philip H.S. Torr Shape Self-Correction for Unsupervised Point Cloud Understanding
Ye Chen, Jinxian Liu, Bingbing Ni, Hang Wang, Jiancheng Yang, Ning Liu, Teng Li, Qi Tian ShapeConv: Shape-Aware Convolutional Layer for Indoor RGB-D Semantic Segmentation
Jinming Cao, Hanchao Leng, Dani Lischinski, Daniel Cohen-Or, Changhe Tu, Yangyan Li SimROD: A Simple Adaptation Method for Robust Object Detection
Rindra Ramamonjison, Amin Banitalebi-Dehkordi, Xinyu Kang, Xiaolong Bai, Yong Zhang SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks
Zoe Landgraf, Raluca Scona, Tristan Laidlow, Stephen James, Stefan Leutenegger, Andrew J. Davison Single View Physical Distance Estimation Using Human Pose
Xiaohan Fei, Henry Wang, Lin Lee Cheong, Xiangyu Zeng, Meng Wang, Joseph Tighe Single-Shot Hyperspectral-Depth Imaging with Learned Diffractive Optics
Seung-Hwan Baek, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery
Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang Sketch Your Own GAN
Sheng-Yu Wang, David Bau, Jun-Yan Zhu SketchLattice: Latticed Representation for Sketch Manipulation
Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song SLIDE: Single Image 3D Photography with Soft Layering and Depth-Aware Inpainting
Varun Jampani, Huiwen Chang, Kyle Sargent, Abhishek Kar, Richard Tucker, Michael Krainin, Dominik Kaeser, William T. Freeman, David Salesin, Brian Curless, Ce Liu SLIM: Self-Supervised LiDAR Scene Flow and Motion Segmentation
Stefan Andreas Baur, David Josef Emmerichs, Frank Moosmann, Peter Pinggera, Björn Ommer, Andreas Geiger SO-Pose: Exploiting Self-Occlusion for Direct 6d Pose Estimation
Yan Di, Fabian Manhardt, Gu Wang, Xiangyang Ji, Nassir Navab, Federico Tombari Solving Inefficiency of Self-Supervised Representation Learning
Guangrun Wang, Keze Wang, Guangcong Wang, Philip H.S. Torr, Liang Lin SOTR: Segmenting Objects with Transformers
Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li Space-Time Crop & Attend: Improving Cross-Modal Video Representation Learning
Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques Sparse Needlets for Lighting Estimation with Spherical Transport Loss
Fangneng Zhan, Changgong Zhang, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao Spatial Uncertainty-Aware Semi-Supervised Crowd Counting
Yanda Meng, Hongrun Zhang, Yitian Zhao, Xiaoyun Yang, Xuesheng Qian, Xiaowei Huang, Yalin Zheng Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Yuren Cong, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn, Michael Ying Yang Spatially-Adaptive Image Restoration Using Distortion-Guided Networks
Kuldeep Purohit, Maitreya Suin, A. N. Rajagopalan, Vishnu Naresh Boddeti Spatio-Temporal Representation Factorization for Video-Based Person Re-Identification
Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu SPEC: Seeing People in the Wild with an Estimated Camera
Muhammed Kocabas, Chun-Hao P. Huang, Joachim Tesch, Lea Müller, Otmar Hilliges, Michael J. Black Specificity-Preserving RGB-D Saliency Detection
Tao Zhou, Huazhu Fu, Geng Chen, Yi Zhou, Deng-Ping Fan, Ling Shao Square Root Marginalization for Sliding-Window Bundle Adjustment
Nikolaus Demmel, David Schubert, Christiane Sommer, Daniel Cremers, Vladyslav Usenko SS-IL: Separated SoftMax for Incremental Learning
Hongjoon Ahn, Jihwan Kwak, Subin Lim, Hyeonsu Bang, Hyojun Kim, Taesup Moon SSH: A Self-Supervised Framework for Image Harmonization
Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang Stacked Homography Transformations for Multi-View Pedestrian Detection
Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan Stochastic Scene-Aware Motion Prediction
Mohamed Hassan, Duygu Ceylan, Ruben Villegas, Jun Saito, Jimei Yang, Yi Zhou, Michael J. Black Stochastic Transformer Networks with Linear Competing Units: Application to End-to-End SL Translation
Andreas Voskou, Konstantinos P. Panousis, Dimitrios Kosmopoulos, Dimitris N. Metaxas, Sotirios Chatzis STRIVE: Scene Text Replacement in Videos
B G Vijay Kumar, Jeyasri Subramanian, Varnith Chordia, Eugene Bart, Shaobo Fang, Kelly Guan, Raja Bala Structure-Preserving Deraining with Residue Channel Prior Guidance
Qiaosi Yi, Juncheng Li, Qinyan Dai, Faming Fang, Guixu Zhang, Tieyong Zeng StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski Super Resolve Dynamic Scene from Continuous Spike Streams
Jing Zhao, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang Superpoint Network for Point Cloud Oversegmentation
Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, Jian Yang Support-Set Based Cross-Supervision for Video Grounding
Xinpeng Ding, Nannan Wang, Shiwei Zhang, De Cheng, Xiaomeng Li, Ziyuan Huang, Mingqian Tang, Xinbo Gao Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo Synchronization of Group-Labelled Multi-Graphs
Andrea Porfiri Dal Cin, Luca Magri, Federica Arrigoni, Andrea Fusiello, Giacomo Boracchi SynFace: Face Recognition with Synthetic Data
Haibo Qiu, Baosheng Yu, Dihong Gong, Zhifeng Li, Wei Liu, Dacheng Tao Synthesis of Compositional Animations from Textual Descriptions
Anindita Ghosh, Noshaba Cheema, Cennet Oguz, Christian Theobalt, Philipp Slusallek Synthesized Feature Based Few-Shot Class-Incremental Learning on a Mixture of Subspaces
Ali Cheraghian, Shafin Rahman, Sameera Ramasinghe, Pengfei Fang, Christian Simon, Lars Petersson, Mehrtash Harandi Talk-to-Edit: Fine-Grained Facial Editing via Dialog
Yuming Jiang, Ziqi Huang, Xingang Pan, Chen Change Loy, Ziwei Liu TAM: Temporal Adaptive Module for Video Recognition
Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, Tong Lu Task Switching Network for Multi-Task Learning
Guolei Sun, Thomas Probst, Danda Pani Paudel, Nikola Popović, Menelaos Kanakis, Jagruti Patel, Dengxin Dai, Luc Van Gool TeachText: CrossModal Generalized Distillation for Text-Video Retrieval
Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu, Hailin Jin, Andrew Zisserman, Samuel Albanie, Yang Liu TempNet: Online Semantic Segmentation on Large-Scale Point Cloud Series
Yunsong Zhou, Hongzi Zhu, Chunqin Li, Tiankai Cui, Shan Chang, Minyi Guo Temporal Action Detection with Multi-Level Supervision
Baifeng Shi, Qi Dai, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu Temporal-Wise Attention Spiking Neural Networks for Event Streams Classification
Man Yao, Huanhuan Gao, Guangshe Zhao, Dingheng Wang, Yihan Lin, Zhaoxu Yang, Guoqi Li Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases
Jan Bednarik, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal Fua, Noam Aigerman THDA: Treasure Hunt Data Augmentation for Semantic Navigation
Oleksandr Maksymets, Vincent Cartillier, Aaron Gokaslan, Erik Wijmans, Wojciech Galuba, Stefan Lee, Dhruv Batra The Functional Correspondence Problem
Zihang Lai, Senthil Purushwalkam, Abhinav Gupta The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kadavath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, Dawn Song, Jacob Steinhardt, Justin Gilmer The Power of Points for Modeling Humans in Clothing
Qianli Ma, Jinlong Yang, Siyu Tang, Michael J. Black The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu THUNDR: Transformer-Based 3D Human Reconstruction with Markers
Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu TMCOSS: Thresholded Multi-Criteria Online Subset Selection for Data-Efficient Autonomous Driving
Soumi Das, Harikrishna Patibandla, Suparna Bhattacharya, Kshounis Bera, Niloy Ganguly, Sourangshu Bhattacharya TokenPose: Learning Keypoint Tokens for Human Pose Estimation
Yanjie Li, Shoukui Zhang, Zhicheng Wang, Sen Yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li Yuan, Yunpeng Chen, Tao Wang, Weihao Yu, Yujun Shi, Zi-Hang Jiang, Francis E.H. Tay, Jiashi Feng, Shuicheng Yan TOOD: Task-Aligned One-Stage Object Detection
Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang Toward a Visual Concept Vocabulary for GAN Latent Space
Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba Toward Spatially Unbiased Generative Models
Jooyoung Choi, Jungbeom Lee, Yonghyun Jeong, Sungroh Yoon Towards a Universal Model for Cross-Dataset Crowd Counting
Zhiheng Ma, Xiaopeng Hong, Xing Wei, Yunfeng Qiu, Yihong Gong Towards Discovery and Attribution of Open-World GAN Generated Images
Sharath Girish, Saksham Suri, Sai Saketh Rambhatla, Abhinav Shrivastava Towards Efficient Graph Convolutional Networks for Point Cloud Handling
Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory S. Chirikjian, Luc Van Gool Towards Face Encryption by Generating Adversarial Identity Masks
Xiao Yang, Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu, Yuefeng Chen, Hui Xue Towards Learning Spatially Discriminative Feature Representations
Chaofei Wang, Jiayu Xiao, Yizeng Han, Qisen Yang, Shiji Song, Gao Huang Towards Memory-Efficient Neural Networks via Multi-Level in Situ Generation
Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Mingjie Liu, Zixuan Jiang, Ray T. Chen, David Z. Pan Towards Robustness of Deep Neural Networks via Regularization
Yao Li, Martin Renqiang Min, Thomas Lee, Wenchao Yu, Erik Kruus, Wei Wang, Cho-Jui Hsieh Towards Rotation Invariance in Object Detection
Agastya Kalra, Guy Stoppi, Bradley Brown, Rishav Agarwal, Achuta Kadambi Towards Understanding the Generative Capability of Adversarially Robust Classifiers
Yao Zhu, Jiacheng Ma, Jiacheng Sun, Zewei Chen, Rongxin Jiang, Yaowu Chen, Zhenguo Li Training Weakly Supervised Video Frame Interpolation with Events
Zhiyang Yu, Yu Zhang, Deyuan Liu, Dongqing Zou, Xijun Chen, Yebin Liu, Jimmy S. Ren TransferI2I: Transfer Learning for Image-to-Image Translation from Small Datasets
Yaxing Wang, Héctor Laria, Joost van de Weijer, Laura Lopez-Fuentes, Bogdan Raducanu Transparent Object Tracking Benchmark
Heng Fan, Halady Akhilesha Miththanthaya, Harshit, Siranjiv Ramana Rajan, Xiaoqiong Liu, Zhilin Zou, Yuewei Lin, Haibin Ling TransReID: Transformer-Based Object Re-Identification
Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, Wei Jiang TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, Houqiang Li TRAR: Routing the Attention Spans in Transformer for Visual Question Answering
Yiyi Zhou, Tianhe Ren, Chaoyang Zhu, Xiaoshuai Sun, Jianzhuang Liu, Xinghao Ding, Mingliang Xu, Rongrong Ji Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning
Junkai Huang, Chaowei Fang, Weikai Chen, Zhenhua Chai, Xiaolin Wei, Pengxu Wei, Liang Lin, Guanbin Li Tripartite Information Mining and Integration for Image Matting
Yuhao Liu, Jiake Xie, Xiao Shi, Yu Qiao, Yujie Huang, Yong Tang, Xin Yang TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild
Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio Savarese, Ehsan Adeli, Hamid Rezatofighi TS-CAM: Token Semantic Coupled Attention mAP for Weakly Supervised Object Localization
Wei Gao, Fang Wan, Xingjia Pan, Zhiliang Peng, Qi Tian, Zhenjun Han, Bolei Zhou, Qixiang Ye UASNet: Uncertainty Adaptive Sampling Network for Deep Stereo Matching
Yamin Mao, Zhihua Liu, Weiming Li, Yuchao Dai, Qiang Wang, Yun-Tae Kim, Hong-Seok Lee UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-Body Decoupling 3D Model
Haonan Yan, Jiaqi Chen, Xujie Zhang, Shengkai Zhang, Nianhong Jiao, Xiaodan Liang, Tianxiang Zheng Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection
Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan Unconditional Scene Graph Generation
Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari Unconstrained Scene Generation with Locally Conditioned Radiance Fields
Terrance DeVries, Miguel Angel Bautista, Nitish Srivastava, Graham W. Taylor, Joshua M. Susskind Understanding Robustness of Transformers for Image Classification
Srinadh Bhojanapalli, Ayan Chakrabarti, Daniel Glasner, Daliang Li, Thomas Unterthiner, Andreas Veit Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting
Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu Unifying Nonlocal Blocks for Neural Networks
Lei Zhu, Qi She, Duo Li, Yanye Lu, Xuejing Kang, Jie Hu, Changhu Wang Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction
Fang Zheng, Le Wang, Sanping Zhou, Wei Tang, Zhenxing Niu, Nanning Zheng, Gang Hua Unsupervised Deep Video Denoising
Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency
Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, Ziwei Liu Unsupervised Non-Rigid Image Distortion Removal via Grid Deformation
Nianyi Li, Simron Thapa, Cameron Whyte, Albert W. Reed, Suren Jayasuriya, Jinwei Ye Unsupervised Point Cloud Pre-Training via Occlusion Completion
Hanchen Wang, Qi Liu, Xiangyu Yue, Joan Lasenby, Matt J. Kusner Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals
Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool UVStyle-Net: Unsupervised Few-Shot Learning of 3D Style Similarity Measure for B-Reps
Peter Meltzer, Hooman Shayani, Amir Khasahmadi, Pradeep Kumar Jayaraman, Aditya Sanghi, Joseph Lambourne V-DESIRR: Very Fast Deep Embedded Single Image Reflection Removal
B H Pawan Prasad, K S Green Rosh, Lokesh R. Boregowda, Kaushik Mitra, Sanjoy Chowdhury VariTex: Variational Neural Face Textures
Marcel C. Bühler, Abhimitra Meka, Gengyan Li, Thabo Beeler, Otmar Hilliges Vector Neurons: A General Framework for SO(3)-Equivariant Networks
Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, Leonidas J. Guibas VENet: Voting Enhancement Network for 3D Object Detection
Qian Xie, Yu-Kun Lai, Jing Wu, Zhoutao Wang, Dening Lu, Mingqiang Wei, Jun Wang Vi2CLR: Video and Image for Visual Contrastive Learning of Representation
Ali Diba, Vivek Sharma, Reza Safdari, Dariush Lotfi, Saquib Sarfraz, Rainer Stiefelhagen, Luc Van Gool Video Annotation for Visual Tracking via Selection and Refinement
Kenan Dai, Jie Zhao, Lijun Wang, Dong Wang, Jianhua Li, Huchuan Lu, Xuesheng Qian, Xiaoyun Yang Video Matting via Consistency-Regularized Graph Neural Networks
Tiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang VideoLT: Large-Scale Long-Tailed Video Recognition
Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry S. Davis VidTr: Video Transformer Without Convolutions
Yanyi Zhang, Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joseph Tighe Viewing Graph Solvability via Cycle Consistency
Federica Arrigoni, Andrea Fusiello, Elisa Ricci, Tomas Pajdla Viewpoint Invariant Dense Matching for Visual Geolocalization
Gabriele Berton, Carlo Masone, Valerio Paolicelli, Barbara Caputo Viewpoint-Agnostic Change Captioning with Cycle Consistency
Hoeseong Kim, Jongseok Kim, Hyungseok Lee, Hyunsung Park, Gunhee Kim VIL-100: A New Dataset and a Baseline Model for Video Instance Lane Detection
Yujun Zhang, Lei Zhu, Wei Feng, Huazhu Fu, Mingqian Wang, Qingxia Li, Cheng Li, Song Wang Virtual Light Transport Matrices for Non-Line-of-Sight Imaging
Julio Marco, Adrian Jarabo, Ji Hyun Nam, Xiaochun Liu, Miguel Ángel Cosculluela, Andreas Velten, Diego Gutierrez Visformer: The Vision-Friendly Transformer
Zhengsu Chen, Lingxi Xie, Jianwei Niu, Xuefeng Liu, Longhui Wei, Qi Tian Vision Transformer with Progressive Sampling
Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip H.S. Torr, Wayne Zhang, Dahua Lin Vision Transformers for Dense Prediction
René Ranftl, Alexey Bochkovskiy, Vladlen Koltun Vision-Language Navigation with Random Environmental Mixup
Chong Liu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, Zongyuan Ge, Yi-Dong Shen Visual Distant Supervision for Scene Graph Generation
Yuan Yao, Ao Zhang, Xu Han, Mengdi Li, Cornelius Weber, Zhiyuan Liu, Stefan Wermter, Maosong Sun Visual Saliency Transformer
Nian Liu, Ni Zhang, Kaiyuan Wan, Ling Shao, Junwei Han Visual Scene Graphs for Audio Source Separation
Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja, Anoop Cherian Visual Transformers: Where Do Transformers Really Belong in Vision Models?
Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph E. Gonzalez, Kurt Keutzer, Peter Vajda ViViT: A Video Vision Transformer
Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lučić, Cordelia Schmid VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation
Zeyu Hu, Xuyang Bai, Jiaxiang Shang, Runze Zhang, Jiayu Dong, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction
Jaesung Choe, Sunghoon Im, Francois Rameau, Minjun Kang, In So Kweon Voxel Transformer for 3D Object Detection
Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu Warp-Refine Propagation: Semi-Supervised Auto-Labeling via Cycle-Consistency
Aditya Ganeshan, Alexis Vallet, Yasunori Kudo, Shin-ichi Maeda, Tommi Kerola, Rares Ambrus, Dennis Park, Adrien Gaidon Wasserstein Coupled Graph Learning for Cross-Modal Retrieval
Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang Watch Only Once: An End-to-End Video Action Detection Framework
Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo WaveFill: A Wavelet-Based Generation Network for Image Inpainting
Yingchen Yu, Fangneng Zhan, Shijian Lu, Jianxiong Pan, Feiying Ma, Xuansong Xie, Chunyan Miao WB-DETR: Transformer-Based Detector Without Backbone
Fanfan Liu, Haoran Wei, Wenzhe Zhao, Guozhen Li, Jingquan Peng, Zihao Li Weakly Supervised Contrastive Learning
Mingkai Zheng, Fei Wang, Shan You, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu Weakly Supervised Person Search with Region Siamese Networks
Chuchu Han, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, Yi Yang, Changhu Wang Weakly Supervised Text-Based Person Re-Identification
Shizhen Zhao, Changxin Gao, Yuanjie Shao, Wei-Shi Zheng, Nong Sang Weakly-Supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning
Yu Tian, Guansong Pang, Yuanhong Chen, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro Webly Supervised Fine-Grained Recognition: Benchmark Datasets and an Approach
Zeren Sun, Yazhou Yao, Xiu-Shen Wei, Yongshun Zhang, Fumin Shen, Jianxin Wu, Jian Zhang, Heng Tao Shen What You Can Learn by Staring at a Blank Wall
Prafull Sharma, Miika Aittala, Yoav Y. Schechner, Antonio Torralba, Gregory W. Wornell, William T. Freeman, Frédo Durand When Do GANs Replicate? on the Choice of Dataset Size
Qianli Feng, Chenqi Guo, Fabian Benitez-Quiroz, Aleix M. Martinez When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes
Philipp Bomatter, Mengmi Zhang, Dimitar Karev, Spandan Madan, Claire Tseng, Gabriel Kreiman Where2Act: From Pixels to Actions for Articulated 3D Objects
Kaichun Mo, Leonidas J. Guibas, Mustafa Mukadam, Abhinav Gupta, Shubham Tulsiani Who's Waldo? Linking People Across Text and Images
Yuqing Cui, Apoorv Khandelwal, Yoav Artzi, Noah Snavely, Hadar Averbuch-Elor X-World: Accessibility, Vision, and Autonomy Meet
Jimuyang Zhang, Minglan Zheng, Matthew Boyd, Eshed Ohn-Bar XVFI: eXtreme Video Frame Interpolation
Hyeonjun Sim, Jihyong Oh, Munchurl Kim YouRefIt: Embodied Reference Understanding with Language and Gesture
Yixin Chen, Qing Li, Deqian Kong, Yik Lun Kei, Song-Chun Zhu, Tao Gao, Yixin Zhu, Siyuan Huang Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition
Ming Lin, Pichao Wang, Zhenhong Sun, Hesen Chen, Xiuyu Sun, Qi Qian, Hao Li, Rong Jin Zero-Shot Day-Night Domain Adaptation with a Physics Prior
Attila Lengyel, Sourav Garg, Michael Milford, Jan C. van Gemert Zero-Shot Natural Language Video Localization
Jinwoo Nam, Daechul Ahn, Dongyeop Kang, Seong Jong Ha, Jonghyun Choi