WACV 2023
639 papers
3D Change Localization and Captioning from Dynamic Scans of Indoor Scenes
Yue Qiu, Shintaro Yamamoto, Ryosuke Yamada, Ryota Suzuki, Hirokatsu Kataoka, Kenji Iwata, Yutaka Satoh 3D GAN Inversion with Pose Optimization
Jaehoon Ko, Kyusun Cho, Daewon Choi, Kwangrok Ryoo, Seungryong Kim 3DMM-RF: Convolutional Radiance Fields for 3D Face Modeling
Stathis Galanakis, Baris Gecer, Alexandros Lattas, Stefanos Zafeiriou A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials
Chuqiao Li, Zhiwu Huang, Danda Pani Paudel, Yabin Wang, Mohamad Shahbazi, Xiaopeng Hong, Luc Van Gool A Morphology Focused Diffusion Probabilistic Model for Synthesis of Histopathology Images
Puria Azadi Moghadam, Sanne Van Dalen, Karina C. Martin, Jochen Lennerz, Stephen Yip, Hossein Farahani, Ali Bashashati A Neural Video Codec with Spatial Rate-Distortion Control
Noor Fathima, Jens Petersen, Guillaume Sautière, Auke Wiggers, Reza Pourreza A Quality Aware Sample-to-Sample Comparison for Face Recognition
Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Zafari, Moktari Mostofa, Nasser M. Nasrabadi A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation
Georgy Ponimatkin, Nermin Samet, Yang Xiao, Yuming Du, Renaud Marlet, Vincent Lepetit A Suspect Identification Framework Using Contrastive Relevance Feedback
Devansh Gupta, Aditya Saini, Sarthak Bhagat, Shagun Uppal, Rishi Raj Jain, Drishti Bhasin, Ponnurangam Kumaraguru, Rajiv Ratn Shah Addressing Feature Suppression in Unsupervised Visual Representations
Tianhong Li, Lijie Fan, Yuan Yuan, Hao He, Yonglong Tian, Rogerio Feris, Piotr Indyk, Dina Katabi AdvisIL - A Class-Incremental Learning Advisor
Eva Feillet, Grégoire Petit, Adrian Popescu, Marina Reyboz, Céline Hudelot An Embedding-Dynamic Approach to Self-Supervised Learning
Suhong Moon, Domas Buracas, Seunghyun Park, Jinkyu Kim, John Canny Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation
Zeyun Zhong, David Schneider, Michael Voit, Rainer Stiefelhagen, Jürgen Beyerer ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object Detection
Rebbapragada V. C. Sairam, Monish Keswani, Uttaran Sinha, Nishit Shah, Vineeth N. Balasubramanian ATCON: Attention Consistency for Vision Models
Ali Mirzazadeh, Florian Dubost, Maxwell Pike, Krish Maniar, Max Zuo, Christopher Lee-Messer, Daniel Rubin Audio-Visual Face Reenactment
Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar AudioViewer: Learning to Visualize Sounds
Chunjin Song, Yuchi Zhang, Willis Peng, Parmis Mohaghegh, Bastian Wandt, Helge Rhodin Augmentation by Counterfactual Explanation - Fixing an Overconfident Classifier
Sumedha Singla, Nihal Murali, Forough Arabshahi, Sofia Triantafyllou, Kayhan Batmanghelich Automatically Annotating Indoor Images with CAD Models via RGB-D Scans
Stefan Ainetter, Sinisa Stekovic, Friedrich Fraundorfer, Vincent Lepetit Back to MLP: A Simple Baseline for Human Motion Prediction
Wen Guo, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer Benchmarking Visual Localization for Autonomous Navigation
Lauri Suomela, Jussi Kalliola, Atakan Dag, Harry Edelman, Joni-Kristian Kämäräinen Bent & Broken Bicycles: Leveraging Synthetic Data for Damaged Object Re-Identification
Luca Piano, Filippo Gabriele Pratticò, Alessandro Sebastian Russo, Lorenzo Lanari, Lia Morra, Fabrizio Lamberti Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields
Mingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang Boosting Neural Video Codecs by Exploiting Hierarchical Redundancy
Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautière, Auke Wiggers Boosting Vision Transformers for Image Retrieval
Chull Hwan Song, Jooyoung Yoon, Shunghyun Choi, Yannis Avrithis BoxMask: Revisiting Bounding Box Supervision for Video Object Detection
Khurram Azeem Hashmi, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal Burst Reflection Removal Using Reflection Motion Aggregation Cues
B. H. Pawan Prasad, K. S. Green Rosh, R. B. Lokesh, Kaushik Mitra Burst Vision Using Single-Photon Cameras
Sizhuo Ma, Paul Mos, Edoardo Charbon, Mohit Gupta BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-Wild 2D Annotations
Cheng-Yen Yang, Jiajia Luo, Lu Xia, Yuyin Sun, Nan Qiao, Ke Zhang, Zhongyu Jiang, Jenq-Neng Hwang, Cheng-Hao Kuo Can Shadows Reveal Biometric Information?
Safa C. Medin, Amir Weiss, Frédo Durand, William T. Freeman, Gregory W. Wornell CAST: Conditional Attribute Subsampling Toolkit for Fine-Grained Evaluation
Wes Robbins, Steven Zhou, Aman Bhatta, Chad Mello, Vítor Albiero, Kevin W. Bowyer, Terrance E. Boult Centroid Distance Keypoint Detector for Colored Point Clouds
Hanzhe Teng, Dimitrios Chatziparaschis, Xinyue Kan, Amit K. Roy-Chowdhury, Konstantinos Karydis Class-Level Confidence Based 3D Semi-Supervised Learning
Zhimin Chen, Longlong Jing, Liang Yang, Yingwei Li, Bing Li CoKe: Contrastive Learning for Robust Keypoint Detection
Yutong Bai, Angtian Wang, Adam Kortylewski, Alan Yuille Composite Learning for Robust and Effective Dense Predictions
Menelaos Kanakis, Thomas E. Huang, David Brüggemann, Fisher Yu, Luc Van Gool Computer Vision for International Border Legibility
Trevor Ortega, Thomas Nelson, Skyler Crane, Josh Myers-Dean, Scott Wehrwein Computer Vision for Ocean Eddy Detection in Infrared Imagery
Evangelos Moschos, Alisa Kugusheva, Paul Coste, Alexandre Stegner Context-Empowered Visual Attention Prediction in Pedestrian Scenarios
Igor Vozniak, Philipp Müller, Lorena Hell, Nils Lipp, Ahmed Abouelazm, Christian Müller Continual Learning with Dependency Preserving Hypernetworks
Dupati Srikar Chandra, Sakshi Varshney, P. K. Srijith, Sunil Gupta Contrastive Knowledge-Augmented Meta-Learning for Few-Shot Classification
Rakshith Subramanyam, Mark Heimann, T.S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan Contrastive Learning of Semantic Concepts for Open-Set Cross-Domain Retrieval
Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan, Biplab Banerjee Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation
Verica Lazova, Vladimir Guzov, Kyle Olszewski, Sergey Tulyakov, Gerard Pons-Moll Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance
Fariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando de la Torre, Steven Song, Aayush Prakash, Daeil Kim Cooperative Self-Training for Multi-Target Adaptive Semantic Segmentation
Yangsong Zhang, Subhankar Roy, Hongtao Lu, Elisa Ricci, Stéphane Lathuilière CountNet3D: A 3D Computer Vision Approach to Infer Counts of Occluded Objects
Porter Jenkins, Kyle Armstrong, Stephen Nelson, Siddhesh Gotad, J. Stockton Jenkins, Wade Wilkey, Tanner Watts Cross-View Image Sequence Geo-Localization
Xiaohan Zhang, Waqas Sultani, Safwan Wshah CTrGAN: Cycle Transformers GAN for Gait Transfer
Shahar Mahpod, Noam Gaash, Hay Hoffman, Gil Ben-Artzi Dance Style Transfer with Cross-Modal Transformer
Wenjie Yin, Hang Yin, Kim Baraka, Danica Kragic, Mårten Björkman DBCE: A Saliency Method for Medical Deep Learning Through Anatomically-Consistent Free-Form Deformations
Joshua Peters, Léo Lebrat, Rodrigo Santa Cruz, Aaron Nicolson, Gregg Belous, Salamata Konate, Parnesh Raniga, Vincent Dore, Pierrick Bourgeat, Jurgen Mejan-Fripp, Clinton Fookes, Olivier Salvado DELS-MVS: Deep Epipolar Line Search for Multi-View Stereo
Christian Sormann, Emanuele Santellani, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer Dense Prediction with Attentive Feature Aggregation
Yung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bulò, Peter Kontschieder, Fisher Yu Dense Voxel Fusion for 3D Object Detection
Anas Mahmoud, Jordan S. K. Hu, Steven L. Waslander Diffeomorphic Image Registration with Neural Velocity Field
Kun Han, Shanlin Sun, Xiangyi Yan, Chenyu You, Hao Tang, Junayed Naushad, Haoyu Ma, Deying Kong, Xiaohui Xie DigiFace-1m: 1 Million Digital Face Images for Face Recognition
Gwangbin Bae, Martin de La Gorce, Tadas Baltrušaitis, Charlie Hewitt, Dong Chen, Julien Valentin, Roberto Cipolla, Jingjing Shen Domain Adaptation Using Self-Training with Mixup for One-Stage Object Detection
Jitender Maurya, Keyur R. Ranipa, Osamu Yamaguchi, Tomoyuki Shibata, Daisuke Kobayashi Domain Invariant Vision Transformer Learning for Face Anti-Spoofing
Chen-Hao Liao, Wen-Cheng Chen, Hsuan-Tung Liu, Yi-Ren Yeh, Min-Chun Hu, Chu-Song Chen DRAMA: Joint Risk Localization and Captioning in Driving
Srikanth Malla, Chiho Choi, Isht Dwivedi, Joon Hee Choi, Jiachen Li DSAG: A Scalable Deep Framework for Action-Conditioned Multi-Actor Full Body Motion Synthesis
Debtanu Gupta, Shubh Maheshwari, Sai Shashank Kalakonda, Manasvi Vaidyula, Ravi Kiran Sarvadevabhatla DSFormer: A Dual-Domain Self-Supervised Transformer for Accelerated Multi-Contrast MRI Reconstruction
Bo Zhou, Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Chi Liu, James S. Duncan, Michal Sofka DSTrans: Dual-Stream Transformer for Hyperspectral Image Restoration
Dabing Yu, Qingwu Li, Xiaolin Wang, Zhiliang Zhang, Yixi Qian, Chang Xu DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network
Kamalakar Vijay Thakare, Yash Raghuwanshi, Debi Prosad Dogra, Heeseung Choi, Ig-Jae Kim Dynamic Neural Portraits
Michail Christos Doukas, Stylianos Ploumpis, Stefanos Zafeiriou Effective Invertible Arbitrary Image Rescaling
Zhihong Pan, Baopu Li, Dongliang He, Wenhao Wu, Errui Ding Efficient Few-Shot Learning for Pixel-Precise Handwritten Document Layout Analysis
Axel De Nardin, Silvia Zottin, Matteo Paier, Gian Luca Foresti, Emanuela Colombi, Claudio Piciarelli Efficient Flow-Guided Multi-Frame De-Fencing
Stavros Tsogkas, Fengjia Zhang, Allan Jepson, Alex Levinshtein Efficient Visual Tracking with Exemplar Transformers
Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification
Tien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler, Minh-Triet Tran, Ngan Le Enabling ISPless Low-Power Computer Vision
Gourav Datta, Zeyu Liu, Zihan Yin, Linyu Sun, Akhilesh R. Jaiswal, Peter A. Beerel Enhanced Bi-Directional Motion Estimation for Video Frame Interpolation
Xin Jin, Longhai Wu, Guotao Shen, Youxin Chen, Jie Chen, Jayoon Koo, Cheul-hee Hahm Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution
Jinsu Yoo, Taehoon Kim, Sihaeng Lee, Seung Hwan Kim, Honglak Lee, Tae Hyun Kim Evaluating Generative Networks Using Gaussian Mixtures of Image Features
Lorenzo Luzi, Carlos Ortiz Marrero, Nile Wynar, Richard G. Baraniuk, Michael J. Henry Event-Based RGB Sensing with Structured Light
Seyed Ehsan Marjani Bajestani, Giovanni Beltrame Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs
Shengyu Feng, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar, Subarna Tripathi FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping
Felix Rosberg, Eren Erdal Aksoy, Fernando Alonso-Fernandez, Cristofer Englund FaceOff: A Video-to-Video Face Swapping System
Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Far3Det: Towards Far-Field 3D Detection
Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong Fast and Accurate: Video Enhancement Using Sparse Depth
Yu Feng, Patrick Hansen, Paul N. Whatmough, Guoyu Lu, Yuhao Zhu FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning
Grégoire Petit, Adrian Popescu, Hugo Schindler, David Picard, Bertrand Delezoide Few-Shot Object Detection via Improved Classification Features
Xinyu Jiang, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Duoqian Miao Fine-Grained Activities of People Worldwide
Jeffrey Byrne, Gregory Castañón, Zhongheng Li, Gil Ettinger Fine-Grained Affordance Annotation for Egocentric Hand-Object Interaction Videos
Zecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Goutsu, Yoichi Sato FreeREA: Training-Free Evolution-Based Architecture Search
Niccolò Cavagnero, Luca Robbiano, Barbara Caputo, Giuseppe Averta From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments
Britty Baby, Daksh Thapar, Mustafa Chasmai, Tamajit Banerjee, Kunal Dargan, Ashish Suri, Subhashis Banerjee, Chetan Arora FUSSL: Fuzzy Uncertain Self Supervised Learning
Salman Mohamadi, Gianfranco Doretto, Donald A. Adjeroh GAFNet: A Global Fourier Self Attention Based Novel Network for Multi-Modal Downstream Tasks
Onkar Susladkar, Gayatri Deshmukh, Dhruv Makwana, Sparsh Mittal, R. Sai Chandra Teja, Rekha Singhal GEMS: Generating Efficient Meta-Subnets
Varad Pimpalkhute, Shruti Kunde, Rekha Singhal GEMS: Scene Expansion Using Generative Models of Graphs
Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay Generative Colorization of Structured Mobile Web Pages
Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding
Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless Fowlkes GLAD: A Global-to-Local Anomaly Detector
Aitor Artola, Yannis Kolodziej, Jean-Michel Morel, Thibaud Ehret Guiding Visual Question Answering with Attention Priors
Thao Minh Le, Vuong Le, Sunil Gupta, Svetha Venkatesh, Truyen Tran Hear the Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization
Dennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraju Heightfields for Efficient Scene Reconstruction for AR
Jamie Watson, Sara Vicente, Oisin Mac Aodha, Clément Godard, Gabriel Brostow, Michael Firman HiFormer: Hierarchical Multi-Scale Representations Using Transformers for Medical Image Segmentation
Moein Heidari, Amirhossein Kazerouni, Milad Soltany, Reza Azad, Ehsan Khodapanah Aghdam, Julien Cohen-Adad, Dorit Merhof HIME: Efficient Headshot Image Super-Resolution with Multiple Exemplars
Xiaoyu Xiang, Jon Morton, Fitsum A. Reda, Lucas D. Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan P. Allebach Human-in-the-Loop Video Semantic Segmentation Auto-Annotation
Nan Qiao, Yuyin Sun, Chong Liu, Lu Xia, Jiajia Luo, Ke Zhang, Cheng-Hao Kuo HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar
Shih-Po Lee, Niraj Prakash Kini, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang HyperShot: Few-Shot Learning by Kernel HyperNetworks
Marcin Sendera, Marcin Przewięźlikowski, Konrad Karanowski, Maciej Zięba, Jacek Tabor, Przemysław Spurek IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes
Shubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar IFQA: Interpretable Face Quality Assessment
Byungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong Image Completion with Heterogeneously Filtered Spectral Hints
Xingqian Xu, Shant Navasardyan, Vahram Tadevosyan, Andranik Sargsyan, Yadong Mu, Humphrey Shi Image-Free Domain Generalization via CLIP for 3D Hand Pose Estimation
Seongyeong Lee, Hansoo Park, Dong Uk Kim, Jihyeon Kim, Muhammadjon Boboev, Seungryul Baek ImpDet: Exploring Implicit Fields for 3D Object Detection
Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, Xiangyang Xue ImPosing: Implicit Pose Encoding for Efficient Visual Localization
Arthur Moreau, Thomas Gilles, Nathan Piasco, Dzmitry Tsishkou, Bogdan Stanciulescu, Arnaud de La Fortelle Improving Deep Facial Phenotyping for Ultra-Rare Disorder Verification Using Model Ensembles
Alexander Hustinx, Fabio Hellmann, Ömer Sümer, Behnam Javanmardi, Elisabeth André, Peter Krawitz, Tzung-Chien Hsieh Improving Diversity with Adversarially Learned Transformations for Domain Generalization
Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou Yang Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth Information
Ahmed Ben Saad, Kristina Prokopetc, Josselin Kherroubi, Axel Davy, Adrien Courtois, Gabriele Facciolo Instance-Dependent Noisy Label Learning via Graphical Modelling
Arpit Garg, Cuong Nguyen, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro Knowing What to Label for Few Shot Microscopy Image Cell Segmentation
Youssef Dawoud, Arij Bouazizi, Katharina Ernst, Gustavo Carneiro, Vasileios Belagiannis Language-Free Training for Zero-Shot Video Grounding
Dahye Kim, Jungin Park, Jiyoung Lee, Seongheon Park, Kwanghoon Sohn LAVA: Label-Efficient Visual Learning and Adaptation
Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari LayerDoc: Layer-Wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents
Puneet Mathur, Rajiv Jain, Ashutosh Mehra, Jiuxiang Gu, Franck Dernoncourt, Anandhavelu N., Quan Tran, Verena Kaynig-Fittkau, Ani Nenkova, Dinesh Manocha, Vlad I. Morariu Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated Learning
Donald Shenaj, Eros Fanì, Marco Toldo, Debora Caldarola, Antonio Tavera, Umberto Michieli, Marco Ciccone, Pietro Zanuttigh, Barbara Caputo Learning Attention Propagation for Compositional Zero-Shot Learning
Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal Learning by Hallucinating: Vision-Language Pre-Training with Weak Supervision
Tzu-Jui Julius Wang, Jorma Laaksonen, Tomas Langer, Heikki Arponen, Tom E. Bishop Learning Lightweight Neural Networks via Channel-Split Recurrent Convolution
Guojun Wu, Xin Zhang, Ziming Zhang, Yanhua Li, Xun Zhou, Christopher Brinton, Zhenming Liu Learning to Detect 3D Lanes by Shape Matching and Embedding
Ruixin Liu, Zhihao Guan, Zejian Yuan, Ao Liu, Tong Zhou, Tang Kun, Erlong Li, Chao Zheng, Shuqi Mei Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial Attacks
Abhishek Aich, Shasha Li, Chengyu Song, M. Salman Asif, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury Lightweight Network for Video Motion Magnification
Jasdeep Singh, Subrahmanyam Murala, G. Sankara Raju Kosuru Lightweight Video Denoising Using Aggregated Shifted Window Attention
Lydia Lindner, Alexander Effland, Filip Ilic, Thomas Pock, Erich Kobler LineEX: Data Extraction from Scientific Line Charts
V. P. Shivasankaran, Muhammad Yusuf Hassan, Mayank Singh LoopDA: Constructing Self-Loops to Adapt Nighttime Semantic Segmentation
Fengyi Shen, Zador Pataki, Akhil Gurram, Ziyuan Liu, He Wang, Alois Knoll LRA&LDRA: Rethinking Residual Predictions for Efficient Shadow Detection and Removal
Mehmet Kerim Yücel, Valia Dimaridou, Bruno Manganelli, Mete Ozay, Anastasios Drosou, Albert Saà-Garriga M-FUSE: Multi-Frame Fusion for Scene Flow Estimation
Lukas Mehl, Azin Jahedi, Jenny Schmalfuss, Andrés Bruhn Mapping DNN Embedding Manifolds for Network Generalization Prediction
Molly O’Brien, Brett Wolfinger, Julia Bukowski, Mathias Unberath, Aria Pezeshk, Gregory D. Hager Masked Image Modeling Advances 3D Medical Image Analysis
Zekai Chen, Devansh Agarwal, Kshitij Aggarwal, Wiem Safta, Mariann Micsinai Balan, Kevin Brown Meta-Auxiliary Learning for Future Depth Prediction in Videos
Huan Liu, Zhixiang Chi, Yuanhao Yu, Yang Wang, Jun Chen, Jin Tang MEVID: Multi-View Extended Videos with Identities for Video Person Re-Identification
Daniel Davila, Dawei Du, Bryon Lewis, Christopher Funk, Joseph Van Pelt, Roderic Collins, Kellie Corona, Matt Brown, Scott McCloskey, Anthony Hoogs, Brian Clipp MFFN: Multi-View Feature Fusion Network for Camouflaged Object Detection
Dehua Zheng, Xiaochen Zheng, Laurence T. Yang, Yuan Gao, Chenlu Zhu, Yiheng Ruan ML-Decoder: Scalable and Versatile Classification Head
Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben-Baruch, Asaf Noy MMPTRACK: Large-Scale Densely Annotated Multi-Camera Multiple People Tracking Benchmark
Xiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang, Peng Chu, Houdong Hu, Jiang Wang, Zicheng Liu Modality Mixer for Multi-Modal Action Recognition
Sumin Lee, Sangmin Woo, Yeonju Park, Muhammad Adi Nugroho, Changick Kim Modeling Stroke Mask for End-to-End Text Erasing
Xiangcheng Du, Zhao Zhou, Yingbin Zheng, Tianlong Ma, Xingjiao Wu, Cheng Jin More Control for Free! Image Synthesis with Semantic Diffusion Guidance
Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell Motif Mining: Finding and Summarizing Remixed Image Content
William Theisen, Daniel Gonzalez Cedre, Zachariah Carmichael, Daniel Moreira, Tim Weninger, Walter Scheirer Motion Aware Self-Supervision for Generic Event Boundary Detection
Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O’Connor MovieCLIP: Visual Scene Recognition in Movies
Digbalay Bose, Rajat Hebbar, Krishna Somandepalli, Haoyang Zhang, Yin Cui, Kree Cole-McLaughlin, Huisheng Wang, Shrikanth Narayanan Multi-View Action Recognition Using Contrastive Learning
Ketul Shah, Anshul Shah, Chun Pong Lau, Celso M. de Melo, Rama Chellappa Multi-View Photometric Stereo Revisited
Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc Van Gool Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution
Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Cătălin Ristea, Nicolae Verga, Fahad Shahbaz Khan NAPReg: Nouns as Proxies Regularization for Semantically Aware Cross-Modal Embeddings
Bhavin Jawade, Deen Dayal Mohan, Naji Mohamed Ali, Srirangaraj Setlur, Venu Govindaraju NeuralBF: Neural Bilateral Filtering for Top-Down Instance Segmentation on Point Clouds
Weiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi nLMVS-Net: Deep Non-Lambertian Multi-View Stereo
Kohei Yamashita, Yuto Enyo, Shohei Nobuhara, Ko Nishino OCR-VQGAN: Taming Text-Within-Image Generation
Juan A. Rodríguez, David Vazquez, Issam Laradji, Marco Pedersoli, Pau Rodriguez On Quantizing Implicit Neural Representations
Cameron Gordon, Shin-Fang Chng, Lachlan MacDonald, Simon Lucey One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text
Abhinav Java, Shripad Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy One-Shot Synthesis of Images and Segmentation Masks
Vadim Sushko, Dan Zhang, Jürgen Gall, Anna Khoreva OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Rohan Sarkar, Navaneeth Bodla, Mariya I. Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gerard Medioni Overlap-Guided Gaussian Mixture Models for Point Cloud Registration
Guofeng Mei, Fabio Poiesi, Cristiano Saltori, Jian Zhang, Elisa Ricci, Nicu Sebe Panoptic-Aware Image-to-Image Translation
Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura Partially Calibrated Semi-Generalized Pose from Hybrid Point Correspondences
Snehal Bhayani, Torsten Sattler, Viktor Larsson, Janne Heikkilä, Zuzana Kukelova PatchDropout: Economizing Vision Transformers Using Patch Dropout
Yue Liu, Christos Matsoukas, Fredrik Strand, Hossein Azizpour, Kevin Smith Perceptual Image Enhancement for Smartphone Real-Time Applications
Marcos V. Conde, Florin Vasluianu, Javier Vazquez-Corral, Radu Timofte Physically Plausible Animation of Human Upper Body from a Single Image
Ziyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu, C. Karen Liu Pik-Fix: Restoring and Colorizing Old Photos
Runsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, Jinlong Li, Zibo Meng, Jiaqi Ma, Alan Bovik, Hongkai Yu PP4AV: A Benchmarking Dataset for Privacy-Preserving Autonomous Driving
Linh Trinh, Phuong Pham, Hoang Trinh, Nguyen Bach, Dung Nguyen, Giang Nguyen, Huy Nguyen PreViTS: Contrastive Pretraining with Video Tracking Supervision
Brian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik PRN: Panoptic Refinement Network
Bo Sun, Jason Kuen, Zhe Lin, Philippos Mordohai, Simon Chen Proactive Deepfake Defence via Identity Watermarking
Yuan Zhao, Bo Liu, Ming Ding, Baoping Liu, Tianqing Zhu, Xin Yu ProtoSeg: Interpretable Semantic Segmentation with Prototypical Parts
Mikołaj Sacha, Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński Pushing the Efficiency Limit Using Structured Sparse Convolutions
Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin QMagFace: Simple and Accurate Quality-Aware Face Recognition
Philipp Terhörst, Malte Ihlefeld, Marco Huber, Naser Damer, Florian Kirchbuchner, Kiran Raja, Arjan Kuijper Real-Time Restoration of Dark Stereo Images
Mohit Lamba, M. V. A. Suhas Kumar, Kaushik Mitra Realistic Full-Body Anonymization with Surface-Guided GANs
Håkon Hukkelås, Morten Smebye, Rudolf Mester, Frank Lindseth Recipe2Video: Synthesizing Personalized Videos from Recipe Texts
Prateksha Udhayanan, Suryateja Bv, Parth Laturia, Dev Chauhan, Darshan Khandelwal, Stefano Petrangeli, Balaji Vasan Srinivasan Recovering Fine Details for Neural Implicit Surface Reconstruction
Decai Chen, Peng Zhang, Ingo Feldmann, Oliver Schreer, Peter Eisert ReEnFP: Detail-Preserving Face Reconstruction by Encoding Facial Priors
Yasheng Sun, Jiangke Lin, Hang Zhou, Zhiliang Xu, Dongliang He, Hideki Koike Relaxing Contrastiveness in Multimodal Representation Learning
Zudi Lin, Erhan Bas, Kunwar Yashraj Singh, Gurumurthy Swaminathan, Rahul Bhotika Representation Recovering for Self-Supervised Pre-Training on Medical Images
Xiangyi Yan, Junayed Naushad, Shanlin Sun, Kun Han, Hao Tang, Deying Kong, Haoyu Ma, Chenyu You, Xiaohui Xie SAILOR: Scaling Anchors via Insights into Latent Object Representation
Dušan Malić, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof SAT: Scale-Augmented Transformer for Person Search
Mustansar Fiaz, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan ScanNeRF: A Scalable Benchmark for Neural Radiance Fields
Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi Di Stefano SD-Pose: Structural Discrepancy Aware Category-Level 6d Object Pose Estimation
Guowei Li, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Tianyu Zhang, Xiaolin Zhang, Jiamao Li Segmentation-Free Direct Iris Localization Networks
Takahiro Toizumi, Koichi Takahashi, Masato Tsukada Self-Attentive Pooling for Efficient Deep Learning
Fang Chen, Gourav Datta, Souvik Kundu, Peter A. Beerel Self-Distillation for Unsupervised 3D Domain Adaptation
Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano Self-Distilled Self-Supervised Representation Learning
Jiho Jang, Seonhoon Kim, Kiyoon Yoo, Chaerin Kong, Jangho Kim, Nojun Kwak Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion
Srikrishna Jaganathan, Maximilian Kukla, Jian Wang, Karthik Shetty, Andreas Maier Self-Supervised Correspondence Estimation via Multiview Registration
Mohamed El Banani, Ignacio Rocco, David Novotny, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Ben Graham Self-Supervised Learning with Local Contrastive Loss for Detection and Semantic Segmentation
Ashraful Islam, Benjamin Lundell, Harpreet Sawhney, Sudipta N. Sinha, Peter Morales, Richard J. Radke SHARDS: Efficient Shadow Removal Using Dual Stage Network for High-Resolution Images
Mrinmoy Sen, Sai Pradyumna Chermala, Nazrinbanu Nurmohammad Nagori, Venkat Peddigari, Praful Mathur, B. H. Pawan Prasad, Moonhwan Jeong Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning
Julien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault, Stéphane Canu SIRA: Relightable Avatars from a Single Image
Pol Caselles, Eduard Ramon, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer, Gil Triginer SIUNet: Sparsity Invariant U-Net for Edge-Aware Depth Completion
Avinash Nittur Ramesh, Fabio Giovanneschi, María A. González-Huici Skew-Robust Human-Object Interactions in Videos
Apoorva Agarwal, Rishabh Dabral, Arjun Jain, Ganesh Ramakrishnan SONGs: Self-Organizing Neural Graphs
Łukasz Struski, Tomasz Danel, Marek Śmieja, Jacek Tabor, Bartosz Zieliński Sparsity Agnostic Depth Completion
Andrea Conti, Matteo Poggi, Stefano Mattoccia Spatially Multi-Conditional Image Generation
Nikola Popović, Ritika Chakraborty, Danda Pani Paudel, Thomas Probst, Luc Van Gool Spatio-Temporal Action Detection Under Large Motion
Gurkirt Singh, Vasileios Choutas, Suman Saha, Fisher Yu, Luc Van Gool Spike-Based Anytime Perception
Matthew Dutson, Yin Li, Mohit Gupta SPIQ: Data-Free Per-Channel Static Input Quantization
Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly Split to Learn: Gradient Split for Multi-Task Human Image Analysis
Weijian Deng, Yumin Suh, Xiang Yu, Masoud Faraki, Liang Zheng, Manmohan Chandraker Synthetic Latent Fingerprint Generator
André Brasil Vieira Wyzykowski, Anil K. Jain Temporally Consistent Online Depth Estimation in Dynamic Scenes
Zhaoshuo Li, Wei Ye, Dilin Wang, Francis X. Creighton, Russell H. Taylor, Ganesh Venkatesh, Mathias Unberath TeST: Test-Time Self-Training Under Distribution Shift
Samarth Sinha, Peter Gehler, Francesco Locatello, Bernt Schiele Text and Image Guided 3D Avatar Generation and Manipulation
Zehranaz Canfes, M. Furkan Atasoy, Alara Dirik, Pinar Yanardag The Box Size Confidence Bias Harms Your Object Detector
Johannes Gilg, Torben Teepe, Fabian Herzog, Gerhard Rigoll The Change You Want to See
Ragav Sachdeva, Andrew Zisserman The Fully Convolutional Transformer for Medical Image Segmentation
Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, Dirk Husmeier TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders Using Hierarchical Maps Distillation
Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness Token Pooling in Vision Transformers for Image Classification
Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel Towards Discriminative and Transferable One-Stage Few-Shot Object Detectors
Karim Guirguis, Mohamed Abdelsamad, George Eskandar, Ahmed Hendawy, Matthias Kayser, Bin Yang, Jürgen Beyerer Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni, Kiana Ehsani, Luca Weihs, Jordi Salvador Towards Equivariant Optical Flow Estimation with Deep Learning
Stefano Savian, Pietro Morerio, Alessio Del Bue, Andrea A. Janes, Tammam Tillo Towards Generating Ultra-High Resolution Talking-Face Videos with Lip Synchronization
Anchit Gupta, Rudrabha Mukhopadhyay, Sindhu Balachandra, Faizan Farooq Khan, Vinay P. Namboodiri, C. V. Jawahar Tracking Growth and Decay of Plant Roots in Minirhizotron Images
Alexander Gillert, Bo Peters, Uwe Freiherr von Lukas, Jürgen Kreyling, Gesche Blume-Werry Transformers for Recognition in Overhead Imagery: A Reality Check
Francesco Luzi, Aneesh Gupta, Leslie Collins, Kyle Bradbury, Jordan Malof TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection
Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Tianrui Liu, Shijian Lu, Liang Pan TransVLAD: Multi-Scale Attention-Based Global Descriptors for Visual Geo-Localization
Yifan Xu, Pourya Shamsolmoali, Eric Granger, Claire Nicodeme, Laurent Gardes, Jie Yang TTTFlow: Unsupervised Test-Time Training with Normalizing Flow
David Osowiechi, Gustavo A. Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers Uncertainty-Aware Interactive LiDAR Sampling for Deep Depth Completion
Kensuke Taguchi, Shogo Morita, Yusuke Hayashi, Wataru Imaeda, Hironobu Fujiyoshi Unifying Distribution Alignment as a Loss for Imbalanced Semi-Supervised Learning
Justin Lazarow, Kihyuk Sohn, Chen-Yu Lee, Chun-Liang Li, Zizhao Zhang, Tomas Pfister Unifying Margin-Based SoftMax Losses in Face Recognition
Yang Zhang, Simao Herdade, Kapil Thadani, Eric Dodds, Jack Culpepper, Yueh-Ning Ku Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh S., Anchit Gupta, C. V. Jawahar, Makarand Tapaswi Unsupervised Video Object Segmentation via Prototype Memory Network
Minhyeok Lee, Suhwan Cho, Seunghoon Lee, Chaewon Park, Sangyoun Lee UVCGAN: UNet Vision Transformer Cycle-Consistent GAN for Unpaired Image-to-Image Translation
Dmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren Video Joint Denoising and Demosaicing with Recurrent CNNs
Valéry Dewil, Adrien Courtois, Mariano Rodríguez, Thibaud Ehret, Nicola Brandonisio, Denis Bujoreanu, Gabriele Facciolo, Pablo Arias Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin, Yen-Chen Lin, Wei-Sheng Lai, Tsung-Yi Lin, Yi-Chang Shih, Ravi Ramamoorthi VSGD-Net: Virtual Staining Guided Melanocyte Detection on Histopathological Images
Kechun Liu, Beibin Li, Wenjun Wu, Caitlin May, Oliver Chang, Stevan Knezevich, Lisa Reisch, Joann Elmore, Linda Shapiro Watching the News: Towards VideoQA Models That Can Read
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar Wavelength-Aware 2D Convolutions for Hyperspectral Imaging
Leon Amadeus Varga, Martin Messmer, Nuri Benbarka, Andreas Zell Weakly-Supervised Point Cloud Instance Segmentation with Geometric Priors
Heming Du, Xin Yu, Farookh Hussain, Mohammad Ali Armin, Lars Petersson, Weihao Li WSNet: Towards an Effective Method for Wound Image Segmentation
Subba Reddy Oota, Vijay Rowtula, Shahid Mohammed, Minghsun Liu, Manish Gupta X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation
Shubhankar Borse, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Yogamani, Fatih Porikli