ICCVW 2023 498 papers
A Comprehensive Empirical Evaluation on Online Continual Learning
Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Vincenzo Lomonaco, Joost van de Weijer, Hamed Hemati
A Comprehensive Study of Transfer Learning Under Constraints
Tom Pégeot, Inna Kucher, Adrian Popescu, Bertrand Delezoide
A New Dataset for End-to-End Sign Language Translation: The Greek Elementary School Dataset
Andreas Voskou, Konstantinos P. Panousis, Harris Partaourides, Kyriakos Tolias, Sotirios Chatzis
A New Large Dataset and a Transfer Learning Methodology for Plant Phenotyping in Vertical Farms
Nico Samà, Etienne David, Simone Rossetti, Alessandro Antona, Benjamin Franchetti, Fiora Pirri
A Simple and Explainable Method for Uncertainty Estimation Using Attribute Prototype Networks
Claudius Zelenka, Andrea Göhring, Daniyal Kazempour, Maximilian Hünemörder, Lars Schmarje, Peer Kröger
A Simple Signal for Domain Shift
Goirik Chakrabarty, Manogna Sreenivas, Soma Biswas
Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity
Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah Deeplite
Actor-Agnostic Multi-Label Action Recognition with Multi-Modal Query
Anindya Mondal, Sauradip Nag, Joaquin M. Prada, Xiatian Zhu, Anjan Dutta
AD-CLIP: Adapting Domains in Prompt Space Using CLIP
Mainak Singha, Harsh Pal, Ankit Jha, Biplab Banerjee
Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-Free Continual Learning
Filip Szatkowski, Mateusz Pyla, Marcin Przewiezlikowski, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski
Adapting Vision Foundation Models for Plant Phenotyping
Feng Chen, Mario Valerio Giuffrida, Sotirios A. Tsaftaris
Adaptive Self-Training for Object Detection
Renaud Vandeghen, Gilles Louppe, Marc Van Droogenbroeck
Adversarial Attacks Against Uncertainty Quantification
Emanuele Ledda, Daniele Angioni, Giorgio Piras, Giorgio Fumera, Battista Biggio, Fabio Roli
Adversarial Examples with Specular Highlights
Vanshika Vats, Koteswar Rao Jerripothula
Affordance Segmentation of Hand-Occluded Containers from Exocentric Images
Tommaso Apicella, Alessio Xompero, Edoardo Ragusa, Riccardo Berta, Andrea Cavallaro, Paolo Gastaldo
Alignment and Generation Adapter for Efficient Video-Text Understanding
Han Fang, Zhifei Yang, Yuhan Wei, Xianghao Zang, Chao Ban, Zerun Feng, Zhongjiang He, Yongxiang Li, Hao Sun
All-Pairs Consistency Learning for Weakly Supervised Semantic Segmentation
Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, Nick Barnes
An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports
Ethan Dack, Lorenzo Brigato, Matthew McMurray, Matthias Fontanellaz, Thomas Frauenfelder, Hanno Hoppe, Aristomenis K. Exadaktylos, Thomas Geiser, Manuela Funke-Chambour, Andreas Christe, Lukas Ebner, Stavroula G. Mougiakakou
An Empirical Analysis of Range for 3D Object Detection
Neehar Peri, Mengtian Li, Benjamin Wilson, Yu-Xiong Wang, James Hays, Deva Ramanan
An Empirical Study of the Effect of Video Encoders on Temporal Video Grounding
Ignacio M. De La Jara, Cristian Rodriguez Opazo, Edison Marrese-Taylor, Felipe Bravo-Marquez
An Experimental Protocol for Neural Architecture Search in Super-Resolution
Jesús Leopoldo Llano García, Raúl Monroy, Víctor Adrián Sosa-Hernández
Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation
Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li
AntiNODE: Evaluating Efficiency Robustness of Neural ODEs
Mirazul Haque, Simin Chen, Wasif Arman Haque, Cong Liu, Wei Yang
AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation
Damian Sójka, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models
Jan Warchocki, Teodor Oprescu, Yunhan Wang, Alexandru Damacus, Paul Misterka, Robert-Jan Bruintjes, Attila Lengyel, Ombretta Strafforello, Jan van Gemert
BluNF: Blueprint Neural Field
Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton
BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis
Angela Castillo, María Escobar, Guillaume Jeanneret, Albert Pumarola, Pablo Arbeláez, Ali K. Thabet, Artsiom Sanakoyeu
Chest X-Ray Feature Pyramid Sum Model with Diseased Area Data Augmentation Method
Changhyun Kim, Giyeol Kim, Sooyoung Yang, Hyunsu Kim, Sangyool Lee, Hansu Cho
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
Jan Oscar Cross-Zamirski, Praveen Anand, Guy B. Williams, Elizabeth Mouchet, Yinhai Wang, Carola-Bibiane Schönlieb
ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato
Clustering-Based Domain-Incremental Learning
Christiaan Lamers, René Vidal, Nabil Belbachir, Niki van Stein, Thomas Bäck, Paris Giampouras
CNOS: A Strong Baseline for CAD-Based Novel Object Segmentation
Van Nguyen Nguyen, Thibault Groueix, Georgy Ponimatkin, Vincent Lepetit, Tomas Hodan
Combating Coronary Calcium Scoring Bias for Non-Gated CT by Semantic Learning on Gated CT
Jiajian Li, Anwei Li, Jiansheng Fang, Yonghe Hou, Chao Song, Huifang Yang, Jingwen Wang, Hongbo Liu, Jiang Liu
Confusing Large Models by Confusing Small Models
Vítor Albiero, Raghav Mehta, Ivan Evtimov, Samuel J. Bell, Levent Sagun, Aram Markosyan
Continual Evidential Deep Learning for Out-of-Distribution Detection
Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost van de Weijer
Controllable Inversion of Black-Box Face Recognition Models via Diffusion
Manuel Kansy, Anton Raël, Graziana Mignone, Jacek Naruniec, Christopher Schroers, Markus Gross, Romann M. Weber
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
Aman Kumar, Khushboo Anand, Shubham Mandloi, Ashutosh Mishra, Avinash Thakur, Neeraj Kasera, A. P. Prathosh
Cross-Grained Contrastive Representation for Unsupervised Lesion Segmentation in Medical Images
Ziqi Yu, Botao Zhao, Yipin Zhang, Shengjie Zhang, Xiang Chen, Haibo Yang, Tingying Peng, Xiao-Yong Zhang
Cross-Model Temporal Cooperation via Saliency Maps for Efficient Frame Classification
Tomaso Trinci, Tommaso Bianconcini, Leonardo Sarti, Leonardo Taccari, Francesco Sambo
D-ViSA: A Dataset for Detecting Visual Sentiment from Art Images
Seoyun Kim, ChaeHee An, Junyeop Cha, Dongjae Kim, Eunil Park
Decision Boundary Optimization for Few-Shot Class-Incremental Learning
Chenxu Guo, Qi Zhao, Shuchang Lyu, Binghao Liu, Chunlei Wang, Lijiang Chen, Guangliang Cheng
Deep Learning for Apple Fruit Quality Inspection Using X-Ray Imaging
Astrid Tempelaere, Leen Van Doorselaer, Jiaqi He, Pieter Verboven, Tinne Tuytelaars, Bart M. Nicolaï
DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image Datasets
Alokendu Mazumder, Tirthajit Baruah, Akash Kumar Singh, Pagadala Krishna Murthy, Vishwajeet Pattanaik, Punit Rathore
Detection of Fusarium Damaged Kernels in Wheat Using Deep Semi-Supervised Learning on a Novel WheatSeedBelt Dataset
Keyhan Najafian, Lingling Jin, H. Randy Kutcher, Mackenzie Hladun, Samuel Horovatin, Maria Alejandra Oviedo-Ludena, Sheila Maria Pereira De Andrade, Lipu Wang, Ian Stavness
DFM-X: Augmentation by Leveraging Prior Knowledge of Shortcut Learning
Shunxin Wang, Christoph Brune, Raymond N. J. Veldhuis, Nicola Strisciuglio
Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation
Jieming Zhou, Tong Zhang, Zeeshan Hayder, Lars Petersson, Mehrtash Harandi
DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
Cédric Rommel, Eduardo Valle, Mickaël Chen, Souhaiel Khalfaoui, Renaud Marlet, Matthieu Cord, Patrick Pérez
Direct Unsupervised Denoising
Benjamin Salmon, Alexander Krull
Domain Adversarial Learning Towards Underwater Image Enhancement
Meghna Kapoor, Rohan Baghel, Badri Narayan Subudhi, Vinit Jakhetiya, Ankur Bansal
DONNAv2 - Lightweight Neural Architecture Search for Vision Tasks
Sweta Priyadarshi, Tianyu Jiang, Hsin-Pai Cheng, Sendil Krishna, Viswanath Ganapathy, Chirag Patel
Drones4Good: Supporting Disaster Relief Through Remote Sensing and AI
Nina Merkle, Reza Bahmanyar, Corentin Henry, Seyed Majid Azimi, Xiangtian Yuan, Simon Schopferer, Veronika Gstaiger, Stefan Auer, Anne Schneibel, Marc Wieland, Thomas Kraft
Dynamic Scene Graph Representation for Surgical Video
Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab
ECO: Ensembling Context Optimization for Vision-Language Models
Lorenzo Agnolucci, Alberto Baldrati, Francesco Todino, Federico Becattini, Marco Bertini, Alberto Del Bimbo
Efficient Grapevine Structure Estimation in Vineyards Conditions
Theophile Gentilhomme, Michael Villamizar, Jerome Corre, Jean-Marc Odobez
Efficient Neural PDE-Solvers Using Quantization Aware Training
Winfried van den Dool, Tijmen Blankevoort, Max Welling, Yuki M. Asano
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor
Entropic Score Metric: Decoupling Topology and Size in Training-Free NAS
Niccolò Cavagnero, Luca Robbiano, Francesca Pistilli, Barbara Caputo, Giuseppe Averta
Estimation of Crop Production by Fusing Images and Crop Features
Ángela Casado-García, Jónathan Heras, Xabier Simon Martínez-Goñi, Jon Miranda-Apodaca, Usue Pérez-López
Evaluation of 3D Reconstruction for Cultural Heritage Applications
Cristián Llull, Nelson Baloian, Benjamin Bustos, Kornelius Kupczik, Ivan Sipiran, Andres Baloian
Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning
Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Concetto Spampinato, Simone Palazzo, Bruno Casella, Marco Aldinucci
Explaining Through Transformer Input Sampling
Alexandre Englebert, Sédrick Stassin, Géraldin Nanfack, Sidi Ahmed Mahmoudi, Xavier Siebert, Olivier Cornu, Christophe De Vleeschouwer
Exploring Inlier and Outlier Specification for Improved Medical OOD Detection
Vivek Sivaraman Narayanaswamy, Yamen Mubarka, Rushil Anirudh, Deepta Rajan, Jayaraman J. Thiagarajan
Factorized Dynamic Fully-Connected Layers for Neural Networks
Francesca Babiloni, Thomas Tanay, Jiankang Deng, Matteo Maggioni, Stefanos Zafeiriou
Fair Robust Active Learning by Joint Inconsistency
Tsung-Han Wu, Hung-Ting Su, Shang-Tse Chen, Winston H. Hsu
Fast Object Detection in High-Resolution Videos
Ryan Tran, Atul Kanaujia, Vasu Parameswaran
FedLID: Self-Supervised Federated Learning for Leveraging Limited Image Data
Athanasios Psaltis, Anestis Kastellos, Charalampos Z. Patrikakis, Petros Daras
FireFly: A Synthetic Dataset for Ember Detection in Wildfire
Yue Hu, Xinan Ye, Yifei Liu, Souvik Kundu, Gourav Datta, Srikar Mutnuri, Namo Asavisanu, Nora Ayanian, Konstantinos Psounis, Peter A. Beerel
FIVA: Facial Image and Video Anonymization and Anonymization Defense
Felix Rosberg, Eren Erdal Aksoy, Cristofer Englund, Fernando Alonso-Fernandez
Flashback for Continual Learning
Leila Mahmoodi, Mehrtash Harandi, Peyman Moghadam
Focus on Content Not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN
Jonas Utz, Tobias Weise, Maja Schlereth, Fabian Wagner, Mareike Thies, Mingxuan Gu, Stefan Uderhardt, Katharina Breininger
From Scarcity to Understanding: Transfer Learning for the Extremely Low Resource Irish Sign Language
Ruth Holmes, Ellen Rushe, Mathieu De Coster, Maxim Bonnaerens, Shinichi Satoh, Akihiro Sugimoto, Anthony Ventresque
Fusion Approaches to Predict Post-Stroke Aphasia Severity from Multimodal Neuroimaging Data
Saurav Chennuri, Sha Lai, Anne Billot, Maria Varkanitsa, Emily J. Braun, Swathi Kiran, Archana Venkataraman, Janusz Konrad, Prakash Ishwar, Margrit Betke
GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations
Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Rubén Vera-Rodríguez, Dominik Lawatsch, Florian Domin, Maxim Schaubert
Geometric Contrastive Learning
Yeskendir Koishekenov, Sharvaree P. Vadgama, Riccardo Valperga, Erik J. Bekkers
Good Fences Make Good Neighbours
Imanol González Estepa, Jesús M. Rodríguez-de-Vera, Bhalaji Nagarajan, Petia Radeva
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
Juanita Puentes, Angela Castillo, Wilmar Osejo, Yuly Calderón, Viviana Quintero, Lina Saldarriaga, Diana Agudelo, Pablo Arbeláez
HyperCoil-Recon: A Hypernetwork-Based Adaptive Coil Configuration Task Switching Network for MRI Reconstruction
Sriprabha Ramanarayanan, Mohammad Al Fahim, G. S. Rahul, Amrit Kumar Jethi, Keerthi Ram, Mohanasankar Sivaprakasam
Hyperspectral Imaging of In-Site Stained Glasses: Illumination Variation Compensation Using Two Perpendicular Scans
Suzan Joseph Kessy, Takuya Funatomi, Kazuya Kitano, Yuki Fujimura, Guillaume Caron, El Mustapha Mouaddib, Yasuhiro Mukaigawa
Identifying Out-of-Domain Objects with Dirichlet Deep Neural Networks
Ahmed Hammam, Frank Bonarens, Seyed Eghbal Ghobadi, Christoph Stiller
ILSH: The Imperial Light-Stage Head Dataset for Human Head View Synthesis
Jiali Zheng, Youngkyoon Jang, Athanasios Papaioannou, Christos Kampouris, Rolandos Alexandros Potamias, Foivos Paraperas Papantoniou, Efstathios Galanakis, Ales Leonardis, Stefanos Zafeiriou
Implicit Neural Representation in Medical Imaging: A Comparative Survey
Amirali Molaei, Amirhossein Aminimehr, Armin Tavakoli, Amirhossein Kazerouni, Bobby Azad, Reza Azad, Dorit Merhof
Improving Automatic Endoscopic Stone Recognition Using a Multi-View Fusion Approach Enhanced with Two-Step Transfer Learning
Francisco Javier López-Tiro, Elias Villalvazo-Avila, Juan Pablo Betancur-Rengifo, Iván Reyes-Amezcua, Jacques Hubert, Gilberto Ochoa-Ruiz, Christian Daul
Instant Continual Learning of Neural Radiance Fields
Ryan Po, Zhengyang Dong, Alexander W. Bergman, Gordon Wetzstein
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
Sharath Nittur Sridhar, Souvik Kundu, Sairam Sundaresan, Maciej Szankin, Anthony Sarah
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
Wei-Jhe Huang, Jheng-Hsien Yeh, Min-Hung Chen, Gueter Josmy Faure, Shang-Hong Lai
Is There Progress in Activity Progress Prediction?
Frans de Boer, Jan C. van Gemert, Jouke Dijkstra, Silvia L. Pintea
Iterative Robust Visual Grounding with Masked Reference Based Centerpoint Supervision
Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao
Just Ask Plus: Using Transcripts for VideoQA
Mohammad Javad Pirhadi, Motahhare Mirzaei, Sauleh Eetemadi
Kinship Representation Learning with Face Componential Relation
Wen-Tai Su, Min-Hung Chen, Chien-Yi Wang, Shang-Hong Lai, Trista Pei-Chun Chen
Language-Enhanced RNR-mAP: Querying Renderable Neural Radiance Field Maps with Natural Language
Francesco Taioli, Federico Cunico, Federico Girella, Riccardo Bologna, Alessandro Farinelli, Marco Cristani
LatentSwap3D: Semantic Edits on 3D Image GANs
Enis Simsar, Alessio Tonioni, Evin Pinar Örnek, Federico Tombari
Learning to Rank Approach for Refining Image Retrieval in Visual Arts
Tetiana Yemelianenko, Iuliia Tkachenko, Tess Masclef, Mihaela Scuturici, Serge Miguet
LEMMS: Label Estimation of Multi-Feature Movie Segments
Bartolomeo Vacchetti, Dawit Mureja Argaw, Tania Cequtelli
Leveraging Classic Deconvolution and Feature Extraction in Zero-Shot Image Restoration
Tomás Chobola, Gesine Müller, Veit Dausmann, Anton Theileis, Jan Taucher, Jan Huisken, Tingying Peng
LightNet: Generative Model for Enhancement of Low-Light Images
Chaitra Desai, Nikhil Akalwadi, Amogh Joshi, Sampada Malagi, Chinmayee Mandi, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling
Kaijing Ma, Xianghao Zang, Zerun Feng, Han Fang, Chao Ban, Yuhan Wei, Zhongjiang He, Yongxiang Li, Hao Sun
Mapping Memes to Words for Multimodal Hateful Meme Classification
Giovanni Burbi, Alberto Baldrati, Lorenzo Agnolucci, Marco Bertini, Alberto Del Bimbo
Memory Population in Continual Learning via Outlier Elimination
Julio Hurtado, Alain Raymond-Saez, Vladimir Araujo, Vincenzo Lomonaco, Alvaro Soto, Davide Bacciu
Memory-Augmented Variational Adaptation for Online Few-Shot Segmentation
Jie Liu, Yingjun Du, Zehao Xiao, Cees G. M. Snoek, Jan-Jakob Sonke, Efstratios Gavves
MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection
Tianpeng Bao, Jiadong Chen, Wei Li, Xiang Wang, Jingjing Fei, Liwei Wu, Rui Zhao, Ye Zheng
Mind the Clot: Automated LVO Detection on CTA Using Deep Learning
Shubham Kumar, Arjun Agarwal, Satish Golla, Swetha Tanamala, Ujjwal Upadhyay, Subhankar Chattoraj, Preetham Putha, Sasank Chilamkurthy
Modeling Visual Impairments with Artificial Neural Networks: A Review
Lucia Schiatti, Monica Gori, Martin Schrimpf, Giulia Cappagli, Federica Morelli, Sabrina Signorini, Boris Katz, Andrei Barbu
MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP
Prajwal Ganugula, Y. S. S. S. Santosh Kumar, N. K. Sagar Reddy, Prabhath Chellingi, Avinash Thakur, Neeraj Kasera, C. Shyam Anand
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers
Jakob Drachmann Havtorn, Amélie Royer, Tijmen Blankevoort, Babak Ehteshami Bejnordi
Multi-Task Consistency for Active Learning
Aral Hekimoglu, Philipp Friedrich, Walter Zimmer, Michael Schmidt, Alvaro Marcos-Ramiro, Alois Knoll
Multi-Task Hypergraphs for Semi-Supervised Learning Using Earth Observations
Mihai Cristian Pîrvu, Alina Marcu, Alexandra Dobrescu, Nabil Belbachir, Marius Leordeanu
Multimodal Error Correction with Natural Language and Pointing Gestures
Stefan Constantin, Fevziye Irem Eyiokur, Dogucan Yaman, Leonard Bärmann, Alex Waibel
Multimodal Neurons in Pretrained Text-Only Transformers
Sarah Schwettmann, Neil Chowdhury, Samuel Klein, David Bau, Antonio Torralba
Multimodal Parameter-Efficient Few-Shot Class Incremental Learning
Marco D'Alessandro, Alberto Alonso, Enrique Calabrés, Mikel Galar
NCQS: Nonlinear Convex Quadrature Surrogate Hyperparameter Optimization
Sophia J. Abraham, Kehelwala Dewage Gayan Maduranga, Jeffery Kinnison, Jonathan D. Hauenstein, Walter J. Scheirer
NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions
Mohamad Shahbazi, Evangelos Ntavelis, Alessio Tonioni, Edo Collins, Danda Pani Paudel, Martin Danelljan, Luc Van Gool
Noise-in, Bias-Out: Balanced and Real-Time MoCap Solving
Georgios Albanis, Nikolaos Zioulis, Spyridon Thermos, Anargyros Chatzitofis, Kostas Kolomvatsos
NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects
Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai
nuScenes Knowledge Graph - A Comprehensive Semantic Representation of Traffic Scenes for Trajectory Prediction
Leon Mlodzian, Zhigang Sun, Hendrik Berkemeyer, Sebastian Monka, Zixu Wang, Stefan Dietze, Lavdim Halilaj, Juergen Luettin
On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers
Thomas De Min, Massimiliano Mancini, Karteek Alahari, Xavier Alameda-Pineda, Elisa Ricci
On the Unreasonable Vulnerability of Transformers for Image Restoration - And an Easy Fix
Shashank Agnihotri, Kanchana Vaishnavi Gandikota, Julia Grabinski, Paramanand Chandramouli, Margret Keuper
On-Device Real-Time Custom Hand Gesture Recognition
Esha Uboweja, David Tian, Qifei Wang, Yi-Chun Kuo, Joe Zou, Lu Wang, George Sung, Matthias Grundmann
Online Detection of AI-Generated Images
David C. Epstein, Ishan Jain, Oliver Wang, Richard Zhang
Padding Aware Neurons
Dario Garcia-Gasulla, Victor Gimenez-Abalos, Pablo Agustin Martin-Torres
Painter: Teaching Auto-Regressive Language Models to Draw Sketches
Reza Pourreza, Apratim Bhattacharyya, Sunny Panchal, Mingu Lee, Pulkit Madan, Roland Memisevic
Pathology-Based Ischemic Stroke Etiology Classification via Clot Composition Guided Multiple Instance Learning
Mara Pleasure, Ekaterina Redekop, Jennifer S. Polson, Haoyue Zhang, Naoki Kaneko, William Speier, Corey W. Arnold
Personalized 3D Human Pose and Shape Refinement
Tom Wehrbein, Bodo Rosenhahn, Iain A. Matthews, Carsten Stoll
Pigment Mapping for Tomb Murals Using Neural Representation and Physics-Based Model
Mayuka Tsuji, Yuki Fujimura, Takuya Funatomi, Yasuhiro Mukaigawa, Tetsuro Morimoto, Takeshi Oishi, Jun Takamatsu, Katsushi Ikeuchi
Plant Root Occlusion Inpainting with Generative Adversarial Network
Hao Song, Karim Panjvani, Zhigang Liu, Huzaifa Amar, Leon Kochian, Shengjian Ye, Xuan Yang, J. Allan Feurtado, Krunal Chavda, Karina Angela Chimbo Huatatoca, Mark G. Eramian
Pointing Out Human Answer Mistakes in a Goal-Oriented Visual Dialogue
Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima
PRAT: PRofiling Adversarial aTtacks
Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh S. Rawat
QBitOpt: Fast and Accurate Bitwidth Reallocation During Training
Jorn Peters, Marios Fournarakis, Markus Nagel, Mart van Baalen, Tijmen Blankevoort
Quantized Generative Models for Solving Inverse Problems
Kartheek Kumar Reddy Nareddy, Vinayak Killedar, Chandra Sekhar Seelamantula
Raising the Bar on the Evaluation of Out-of-Distribution Detection
Jishnu Mukhoti, Tsung-Yu Lin, Bor-Chun Chen, Ashish Shah, Philip H. S. Torr, Puneet K. Dokania, Ser-Nam Lim
Rapid Building Damage Assessment Workflow: An Implementation for the 2023 Rolling Fork, Mississippi Tornado Event
Caleb Robinson, Simone Fobi Nsutezo, Anthony Ortiz, Tina Sederholm, Rahul Dodhia, Cameron Birge, Kasie Richards, Kris Pitcher, Paulo Duarte, Juan M. Lavista Ferres
Rapid Flood Inundation Forecast Using Fourier Neural Operator
Alexander Y. Sun, Zhi Li, Wonhyun Lee, Qixing Huang, Bridget R. Scanlon, Clint Dawson
Rapid Tomato DUS Trait Analysis Using an Optimized Mobile-Based Coarse-to-Fine Instance Segmentation Algorithm
Dan Jeric Arcega Rustia, Guido Alexander Jansen, Selwin Hageraats, Joseph Peller, Rick van de Zedde, Cécile Marchennay, Wim Sangster, Gosia Blokker
RCV2023 Challenges: Benchmarking Model Training and Inference for Resource-Constrained Deep Learning
Rishabh Tiwari, Arnav Chavan, Deepak K. Gupta, Gowreesh Mago, Animesh Gupta, Akash Gupta, Suraj Sharan, Yukun Yang, Shanwei Zhao, Shihao Wang, Youngjun Kwak, Seonghun Jeong, Yunseung Lee, Changick Kim, Subin Kim, Ganzorig Gankhuyag, Ho Jung, Junwhan Ryu, HaeMoon Kim, Byeong Hak Kim, Tu Vo, Sheir Zaheer, Alexander Holston, Chan Y. Park, Dheemant Dixit, Nahush Lele, Kushagra Bhushan, Debjani Bhowmick, Devanshu Arya, Sadaf Gulshad, Amirhossein Habibian, Amir Ghodrati, Babak Ehteshami Bejnordi, Jai Gupta, Zhuang Liu, Jiahui Yu, Dilip K. Prasad, Zhiqiang Shen
Reinforcement Learning for Instance Segmentation with High-Level Priors
Paul Hilt, Maedeh Zarvandi, Edgar Kaziakhmedov, Sourabh Bhide, Maria Leptin, Constantin Pape, Anna Kreshuk
Reinforcement Learning with Space Carving for Plant Scanning
Antonio Pico Villalpando, Matthias Kubisch, David Colliaux, Peter Hanappe, Verena V. Hafner
Relational Prior Knowledge Graphs for Detection and Instance Segmentation
Osman Ülger, Yu Wang, Ysbrand Galama, Sezer Karaoglu, Theo Gevers, Martin R. Oswald
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li
RheumaVIT: Transformer-Based Model for Automated Scoring of Hand Joints in Rheumatoid Arthritis
Alexander Stolpovsky, Elizaveta Dakhova, Polina Druzhinina, Polina Postnikova, Daniil Kudinsky, Alexander Smirnov, Anastasia Sukhinina, Alexander Lila, Anvar Kurmukov
Robust AMD Stage Grading with Exclusively OCTA Modality Leveraging 3D Volume
Haochen Zhang, Anna Heinke, Carlo Miguel B. Galang, Daniel N. Deussen, Bo Wen, Dirk-Uwe G. Bartsch, William R. Freeman, Truong Q. Nguyen, Cheolhong An
S2RF: Semantically Stylized Radiance Fields
Moneish Kumar, Neeraj Panse, Dishani Lahiri
SAM-Adapter: Adapting Segment Anything in Underperformed Scenes
Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Shangzhan Zhang, Zejian Li, Lingyun Sun, Ying Zang, Papa Mao
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab
Segmentation-Based Assessment of Tumor-Vessel Involvement for Surgical Resectability Prediction of Pancreatic Ductal Adenocarcinoma
Christiaan G. A. Viviers, Mark Ramaekers, M. M. Amaan Valiuddin, Terese Hellström, Nick Tasios, John van der Ven, Igor Jacobs, Lotte Ewals, Joost Nederend, Peter H. N. de With, Misha Luyer, Fons van der Sommen
Selective Freezing for Efficient Continual Learning
Amelia Sorrenti, Giovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Concetto Spampinato, Simone Palazzo
Self-Supervised Anomaly Detection from Anomalous Training Data via Iterative Latent Token Masking
Ashay Patel, Petru-Daniel Tudosiu, Walter H. L. Pinaya, Mark S. Graham, Olusola Adeleke, Gary J. Cook, Vicky Goh, Sébastien Ourselin, M. Jorge Cardoso
Self-Supervised Hypergraphs for Learning Multiple World Interpretations
Alina Marcu, Mihai Cristian Pîrvu, Dragos Costea, Emanuela Haller, Emil Slusanschi, Nabil Belbachir, Rahul Sukthankar, Marius Leordeanu
Self-Supervised Semantic Segmentation: Consistency over Transformation
Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof
Semantic Motif Segmentation of Archaeological Fresco Fragments
Aref Enayati, Luca Palmieri, Sebastiano Vascon, Marcello Pelillo, Sinem Aslan
Semantic RGB-D Image Synthesis
Shijie Li, Rong Li, Juergen Gall
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi
Semi-Supervised Quality Evaluation of Colonoscopy Procedures
Idan Kligvasser, George Leifman, Roman Goldenberg, Ehud Rivlin, Michael Elad
SEPAL: Spatial Gene Expression Prediction from Local Graphs
Gabriel Mejía, Paula Cárdenas, Daniela Ruiz, Angela Castillo, Pablo Arbeláez
Shapley Deep Learning: A Consensus for General-Purpose Vision Systems
Youcef Djenouri, Ahmed Nabil Belbachir, Tomasz P. Michalak, Anis Yazidi
SHARP Challenge 2023: Solving CAD History and pArameters Recovery from Point Clouds and 3D Scans. Overview, Datasets, Metrics, and Baselines
Dimitrios Mallis, Sk Aziz Ali, Elona Dupont, Kseniya Cherenkova, Ahmet Serdar Karadeniz, Mohammad Sadil Khan, Anis Kacem, Gleb Gusev, Djamila Aouada
ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty
Vanessa Wirth, Anna-Maria Liphardt, Birte Coppers, Johanna Bräunig, Simon Heinrich, Sigrid Leyendecker, Arnd Kleyer, Georg Schett, Martin Vossiek, Bernhard Egger, Marc Stamminger
SHOWMe: Benchmarking Object-Agnostic Hand-Object 3D Reconstruction
Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Brégier, Matthieu Armando, Jean-Sébastien Franco, Grégory Rogez
Single-Shot Pruning for Pre-Trained Models: Rethinking the Importance of Magnitude Pruning
Hirokazu Kohama, Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
SoftMax Bias Correction for Quantized Generative Models
Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel
Sparse Linear Concept Discovery Models
Konstantinos P. Panousis, Dino Ienco, Diego Marcos
Spatio-Temporal Analysis of Patient-Derived Organoid Videos Using Deep Learning for the Prediction of Drug Efficacy
Leo Fillioux, Emilie Gontran, Jérôme Cartry, Jacques RR Mathieu, Sabrina Bedja, Alice Boilève, Paul-Henry Cournède, Fanny Jaulin, Stergios Christodoulidis, Maria Vakalopoulou
Spatio-Temporal Convolution-Attention Video Network
Ali Diba, Vivek Sharma, Mohammad Mahdi Arzani, Luc Van Gool
SpyroPose: SE(3) Pyramids for Object Pose Distribution Estimation
Rasmus Laurvig Haugaard, Frederik Hagelskjær, Thorbjørn Mosekjær Iversen
STRIDE: Street View-Based Environmental Feature Detection and Pedestrian Collision Prediction
Cristina González, Nicolás Ayobi, Felipe Escallón, Laura Baldovino-Chiquillo, Maria Wilches-Mogollón, Donny Pasos, Nicole Ramírez, José Pinzón, Olga L. Sarmiento, D. Alex Quistberg, Pablo Arbeláez
SynDrone - Multi-Modal UAV Dataset for Urban Scenarios
Giulia Rizzoli, Francesco Barbato, Matteo Caligiuri, Pietro Zanuttigh
TeleViT: Teleconnection-Driven Transformers Improve Subseasonal to Seasonal Wildfire Forecasting
Ioannis Prapas, Nikolaos-Ioannis Bountos, Spyros Kondylatos, Dimitrios Michail, Gustau Camps-Valls, Ioannis Papoutsis
Temporal DINO: A Self-Supervised Video Strategy to Enhance Action Prediction
Izzeddin Teeti, Rongali Sai Bhargav, Vivek Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results
Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo
The Robust Semantic Segmentation UNCV2023 Challenge Results
Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi
THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-Conditioned Human Motion Prediction
Tiago Rodrigues de Almeida, Andrey Rudenko, Tim Schreiter, Yufei Zhu, Eduardo Gutiérrez-Maestro, Lucas Morillo-Méndez, Tomasz Piotr Kucner, Óscar Martínez Mozos, Martin Magnusson, Luigi Palmieri, Kai O. Arras, Achim J. Lilienthal
Tiny and Efficient Model for the Edge Detection Generalization
Xavier Soria, Yachuan Li, Mohammad Rouhani, Angel Domingo Sappa
Towards Fixing Clever-Hans Predictors with Counterfactual Knowledge Distillation
Sidney Bender, Christopher J. Anders, Pattarawat Chormai, Heike Marxfeld, Jan Herrmann, Grégoire Montavon
Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis
Thanh-Huy Nguyen, Quang-Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen-Quoc-Khanh Le
TSOSVNet: Teacher-Student Collaborative Knowledge Distillation for Online Signature Verification
Chandra Sekhar Vorugunti, Avinash Gautam, Viswanath Pulabaigari, Sreeja Sr, Rama Krishna Sai G
UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM
Erik Sandström, Kevin Ta, Luc Van Gool, Martin R. Oswald
Undercover Deepfakes: Detecting Fake Segments in Videos
Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman K. Halgamuge
Unified Automatic Plant Cover and Phenology Prediction
Matthias Körschens, Solveig Franziska Bucher, Christine Römermann, Joachim Denzler
Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features
Travis Zhang, Katie Luo, Cheng Perng Phoo, Yurong You, Wei-Lun Chao, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger
Using and Abusing Equivariance
Tom Edixhoven, Attila Lengyel, Jan C. van Gemert
Video Action Recognition with Adaptive Zooming Using Motion Residuals
Mostafa Shahabinejad, Irina Kezele, Seyed Shahabeddin Nabavi, Wentao Liu, Seel Patel, Yuanhao Yu, Yang Wang, Jin Tang
Vision-Based Treatment Localization with Limited Data: Automated Documentation of Military Emergency Medical Procedures
Trevor Powers, Elaheh Hatamimajoumerd, William Chu, Vishakk Rajendran, Rishi Shah, Frank Diabour, Marc Vaillant, Richard Fletcher, Sarah Ostadabbas
VLMAH: Visual-Linguistic Modeling of Action History for Effective Action Anticipation
Victoria Manousaki, Konstantinos Bacharidis, Konstantinos E. Papoutsakis, Antonis A. Argyros
VSCHH 2023: A Benchmark for the View Synthesis Challenge of Human Heads
Youngkyoon Jang, Jiali Zheng, Jifei Song, Helisa Dhamo, Eduardo Pérez-Pellitero, Thomas Tanay, Matteo Maggioni, Richard Shaw, Sibi Catley-Chandar, Yiren Zhou, Jiankang Deng, Ruijie Zhu, Jiahao Chang, Ziyang Song, Jiahuan Yu, Tianzhu Zhang, Khanh-Binh Nguyen, Joon-Sung Yang, Andreea Dogaru, Bernhard Egger, Heng Yu, Aarush Gupta, Joel Julin, László A. Jeni, Hyeseong Kim, Jungbin Cho, Dosik Hwang, Deukhee Lee, Doyeon Kim, Dongseong Seo, SeungJin Jeon, YoungDon Choi, Jun Seok Kang, Ahmet Cagatay Seker, Sang Chul Ahn, Ales Leonardis, Stefanos Zafeiriou
Weakly Semi-Supervised Detector-Based Video Classification with Temporal Context for Lung Ultrasound
Gary Y. Li, Li Chen, Mohsen Zahiri, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Cynthia Gregory, Kenton W. Gregory, Balasundar Raju, Jochen Kruecker, Alvin Chen
Weed Mapping with Convolutional Neural Networks on High Resolution Whole-Field Images
Yuemin Wang, Thuan Ha, Kathryn Aldridge, Hema Sudhakar Duddu, Steve Shirtliffe, Ian Stavness
What Does Really Count? Estimating Relevance of Corner Cases for Semantic Segmentation in Automated Driving
Jasmin Breitenstein, Florian Heidecker, Maria Lyssenko, Daniel Bogdoll, Maarten Bieshaar, J. Marius Zöllner, Bernhard Sick, Tim Fingscheidt
When Layers Play the Lottery, All Tickets Win at Initialization
Artur Jordão, George Corrêa de Araújo, Helena de Almeida Maia, Hélio Pedrini
Which Tokens to Use? Investigating Token Reduction in Vision Transformers
Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor, Thomas B. Moeslund
YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems
Ivan Lazarevich, Matteo Grimaldi, Ravish Kumar, Saptarshi Mitra, Shahrukh Khan, Sudhakar Sah
You Can Have Your Ensemble and Run It Too - Deep Ensembles Spread over Time
Isak Meding, Alexander Bodin, Adam Tonderski, Joakim Johnander, Christoffer Petersson, Lennart Svensson
ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks
Kartikeya Bhardwaj, Hsin-Pai Cheng, Sweta Priyadarshi, Zhuojin Li