ICCVW 2023

498 papers

2D Cross-View Object Segmentation and Perceptual Grouping in Computer-Aided Design Drawings Mohamed Dhia Elhak Besbes, Zahra Vahidi Ferdousi, Hedi Tabia, Mouna Fradi
360° from a Single Camera: A Few-Shot Approach for LiDAR Segmentation Laurenz Reichardt, Nikolas Ebert, Oliver Wasenmüller
3D Surface Approximation of the Entire Bayeux Tapestry for Improved Pedagogical Access Marjorie Redon, Matthieu Pizenberg, Yvain Quéau, Abderrahim Elmoataz
A Comparative Study of Vision Transformer Encoders and Few-Shot Learning for Medical Image Classification Maxat Nurgazin, Nguyen Anh Tu
A Comprehensive Empirical Evaluation on Online Continual Learning Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Vincenzo Lomonaco, Joost van de Weijer, Hamed Hemati
A Comprehensive Framework for Evaluating Deepfake Generators: Dataset, Metrics Performance, and Comparative Analysis Sahar Husseini, Jean-Luc Dugelay
A Comprehensive Study of Transfer Learning Under Constraints Tom Pégeot, Inna Kucher, Adrian Popescu, Bertrand Delezoide
A Cross-Dataset Study on the Brazilian Sign Language Translation Amanda Hellen de Avellar Sarmento, Moacir Antonelli Ponti
A Dual Perspective of Human Motion Analysis - 3D Pose Estimation and 2D Trajectory Prediction Mayssa Zaier, Hazem Wannous, Hassen Drira, Jacques Boonaert
A Gated Attention Transformer for Multi-Person Pose Tracking Andreas Doering, Juergen Gall
A Horse with No Labels: Self-Supervised Horse Pose Estimation from Unlabelled Images and Synthetic Prior Jose Sosa, David C. Hogg
A Hybrid Visual Transformer for Efficient Deep Human Activity Recognition Youcef Djenouri, Ahmed Nabil Belbachir
A Lightweight Skeleton-Based 3D-CNN for Real-Time Fall Detection and Action Recognition Nadhira Noor, In Kyu Park
A New Dataset for End-to-End Sign Language Translation: The Greek Elementary School Dataset Andreas Voskou, Konstantinos P. Panousis, Harris Partaourides, Kyriakos Tolias, Sotirios Chatzis
A New Large Dataset and a Transfer Learning Methodology for Plant Phenotyping in Vertical Farms Nico Samà, Etienne David, Simone Rossetti, Alessandro Antona, Benjamin Franchetti, Fiora Pirri
A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation Jan-Aike Termöhlen, Timo Bartels, Tim Fingscheidt
A Simple and Explainable Method for Uncertainty Estimation Using Attribute Prototype Networks Claudius Zelenka, Andrea Göhring, Daniyal Kazempour, Maximilian Hünemörder, Lars Schmarje, Peer Kröger
A Simple and Generic Framework for Feature Distillation via Channel-Wise Transformation Ziwei Liu, Yongtao Wang, Xiaojie Chu, Nan Dong, Shengxiang Qi, Haibin Ling
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation Applied to Vision Transformers Matteo Bastico, David Ryckelynck, Laurent Corté, Yannick Tillier, Etienne Decencière
A Simple Signal for Domain Shift Goirik Chakrabarty, Manogna Sreenivas, Soma Biswas
A Unified Approach for Occlusion Tolerant 3D Facial Pose Capture and Gaze Estimation Using MocapNETs Ammar Qammaz, Antonis A. Argyros
Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah
Accidental Turntables: Learning 3D Pose by Watching Objects Turn Zezhou Cheng, Matheus Gadelha, Subhransu Maji
Accumulation Knowledge Distillation for Conditional GAN Compression Tingwei Gao, Rujiao Long
ACTIS: Improving Data Efficiency by Leveraging Semi-Supervised Augmentation Consistency Training for Instance Segmentation Josef Lorenz Rumberger, Jannik Franzen, Peter Hirsch, Jan Philipp Albrecht, Dagmar Kainmueller
Actor-Agnostic Multi-Label Action Recognition with Multi-Modal Query Anindya Mondal, Sauradip Nag, Joaquin M. Prada, Xiatian Zhu, Anjan Dutta
AD-CLIP: Adapting Domains in Prompt Space Using CLIP Mainak Singha, Harsh Pal, Ankit Jha, Biplab Banerjee
Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-Free Continual Learning Filip Szatkowski, Mateusz Pyla, Marcin Przewiezlikowski, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski
Adapting Vision Foundation Models for Plant Phenotyping Feng Chen, Mario Valerio Giuffrida, Sotirios A. Tsaftaris
Adaptive Self-Training for Object Detection Renaud Vandeghen, Gilles Louppe, Marc Van Droogenbroeck
Advanced Augmentation and Ensemble Approaches for Classifying Long-Tailed Multi-Label Chest X-Rays Trong-Hieu Nguyen Mau, Tuan-Luc Huynh, Thanh-Danh Le, Hai-Dang Nguyen, Minh-Triet Tran
Adversarial Attacks Against Uncertainty Quantification Emanuele Ledda, Daniele Angioni, Giorgio Piras, Giorgio Fumera, Battista Biggio, Fabio Roli
Adversarial Examples with Specular Highlights Vanshika Vats, Koteswar Rao Jerripothula
Affordance Segmentation of Hand-Occluded Containers from Exocentric Images Tommaso Apicella, Alessio Xompero, Edoardo Ragusa, Riccardo Berta, Andrea Cavallaro, Paolo Gastaldo
ALFA - Leveraging All Levels of Feature Abstraction for Enhancing the Generalization of Histopathology Image Classification Across Unseen Hospitals Milad Sikaroudi, Seyedeh Maryam Hosseini, Shahryar Rahnamayan, Hamid R. Tizhoosh
Alignment and Generation Adapter for Efficient Video-Text Understanding Han Fang, Zhifei Yang, Yuhan Wei, Xianghao Zang, Chao Ban, Zerun Feng, Zhongjiang He, Yongxiang Li, Hao Sun
All-Pairs Consistency Learning for Weakly Supervised Semantic Segmentation Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, Nick Barnes
An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports Ethan Dack, Lorenzo Brigato, Matthew McMurray, Matthias Fontanellaz, Thomas Frauenfelder, Hanno Hoppe, Aristomenis K. Exadaktylos, Thomas Geiser, Manuela Funke-Chambour, Andreas Christe, Lukas Ebner, Stavroula G. Mougiakakou
An Empirical Analysis of Range for 3D Object Detection Neehar Peri, Mengtian Li, Benjamin Wilson, Yu-Xiong Wang, James Hays, Deva Ramanan
An Empirical Study of the Effect of Video Encoders on Temporal Video Grounding Ignacio M. De La Jara, Cristian Rodriguez Opazo, Edison Marrese-Taylor, Felipe Bravo-Marquez
An Experimental Protocol for Neural Architecture Search in Super-Resolution Jesús Leopoldo Llano García, Raúl Monroy, Víctor Adrián Sosa-Hernández
An Interactive Method for Adaptive Acquisition in Reflectance Transformation Imaging for Cultural Heritage Muhammad Arsalan Khawaja, Sony George, Franck Marzani, Jon Yngve Hardeberg, Alamin Mansouri
An Interpretable Framework to Characterize Compound Treatments on Filamentous Fungi Using Cell Painting and Deep Metric Learning Laurent Lejeune, Morgane Roussin, Bruno Leggio, Aurélia Vernay
An Optimized Ensemble Framework for Multi-Label Classification on Long-Tailed Chest X-Ray Data Jaehyup Jeong, Bosoung Jeoun, Yeonju Park, Bohyung Han
Analyzing the Behavior of Cauliflower Harvest-Readiness Models by Investigating Feature Relevances Niklas Penzel, Jana Kierdorf, Ribana Roscher, Joachim Denzler
Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li
AntiNODE: Evaluating Efficiency Robustness of Neural ODEs Mirazul Haque, Simin Chen, Wasif Arman Haque, Cong Liu, Wei Yang
APNet: Urban-Level Scene Segmentation of Aerial Images and Point Clouds Weijie Wei, Martin R. Oswald, Fatemeh Karimi Nejadasl, Theo Gevers
AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation Damian Sójka, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski
Are Current Long-Term Video Understanding Datasets Long-Term? Ombretta Strafforello, Klamer Schutte, Jan C. van Gemert
Assessing the Impact of Diversity on the Resilience of Deep Learning Ensembles: A Comparative Study on Model Architecture, Output, Activation, and Attribution Rafael Rosales, Pablo Munoz, Michael Paulitsch
ASUR3D: Arbitrary Scale Upsampling and Refinement of 3D Point Clouds Using Local Occupancy Fields Akash Kumbar, Tejas Anvekar, Ramesh Ashok Tabib, Uma Mudenagudi
Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-Task Learning Pranav Balaji, Abhijit Das, Srijan Das, Antitza Dantcheva
Augmenting Features via Contrastive Learning-Based Generative Model for Long-Tailed Classification Minho Park, Hyung-Il Kim, Hwa Jeon Song, Dong-oh Kang
Autonomous Mobile Robot for Automatic Out of Stock Detection in a Supermarket Giuseppe De Simone, Pasquale Foggia, Alessia Saggese, Mario Vento
AW-Net: A Novel Fully Connected Attention-Based Medical Image Segmentation Model Debojyoti Pal, Tanushree Meena, Dwarikanath Mahapatra, Sudipta Roy
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models Jan Warchocki, Teodor Oprescu, Yunhan Wang, Alexandru Damacus, Paul Misterka, Robert-Jan Bruintjes, Attila Lengyel, Ombretta Strafforello, Jan van Gemert
Benchmarking Image Classifiers for Physical Out-of-Distribution Examples Detection Ojaswee, Akshay Agarwal, Nalini K. Ratha
Bi-Encoder Cascades for Efficient Image Search Robert Hönig, Jan Ackermann, Mingyuan Chi
Biased Class Disagreement: Detection of Out of Distribution Instances by Using Differently Biased Semantic Segmentation Models Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo
BiLMa: Bidirectional Local-Matching for Text-Based Person Re-Identification Takuro Fujii, Shuhei Tarashima
Black-Box Attacks on Image Activity Prediction and Its Natural Language Explanations Alina Elena Baia, Valentina Poggioni, Andrea Cavallaro
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields Ori Gordon, Omri Avrahami, Dani Lischinski
BluNF: Blueprint Neural Field Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton
BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis Angela Castillo, María Escobar, Guillaume Jeanneret, Albert Pumarola, Pablo Arbeláez, Ali K. Thabet, Artsiom Sanakoyeu
Boosting Semi-Supervised Learning by Bridging High and Low-Confidence Predictions Khanh-Binh Nguyen, Joon-Sung Yang
BuilDiff: 3D Building Shape Generation Using Single-Image Conditional Point Cloud Diffusion Models Yao Wei, George Vosselman, Michael Ying Yang
Building CAD Model Reconstruction from Point Clouds via Instance Segmentation, Signed Distance Function, and Graph Cut Takayuki Shinohara, Yonghe Li, Mitsuteru Sakamoto, Toshiaki Satoh
Calibrated Out-of-Distribution Detection with a Generic Representation Tomás Vojír, Jan Sochman, Rahaf Aljundi, Jirí Matas
Camera-Based Road Snow Coverage Estimation Kai Cordes, Hellward Broszio
Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions? Prakash Chandra Chhipa, Johan Rodahl Holmgren, Kanjar De, Rajkumar Saini, Marcus Liwicki
Can Unstructured Pruning Reduce the Depth in Deep Neural Networks? Zhu Liao, Victor Quétu, Van-Tam Nguyen, Enzo Tartaglione
Causality-Driven One-Shot Learning for Prostate Cancer Grading from MRI Gianluca Carloni, Eva Pachetti, Sara Colantonio
Characterizing Face Recognition for Resource Efficient Deployment on Edge Ayan Biswas, Sai Amrit Patnaik, A. H. Abdul Hafez, Anoop M. Namboodiri
Chest X-Ray Feature Pyramid Sum Model with Diseased Area Data Augmentation Method Changhyun Kim, Giyeol Kim, Sooyoung Yang, Hyunsu Kim, Sangyool Lee, Hansu Cho
CheXFusion: Effective Fusion of Multi-View Features Using Transformers for Long-Tailed Chest X-Ray Classification Dongkyun Kim
Class-Aware Memory Guided Unbiased Weighting for Universal Domain Adaptive Object Detection Qinghai Lang, Zhenwei He, Xiaowei Fu, Lei Zhang
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels Jan Oscar Cross-Zamirski, Praveen Anand, Guy B. Williams, Elizabeth Mouchet, Yinhai Wang, Carola-Bibiane Schönlieb
Class-Incremental Learning of Plant and Disease Detection: Growing Branches with Knowledge Distillation Mathieu Pagé Fortin
Class-Incremental Learning Using Diffusion Model for Distillation and Replay Quentin Jodelet, Xin Liu, Yin Jun Phua, Tsuyoshi Murata
Classification Robustness to Common Optical Aberrations Patrick Müller, Alexander Braun, Margret Keuper
CLIP Goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition Deepti Hegde, Jeya Maria Jose Valanarasu, Vishal M. Patel
CLIP-Decoder: ZeroShot Multilabel Classification Using Multimodal CLIP Aligned Representations Muhammad Ali, Salman H. Khan
CLIP-FO3D: Learning Free Open-World 3D Scene Representations from 2D Dense CLIP Junbo Zhang, Runpei Dong, Kaisheng Ma
CLIPath: Fine-Tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts Zhengfeng Lai, Zhuoheng Li, Luca Cerny Oliveira, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah
ClipCrop: Conditioned Cropping Driven by Vision-Language Model Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato
Clustering-Based Domain-Incremental Learning Christiaan Lamers, René Vidal, Nabil Belbachir, Niki van Stein, Thomas Bäck, Paris Giampouras
CNN Based Cuneiform Sign Detection Learned from Annotated 3D Renderings and Mapped Photographs with Illumination Augmentation Ernst Stötzner, Timo Homburg, Hubert Mara
CNOS: A Strong Baseline for CAD-Based Novel Object Segmentation Van Nguyen Nguyen, Thibault Groueix, Georgy Ponimatkin, Vincent Lepetit, Tomas Hodan
Coarse to Fine Frame Selection for Online Open-Ended Video Question Answering Sai Vidyaranya Nuthalapati, Anirudh Tunga
Combating Coronary Calcium Scoring Bias for Non-Gated CT by Semantic Learning on Gated CT Jiajian Li, Anwei Li, Jiansheng Fang, Yonghe Hou, Chao Song, Huifang Yang, Jingwen Wang, Hongbo Liu, Jiang Liu
Comparative Study of Natural Replay and Experience Replay in Online Object Detection Baptiste Wagner, Denis Pellerin, Sylvain Huet
Complex-Valued Retrievals from Noisy Images Using Diffusion Models Nadav Torem, Roi Ronen, Yoav Y. Schechner, Michael Elad
Comprehensive Multimodal Segmentation in Medical Imaging: Combining YOLOv8 with SAM and HQ-SAM Models Sumit Pandey, Kuan-Fu Chen, Erik B. Dam
Computational Evaluation of the Combination of Semi-Supervised and Active Learning for Histopathology Image Segmentation with Missing Annotations Laura Gálvez Jiménez, Lucile Dierckx, Maxime Amodei, Hamed Razavi Khosroshahi, Natarajan Chidambaram, Anh-Thu Phan Ho, Alberto Franzin
Confusing Large Models by Confusing Small Models Vítor Albiero, Raghav Mehta, Ivan Evtimov, Samuel J. Bell, Levent Sagun, Aram Markosyan
Confusion Mixup Regularized Multimodal Fusion Network for Continual Egocentric Activity Recognition Hanxin Wang, Shuchang Zhou, Qingbo Wu, Hongliang Li, Fanman Meng, Linfeng Xu, Heqian Qiu
Consistency Regularization for Generalizable Source-Free Domain Adaptation Longxiang Tang, Kai Li, Chunming He, Yulun Zhang, Xiu Li
Context-VQA: Towards Context-Aware and Purposeful Visual Question Answering Nandita Naik, Christopher Potts, Elisa Kreiss
Continual Evidential Deep Learning for Out-of-Distribution Detection Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost van de Weijer
Continual Learning with Deep Streaming Regularized Discriminant Analysis Joe Khawand, Peter Hanappe, David Colliaux
Continuous Hand Gesture Recognition for Human-Robot Collaborative Assembly Bogdan Kwolek
Contrastive Image Synthesis and Self-Supervised Feature Adaptation for Cross-Modality Biomedical Image Segmentation Xinrong Hu, Corey Wang, Yiyu Shi
Controllable Inversion of Black-Box Face Recognition Models via Diffusion Manuel Kansy, Anton Raël, Graziana Mignone, Jacek Naruniec, Christopher Schroers, Markus Gross, Romann M. Weber
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks Aman Kumar, Khushboo Anand, Shubham Mandloi, Ashutosh Mishra, Avinash Thakur, Neeraj Kasera, A. P. Prathosh
COSE: A Consistency-Sensitivity Metric for Saliency on Image Classification Rangel Daroya, Aaron Sun, Subhransu Maji
Cross-Dimensional Refined Learning for Real-Time 3D Visual Perception from Monocular Video Ziyang Hong, C. Patrick Yue
Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model Claudia Cuttano, Antonio Tavera, Fabio Cermelli, Giuseppe Averta, Barbara Caputo
Cross-Grained Contrastive Representation for Unsupervised Lesion Segmentation in Medical Images Ziqi Yu, Botao Zhao, Yipin Zhang, Shengjie Zhang, Xiang Chen, Haibo Yang, Tingying Peng, Xiao-Yong Zhang
Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering Benjamin Z. Reichman, Larry Heck
Cross-Model Temporal Cooperation via Saliency Maps for Efficient Frame Classification Tomaso Trinci, Tommaso Bianconcini, Leonardo Sarti, Leonardo Taccari, Francesco Sambo
D-ViSA: A Dataset for Detecting Visual Sentiment from Art Images Seoyun Kim, ChaeHee An, Junyeop Cha, Dongjae Kim, Eunil Park
Data Efficient Single Image Dehazing via Adversarial Auto-Augmentation and Extended Atmospheric Scattering Model Pranjay Shyam, Hyunjin Yoo
DatasetEquity: Are All Samples Created Equal? In the Quest for Equity Within Datasets Shubham Shrivastava, Xianling Zhang, Sushruth Nagesh, Armin Parchami
Decision Boundary Optimization for Few-Shot Class-Incremental Learning Chenxu Guo, Qi Zhao, Shuchang Lyu, Binghao Liu, Chunlei Wang, Lijiang Chen, Guangliang Cheng
Deep Generative Networks for Heterogeneous Augmentation of Cranial Defects Kamil Kwarciak, Marek Wodzinski
Deep Learning Based 3D Reconstruction for Phenotyping of Wheat Seeds: A Dataset, Challenge, and Baseline Method Vsevolod Cherepashkin, Erenus Yildiz, Andreas Fischbach, Leif Kobbelt, Hanno Scharr
Deep Learning Driven Detection of Tsunami Related Internal Gravity Waves: A Path Towards Open-Ocean Natural Hazards Detection Valentino Constantinou, Michela Ravanelli, Hamlin Liu, Jacob Bortnik
Deep Learning for Apple Fruit Quality Inspection Using X-Ray Imaging Astrid Tempelaere, Leen Van Doorselaer, Jiaqi He, Pieter Verboven, Tinne Tuytelaars, Bart M. Nicolaï
Deep Learning Framework Using Sparse Diffusion MRI for Diagnosis of Frontotemporal Dementia Abhishek Tiwari, Ananya Singhal, Saurabh J. Shigwan, Rajeev Kumar Singh
DeepContrast: Deep Tissue Contrast Enhancement Using Synthetic Data Degradations and OOD Model Predictions Nuno Pimpão Martins, Yannis Kalaidzidis, Marino Zerial, Florian Jug
DeepCut: Unsupervised Segmentation Using Graph Neural Networks Clustering Amit Aflalo, Shai Bagon, Tamar Kashti, Yonina C. Eldar
Deepfakes Signatures Detection in the Handcrafted Features Space Assia Hamadene, Abdeldjalil Ouahabi, Abdenour Hadid
DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image Datasets Alokendu Mazumder, Tirthajit Baruah, Akash Kumar Singh, Pagadala Krishna Murthy, Vishwajeet Pattanaik, Punit Rathore
Defense-Prefix for Preventing Typographic Attacks on CLIP Hiroki Azuma, Yusuke Matsui
DeFi: Detection and Filling of Holes in Point Clouds Towards Restoration of Digitized Cultural Heritage Models Ramesh Ashok Tabib, Dikshit Hegde, Tejas Anvekar, Uma Mudenagudi
DELO: Deep Evidential LiDAR Odometry Using Partial Optimal Transport Sk Aziz Ali, Djamila Aouada, Gerd Reis, Didier Stricker
Denoising Diffusion for 3D Hand Pose Estimation from Images Maksym Ivashechkin, Oscar Mendez, Richard Bowden
Detecting Images Generated by Deep Diffusion Models Using Their Local Intrinsic Dimensionality Peter Lorenz, Ricard L. Durall, Janis Keuper
Detection of Fusarium Damaged Kernels in Wheat Using Deep Semi-Supervised Learning on a Novel WheatSeedBelt Dataset Keyhan Najafian, Lingling Jin, H. Randy Kutcher, Mackenzie Hladun, Samuel Horovatin, Maria Alejandra Oviedo-Ludena, Sheila Maria Pereira De Andrade, Lipu Wang, Ian Stavness
Deterministic Neural Illumination Mapping for Efficient Auto-White Balance Correction Furkan Kinli, Doga Yilmaz, Baris Özcan, Furkan Kiraç
DetOFA: Efficient Training of Once-for-All Networks for Object Detection Using Path Filter Yuiko Sakuma, Masato Ishii, Takuya Narihira
Developing Robust and Lightweight Adversarial Defenders by Enforcing Orthogonality on Attack-Agnostic Denoising Autoencoders Aristeidis Bifis, Emmanouil Z. Psarakis, Dimitrios I. Kosmopoulos
DFM-X: Augmentation by Leveraging Prior Knowledge of Shortcut Learning Shunxin Wang, Christoph Brune, Raymond N. J. Veldhuis, Nicola Strisciuglio
Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation Jieming Zhou, Tong Zhang, Zeeshan Hayder, Lars Petersson, Mehrtash Harandi
DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion Cédric Rommel, Eduardo Valle, Mickaël Chen, Souhaiel Khalfaoui, Renaud Marlet, Matthieu Cord, Patrick Pérez
Diffusion Based Augmentation for Captioning and Retrieval in Cultural Heritage Dario Cioni, Lorenzo Berlincioni, Federico Becattini, Alberto Del Bimbo
Direct Unsupervised Denoising Benjamin Salmon, Alexander Krull
Discrete Representation Learning for Modeling Imaging-Based Spatial Transcriptomics Data Dig Vijay Kumar Yarlagadda, Joan Massagué, Christina S. Leslie
DISGAN: Wavelet-Informed Discriminator Guides GAN to MRI Super-Resolution with Noise Cleaning Qi Wang, Lucas Mahler, Julius Steiglechner, Florian Birk, Klaus Scheffler, Gabriele Lohmann
Disjoint Pose and Shape for 3D Face Reconstruction Raja Kumar, Jiahao Luo, Alex Pang, James Davis
Distance Matters for Improving Performance Estimation Under Covariate Shift Mélanie Roschewitz, Ben Glocker
Distilling Part-Whole Hierarchical Knowledge from a Huge Pretrained Class Agnostic Segmentation Framework Ahmed Radwan, Mohamed S. Shehata
Do Planar Constraints Improve Camera Pose Estimation in Monocular SLAM? Charlotte Arndt, Reza Sabzevari, Javier Civera
Domain Adversarial Learning Towards Underwater Image Enhancement Meghna Kapoor, Rohan Baghel, Badri Narayan Subudhi, Vinit Jakhetiya, Ankur Bansal
DONNAv2 - Lightweight Neural Architecture Search for Vision Tasks Sweta Priyadarshi, Tianyu Jiang, Hsin-Pai Cheng, Sendil Krishna, Viswanath Ganapathy, Chirag Patel
Drones4Good: Supporting Disaster Relief Through Remote Sensing and AI Nina Merkle, Reza Bahmanyar, Corentin Henry, Seyed Majid Azimi, Xiangtian Yuan, Simon Schopferer, Veronika Gstaiger, Stefan Auer, Anne Schneibel, Marc Wieland, Thomas Kraft
Dual-Contrastive Dual-Consistency Dual-Transformer: A Semi-Supervised Approach to Medical Image Segmentation Ziyang Wang, Congying Ma
Dual-Level Interaction for Domain Adaptive Semantic Segmentation Dongyu Yao, Boheng Li
Dynamic Multiview Refinement of 3D Hand Datasets Using Differentiable Ray Tracing Giorgos Karvounas, Nikolaos Kyriazis, Iason Oikonomidis, Antonis A. Argyros
Dynamic Neural Network Is All You Need: Understanding the Robustness of Dynamic Mechanisms in Neural Networks Mirazul Haque, Wei Yang
Dynamic Scene Graph Representation for Surgical Video Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab
Dynamic Texts from UAV Perspective Natural Images Hidetomo Sakaino
ECO: Ensembling Context Optimization for Vision-Language Models Lorenzo Agnolucci, Alberto Baldrati, Francesco Todino, Federico Becattini, Marco Bertini, Alberto Del Bimbo
Effect of Stage Training for Long-Tailed Multi-Label Image Classification Yosuke Yamagishi, Shohei Hanaoka
Effective Whole-Body Pose Estimation with Two-Stages Distillation Zhendong Yang, Ailing Zeng, Chun Yuan, Yu Li
Efficient 3D Reconstruction, Streaming and Visualization of Static and Dynamic Scene Parts for Multi-Client Live-Telepresence in Large-Scale Environments Leif Van Holland, Patrick Stotko, Stefan Krumpen, Reinhard Klein, Michael Weinmann
Efficient Grapevine Structure Estimation in Vineyards Conditions Theophile Gentilhomme, Michael Villamizar, Jerome Corre, Jean-Marc Odobez
Efficient Neural PDE-Solvers Using Quantization Aware Training Winfried van den Dool, Tijmen Blankevoort, Max Welling, Yuki M. Asano
Efficient, Self-Supervised Human Pose Estimation with Inductive Prior Tuning Nobline Yoo, Olga Russakovsky
Embedded Deformation-Based Compression for Human 3D Dynamic Meshes with Changing Topology Huong Hoang, Kunyao Chen, Truong Nguyen, Pamela C. Cosman
Embedded Plant Recognition: A Benchmark for Low Footprint Deep Neural Networks Mohammed El Amine Sehaba, Carlos Fernando Crispim Junior, Laure Tougne Rodet
End-to-End Deep Learning for Reconstructing Segmented 3D CT Image from Multi-Energy X-Ray Projections Siqi Wang, Tatsuya Yatagawa, Yutaka Ohtake, Toru Aoki, Jun Hotta
Enhancing Classification Accuracy on Limited Data via Unconditional GAN Chunsan Hong, Byunghee Cha, Bohyung Kim, Tae-Hyun Oh
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor
Enhancing Differentiable Architecture Search: A Study on Small Number of Cell Blocks in the Search Stage, and Important Branches-Based Cells Selection Bedionita Soro, Chong Song
Enhancing Human-Robot Collaborative Object Search Through Human Behavior Observation and Dialog Takahiro Ishii, Jun Miura, Kotaro Hayashi
Enhancing Medical Image Segmentation: Optimizing Cross-Entropy Weights and Post-Processing with Autoencoders Pranav Singh, Luoyao Chen, Mei Chen, Jinqian Pan, Raviteja Chukkapalli, Shravan Chaudhari, Jacopo Cirrone
Enhancing Multi-Label Long-Tailed Classification on Chest X-Rays Through ML-GCN Augmentation Hyeryeong Seo, Minhyuk Lee, Woojin Cheong, Hyekyung Yoon, Sohyung Kim, Myungjoo Kang
Ensuring a Connected Structure for Retinal Vessels Deep-Learning Segmentation Idriss Dulau, Catherine Helmer, Cécile Delcourt, Marie Beurton-Aimar
Entropic Score Metric: Decoupling Topology and Size in Training-Free NAS Niccolò Cavagnero, Luca Robbiano, Francesca Pistilli, Barbara Caputo, Giuseppe Averta
Estimation of Crop Production by Fusing Images and Crop Features Ángela Casado-García, Jónathan Heras, Xabier Simon Martínez-Goñi, Jon Miranda-Apodaca, Usue Pérez-López
Estimation of Human Condition at Disaster Site Using Aerial Drone Images Tomoki Arai, Kenji Iwata, Kensho Hara, Yutaka Satoh
Evaluation of 3D Reconstruction for Cultural Heritage Applications Cristián Llull, Nelson Baloian, Benjamin Bustos, Kornelius Kupczik, Ivan Sipiran, Andres Baloian
Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Concetto Spampinato, Simone Palazzo, Bruno Casella, Marco Aldinucci
Explaining Through Transformer Input Sampling Alexandre Englebert, Sédrick Stassin, Géraldin Nanfack, Sidi Ahmed Mahmoudi, Xavier Siebert, Olivier Cornu, Christophe De Vleeschouwer
Explaining Vision and Language Through Graphs of Events in Space and Time Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu
Exploring Image Classification Robustness and Interpretability with Right for the Right Reasons Data Augmentation Flávio Arthur Oliveira Santos, Cleber Zanchettin
Exploring Inlier and Outlier Specification for Improved Medical OOD Detection Vivek Sivaraman Narayanaswamy, Yamen Mubarka, Rushil Anirudh, Deepta Rajan, Jayaraman J. Thiagarajan
Exploring the Road Graph in Trajectory Forecasting for Autonomous Driving Rémy Sun, Diane Lingrand, Frédéric Precioso
Expressive Talking Head Video Encoding in StyleGAN2 Latent Space Trevine Oorloff, Yaser Yacoob
Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images Hongkuan Zhang, Edward Whittaker, Ikuo Kitagishi
External Commonsense Knowledge as a Modality for Social Intelligence Question-Answering Sanika Natu, Shounak Sural, Sulagna Sarkar
Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery JoonKyu Park, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee
Facsimiles-Based Deep Learning for Matching Relief-Printed Decorations on Medieval Ceramic Sherds Khawla Brahim, Sylvie Treuillet, Matthieu Exbrayat, Sébastien Jesset
Factorized Dynamic Fully-Connected Layers for Neural Networks Francesca Babiloni, Thomas Tanay, Jiankang Deng, Matteo Maggioni, Stefanos Zafeiriou
Fair Robust Active Learning by Joint Inconsistency Tsung-Han Wu, Hung-Ting Su, Shang-Tse Chen, Winston H. Hsu
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection Silvio Galesso, Max Argus, Thomas Brox
FArMARe: A Furniture-Aware Multi-Task Methodology for Recommending Apartments Based on the User Interests Ali Abdari, Alex Falcon, Giuseppe Serra
Fast Object Detection in High-Resolution Videos Ryan Tran, Atul Kanaujia, Vasu Parameswaran
FedLID: Self-Supervised Federated Learning for Leveraging Limited Image Data Athanasios Psaltis, Anestis Kastellos, Charalampos Z. Patrikakis, Petros Daras
FedRCIL: Federated Knowledge Distillation for Representation Based Contrastive Incremental Learning Athanasios Psaltis, Christos Chatzikonstantinou, Charalampos Z. Patrikakis, Petros Daras
Few Labels Are Enough! Semi-Supervised Graph Learning for Social Interaction Nicola Corbellini, Jhony H. Giraldo, Giovanna Varni, Gualtiero Volpe
FewFaceNet: A Lightweight Few-Shot Learning-Based Incremental Face Authentication for Edge Cameras Abu Sufian, Anirudha Ghosh, Debaditya Barman, Marco Leo, Cosimo Distante, Baihua Li
Fine-Grained Is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation Maëlic Neau, Paulo E. Santos, Anne-Gwenn Bosser, Cédric Buche
Fine-Tuned but Zero-Shot 3D Shape Sketch View Similarity and Retrieval Gianluca Berardi, Yulia Gryaditskaya
FireFly: A Synthetic Dataset for Ember Detection in Wildfire Yue Hu, Xinan Ye, Yifei Liu, Souvik Kundu, Gourav Datta, Srikar Mutnuri, Namo Asavisanu, Nora Ayanian, Konstantinos Psounis, Peter A. Beerel
FIVA: Facial Image and Video Anonymization and Anonymization Defense Felix Rosberg, Eren Erdal Aksoy, Cristofer Englund, Fernando Alonso-Fernandez
Flashback for Continual Learning Leila Mahmoodi, Mehrtash Harandi, Peyman Moghadam
Floor Plan Reconstruction from Sparse Views: Combining Graph Neural Network with Constrained Diffusion Arnaud Gueze, Matthieu Ospici, Damien Rohmer, Marie-Paule Cani
Focus on Content Not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN Jonas Utz, Tobias Weise, Maja Schlereth, Fabian Wagner, Mareike Thies, Mingxuan Gu, Stefan Uderhardt, Katharina Breininger
Frequency-Aware Self-Supervised Long-Tailed Learning Ci-Siang Lin, Min-Hung Chen, Yu-Chiang Frank Wang
From Scarcity to Understanding: Transfer Learning for the Extremely Low Resource Irish Sign Language Ruth Holmes, Ellen Rushe, Mathieu De Coster, Maxim Bonnaerens, Shinichi Satoh, Akihiro Sugimoto, Anthony Ventresque
Fusing VHR Post-Disaster Aerial Imagery and LiDAR Data for Roof Classification in the Caribbean Isabelle Tingzon, Nuala Margaret Cowan, Pierre Chrzanowski
Fusion Approaches to Predict Post-Stroke Aphasia Severity from Multimodal Neuroimaging Data Saurav Chennuri, Sha Lai, Anne Billot, Maria Varkanitsa, Emily J. Braun, Swathi Kiran, Archana Venkataraman, Janusz Konrad, Prakash Ishwar, Margrit Betke
G2L: A High-Dimensional Geometric Approach for Automatic Generation of Highly Accurate Pseudo-Labels John R. Kender, Parijat Dube, Zhengyang Han, Bishwaranjan Bhattacharjee
GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Rubén Vera-Rodríguez, Dominik Lawatsch, Florian Domin, Maxim Schaubert
Gaussian Image Anomaly Detection with Greedy Eigencomponent Selection Tetiana Gula, João P. C. Bertoldo
Gaussian Latent Representations for Uncertainty Estimation Using Mahalanobis Distance in Deep Classifiers Aishwarya Venkataramanan, Assia Benbihi, Martin Laviale, Cédric Pradalier
PDF
Generating Synthetic Computed Tomography (CT) Images to Improve the Performance of Machine Learning Model for Pediatric Abdominal Anomaly Detection Samayan Bhattacharya, Avigyan Bhattacharya, Sk Shahnawaz
Generative Approach for Probabilistic Human Mesh Recovery Using Diffusion Models Hanbyel Cho, Junmo Kim
PDF
Geodesic Regression Characterizes 3D Shape Changes in the Female Brain During Menstruation Adele Myers, Caitlin M. Taylor, Emily G. Jacobs, Nina Miolane
PDF
Geometric Contrastive Learning Yeskendir Koishekenov, Sharvaree P. Vadgama, Riccardo Valperga, Erik J. Bekkers
Geometric Superpixel Representations for Efficient Image Classification with Graph Neural Networks Radu A. Cosma, Lukas Knobel, Putri A. van der Linden, David M. Knigge, Erik J. Bekkers
Good Fences Make Good Neighbours Imanol González Estepa, Jesús M. Rodríguez-de-Vera, Bhalaji Nagarajan, Petia Radeva
GPS-GLASS: Learning Nighttime Semantic Segmentation Using Daytime Video and GPS Data Hongjae Lee, Changwoo Han, Jun-Sang Yoo, Seung-Won Jung
PDF
Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models Byounggyu Lew, Donghyun Son, Buru Chang
PDF
Group-Conditional Conformal Prediction via Quantile Regression Calibration for Crop and Weed Classification Paul Melki, Lionel Bombrun, Boubacar Diallo, Jérôme Dias, Jean-Pierre Da Costa
PDF
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse Juanita Puentes, Angela Castillo, Wilmar Osejo, Yuly Calderón, Viviana Quintero, Lina Saldarriaga, Diana Agudelo, Pablo Arbeláez
PDF
Guiding Video Prediction with Explicit Procedural Knowledge Patrick Takenaka, Johannes Maucher, Marco F. Huber
PDF
Haystack: A Panoptic Scene Graph Dataset to Evaluate Rare Predicate Classes Julian Lorenz, Florian Barthel, Daniel Kienzle, Rainer Lienhart
PDF
Hierarchical Spatiotemporal Transformers for Video Object Segmentation Jun-Sang Yoo, Hongjae Lee, Seung-Won Jung
PDF
HyperCoil-Recon: A Hypernetwork-Based Adaptive Coil Configuration Task Switching Network for MRI Reconstruction Sriprabha Ramanarayanan, Mohammad Al Fahim, G. S. Rahul, Amrit Kumar Jethi, Keerthi Ram, Mohanasankar Sivaprakasam
PDF
HyperSparse Neural Networks: Shifting Exploration to Exploitation Through Adaptive Regularization Patrick Glandorf, Timo Kaiser, Bodo Rosenhahn
PDF
Hyperspectral Imaging of In-Site Stained Glasses: Illumination Variation Compensation Using Two Perpendicular Scans Suzan Joseph Kessy, Takuya Funatomi, Kazuya Kitano, Yuki Fujimura, Guillaume Caron, El Mustapha Mouaddib, Yasuhiro Mukaigawa
PDF
Identification of Novel Classes for Improving Few-Shot Object Detection Zeyu Shangguan, Mohammad Rostami
PDF
Identifying Out-of-Domain Objects with Dirichlet Deep Neural Networks Ahmed Hammam, Frank Bonarens, Seyed Eghbal Ghobadi, Christoph Stiller
Identifying Systematic Errors in Object Detectors with the SCROD Pipeline Valentyn Boreiko, Matthias Hein, Jan Hendrik Metzen
PDF
IDTransformer: Transformer for Intrinsic Image Decomposition Partha Das, Maxime Gevers, Sezer Karaoglu, Theo Gevers
IFPNet: Integrated Feature Pyramid Network with Fusion Factor for Lane Detection Zinan Lv, Dong Han, Wenzhe Wang, Cheng Chen
ILSH: The Imperial Light-Stage Head Dataset for Human Head View Synthesis Jiali Zheng, Youngkyoon Jang, Athanasios Papaioannou, Christos Kampouris, Rolandos Alexandros Potamias, Foivos Paraperas Papantoniou, Efstathios Galanakis, Ales Leonardis, Stefanos Zafeiriou
Image Guided Inpainting with Parameter Efficient Learning Sangbeom Lim, Seungryong Kim
Implicit Neural Representation in Medical Imaging: A Comparative Survey Amirali Molaei, Amirhossein Aminimehr, Armin Tavakoli, Amirhossein Kazerouni, Bobby Azad, Reza Azad, Dorit Merhof
PDF
Improving Automatic Endoscopic Stone Recognition Using a Multi-View Fusion Approach Enhanced with Two-Step Transfer Learning Francisco Javier López-Tiro, Elias Villalvazo-Avila, Juan Pablo Betancur-Rengifo, Iván Reyes-Amezcua, Jacques Hubert, Gilberto Ochoa-Ruiz, Christian Daul
PDF
Improving Deep Learning on Hyperspectral Images of Grain by Incorporating Domain Knowledge from Chemometrics Ole-Christian Galbo Engstrøm, Erik Schou Dreier, Birthe Møller Jespersen, Kim Steenstrup Pedersen
PDF
Improving Replay Sample Selection and Storage for Less Forgetting in Continual Learning Daniel Brignac, Niels Lobo, Abhijit Mahalanobis
PDF
Inductive Conformal Prediction for Harvest-Readiness Classification of Cauliflower Plants: A Comparative Study of Uncertainty Quantification Methods Mohamed M. Farag, Jana Kierdorf, Ribana Roscher
InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-Based Video Editing Anant Khandelwal
PDF
Instant Continual Learning of Neural Radiance Fields Ryan Po, Zhengyang Dong, Alexander W. Bergman, Gordon Wetzstein
PDF
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning Sharath Nittur Sridhar, Souvik Kundu, Sairam Sundaresan, Maciej Szankin, Anthony Sarah
PDF
Interaction Acceptance Modelling and Estimation for a Proactive Engagement in the Context of Human-Robot Interactions Timothée Dhaussy, Bassam Jabaian, Fabrice Lefèvre
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection Wei-Jhe Huang, Jheng-Hsien Yeh, Min-Hung Chen, Gueter Josmy Faure, Shang-Hong Lai
PDF
Interactive Image Segmentation with Cross-Modality Vision Transformers Kun Li, George Vosselman, Michael Ying Yang
PDF
InterAug: A Tuning-Free Augmentation Policy for Data-Efficient and Robust Object Detection Kowshik Thopalli, Devi S, Jayaraman J. Thiagarajan
PDF
Interpretable-Through-Prototypes Deepfake Detection for Diffusion Models Agil Aghasanli, Dmitry Kangin, Plamen Angelov
Intrinsic Appearance Decomposition Using Point Cloud Representation Xiaoyan Xing, Konrad Groh, Sezer Karaoglu, Theo Gevers
Introspection of 2D Object Detection Using Processed Neural Activation Patterns in Automated Driving Systems Hakan Yekta Yatbaz, Mehrdad Dianati, Konstantinos Koufos, Roger Woodman
PDF
IPCert: Provably Robust Intellectual Property Protection for Machine Learning Zhengyuan Jiang, Minghong Fang, Neil Zhenqiang Gong
Is Context All You Need? Scaling Neural Sign Language Translation to Large Domains of Discourse Ozge Mercanoglu Sincan, Necati Cihan Camgöz, Richard Bowden
PDF
Is There Progress in Activity Progress Prediction? Frans de Boer, Jan C. van Gemert, Jouke Dijkstra, Silvia L. Pintea
PDF
Iterative Robust Visual Grounding with Masked Reference Based Centerpoint Supervision Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao
PDF
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition Lucian Bicsi, Bogdan Alexe, Radu Tudor Ionescu, Marius Leordeanu
PDF
Just Ask Plus: Using Transcripts for VideoQA Mohammad Javad Pirhadi, Motahhare Mirzaei, Sauleh Eetemadi
Kinship Representation Learning with Face Componential Relation Wen-Tai Su, Min-Hung Chen, Chien-Yi Wang, Shang-Hong Lai, Trista Pei-Chun Chen
PDF
Knowledge Informed Sequential Scene Graph Verification Using VQA Dao Thauvin, Stéphane Herbin
PDF
Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language Francesco Taioli, Federico Cunico, Federico Girella, Riccardo Bologna, Alessandro Farinelli, Marco Cristani
PDF
LatentSwap3D: Semantic Edits on 3D Image GANs Enis Simsar, Alessio Tonioni, Evin Pinar Örnek, Federico Tombari
PDF
Learning Interpretable Forensic Representations via Local Window Modulation Sowmen Das, Md. Ruhul Amin
Learning to Prompt CLIP for Monocular Depth Estimation: Exploring the Limits of Human Language Dylan Auty, Krystian Mikolajczyk
Learning to Rank Approach for Refining Image Retrieval in Visual Arts Tetiana Yemelianenko, Iuliia Tkachenko, Tess Masclef, Mihaela Scuturici, Serge Miguet
PDF
Learning Universal Semantic Correspondences with No Supervision and Automatic Data Curation Aleksandar Shtedritski, Andrea Vedaldi, Christian Rupprecht
PDF
Learnt Contrastive Concept Embeddings for Sign Recognition Ryan Wong, Necati Cihan Camgöz, Richard Bowden
PDF
LEMMS: Label Estimation of Multi-Feature Movie Segments Bartolomeo Vacchetti, Dawit Mureja Argaw, Tania Cerquitelli
Leveraging Classic Deconvolution and Feature Extraction in Zero-Shot Image Restoration Tomás Chobola, Gesine Müller, Veit Dausmann, Anton Theileis, Jan Taucher, Jan Huisken, Tingying Peng
PDF
Leveraging Visual Attention for Out-of-Distribution Detection Luca Cultrera, Lorenzo Seidenari, Alberto Del Bimbo
PDF
LightNet: Generative Model for Enhancement of Low-Light Images Chaitra Desai, Nikhil Akalwadi, Amogh Joshi, Sampada Malagi, Chinmayee Mandi, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
Lightweight Vision Transformer with Spatial and Channel Enhanced Self-Attention Jiahao Zheng, Longqi Yang, Yiying Li, Ke Yang, Zhiyuan Wang, Jun Zhou
LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling Kaijing Ma, Xianghao Zang, Zerun Feng, Han Fang, Chao Ban, Yuhan Wei, Zhongjiang He, Yongxiang Li, Hao Sun
Logarithm-Transform Aided Gaussian Sampling for Few-Shot Learning Vaibhav Ganatra
PDF
Looking at Words and Points with Attention: A Benchmark for Text-to-Shape Coherence Andrea Amaduzzi, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano
PDF
Looking Through the Past: Better Knowledge Retention for Generative Replay in Continual Learning Valeriya Khan, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski
PDF
LORD: Leveraging Open-Set Recognition with Unknown Data Tobias Koch, Christian Riess, Thomas Köhler
PDF
M2C: Concise Music Representation for 3D Dance Generation Matthew Marchellus, In Kyu Park
MAMMOS: MApping Multiple Human MOtion with Scene Understanding and Natural Interactions Donggeun Lim, Cheongi Jeong, Young Min Kim
Mapping Memes to Words for Multimodal Hateful Meme Classification Giovanni Burbi, Alberto Baldrati, Lorenzo Agnolucci, Marco Bertini, Alberto Del Bimbo
PDF
Margin Contrastive Learning with Learnable-Vector for Continual Learning Kotaro Nagata, Kazuhiro Hotta
MARL: Multi-Scale Archetype Representation Learning for Urban Building Energy Modeling Xinwei Zhuang, Zixun Huang, Wentao Zeng, Luisa Caldas
PDF
Masking Strategies for Background Bias Removal in Computer Vision Models Ananthu Aniraj, Cássio F. Dantas, Dino Ienco, Diego Marcos
PDF
MatchMakerNet: Enabling Fragment Matching for Cultural Heritage Analysis Ariana M. Villegas-Suarez, Cristian Lopez, Ivan Sipiran
Memory Population in Continual Learning via Outlier Elimination Julio Hurtado, Alain Raymond-Saez, Vladimir Araujo, Vincenzo Lomonaco, Alvaro Soto, Davide Bacciu
PDF
Memory-Augmented Variational Adaptation for Online Few-Shot Segmentation Jie Liu, Yingjun Du, Zehao Xiao, Cees G. M. Snoek, Jan-Jakob Sonke, Efstratios Gavves
MGiaD: Multigrid in All Dimensions. Efficiency and Robustness by Weight Sharing and Coarsening in Resolution and Channel Dimensions Antonia van Betteray, Matthias Rottmann, Karsten Kahl
MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection Tianpeng Bao, Jiadong Chen, Wei Li, Xiang Wang, Jingjing Fei, Liwei Wu, Rui Zhao, Ye Zheng
PDF
Mind the Clot: Automated LVO Detection on CTA Using Deep Learning Shubham Kumar, Arjun Agarwal, Satish Golla, Swetha Tanamala, Ujjwal Upadhyay, Subhankar Chattoraj, Preetham Putha, Sasank Chilamkurthy
Mirror U-Net: Marrying Multimodal Fission with Multi-Task Learning for Semantic Segmentation in Medical Imaging Zdravko Marinov, Simon Reiß, David Kersting, Jens Kleesiek, Rainer Stiefelhagen
PDF
Misalignment-Free Relation Aggregation for Multi-Source-Free Domain Adaptation Hao-Wei Yeh, Qier Meng, Tatsuya Harada
MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering Mobeen Ahmad, Geonwoo Park, Dongchan Park, Sanguk Park
Modeling Visual Impairments with Artificial Neural Networks: A Review Lucia Schiatti, Monica Gori, Martin Schrimpf, Giulia Cappagli, Federica Morelli, Sabrina Signorini, Boris Katz, Andrei Barbu
PDF
MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices Xiangyu Chen, Ruiwen Zhen, Shuai Li, Xiaotian Li, Guanghui Wang
PDF
MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP Prajwal Ganugula, Y. S. S. S. Santosh Kumar, N. K. Sagar Reddy, Prabhath Chellingi, Avinash Thakur, Neeraj Kasera, C. Shyam Anand
PDF
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers Jakob Drachmann Havtorn, Amélie Royer, Tijmen Blankevoort, Babak Ehteshami Bejnordi
PDF
Multi-Camera 3D Position Estimation Using Conditional Random Field Shusuke Matsuda, Nattaon Techasarntikul, Hideyuki Shimonishi
Multi-Exit Resource-Efficient Neural Architecture for Image Classification with Optimized Fusion Block Youva Addad, Alexis Lechervy, Frédéric Jurie
PDF
Multi-Modal Correlated Network with Emotional Reasoning Knowledge for Social Intelligence Question-Answering Baijun Xie, Chung Hyuk Park
Multi-Task Consistency for Active Learning Aral Hekimoglu, Philipp Friedrich, Walter Zimmer, Michael Schmidt, Alvaro Marcos-Ramiro, Alois Knoll
PDF
Multi-Task Hypergraphs for Semi-Supervised Learning Using Earth Observations Mihai Cristian Pîrvu, Alina Marcu, Alexandra Dobrescu, Nabil Belbachir, Marius Leordeanu
PDF
Multimodal Contrastive Learning and Tabular Attention for Automated Alzheimer's Disease Prediction Weichen Huang
PDF
Multimodal Error Correction with Natural Language and Pointing Gestures Stefan Constantin, Fevziye Irem Eyiokur, Dogucan Yaman, Leonard Bärmann, Alex Waibel
Multimodal Neurons in Pretrained Text-Only Transformers Sarah Schwettmann, Neil Chowdhury, Samuel Klein, David Bau, Antonio Torralba
PDF
Multimodal Parameter-Efficient Few-Shot Class Incremental Learning Marco D'Alessandro, Alberto Alonso, Enrique Calabrés, Mikel Galar
PDF
NCQS: Nonlinear Convex Quadrature Surrogate Hyperparameter Optimization Sophia J. Abraham, Kehelwala Dewage Gayan Maduranga, Jeffery Kinnison, Jonathan D. Hauenstein, Walter J. Scheirer
NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions Mohamad Shahbazi, Evangelos Ntavelis, Alessio Tonioni, Edo Collins, Danda Pani Paudel, Martin Danelljan, Luc Van Gool
PDF
NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-Supervised 6D Object Pose Estimation Fu Li, Shishir Reddy Vutukur, Hao Yu, Ivan Shugurov, Benjamin Busam, Shaowu Yang, Slobodan Ilic
PDF
No Data Augmentation? Alternative Regularizations for Effective Training on Small Datasets Lorenzo Brigato, Stavroula G. Mougiakakou
PDF
Noise-in, Bias-Out: Balanced and Real-Time MoCap Solving Georgios Albanis, Nikolaos Zioulis, Spyridon Thermos, Anargyros Chatzitofis, Kostas Kolomvatsos
PDF
Non-Destructive Infield Quality Estimation of Strawberries Using Deep Architectures Cees Jol, Junhan Wen, Jan van Gemert
PDF
NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai
PDF
NU-Net: A Self-Supervised Smart Filter for Enhancing Blobs in Bioimages Seongbin Lim, Emmanuel Beaurepaire, Anatole Chessel
PDF
nuScenes Knowledge Graph - A Comprehensive Semantic Representation of Traffic Scenes for Trajectory Prediction Leon Mlodzian, Zhigang Sun, Hendrik Berkemeyer, Sebastian Monka, Zixu Wang, Stefan Dietze, Lavdim Halilaj, Juergen Luettin
PDF
Occluded Gait Recognition via Silhouette Registration Guided by Automated Occlusion Degree Estimation Chi Xu, Shogo Tsuji, Yasushi Makihara, Xiang Li, Yasushi Yagi
OMG-Attack: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks Ofir Bar Tal, Adi Haviv, Amit H. Bermano
PDF
On Moving Object Segmentation from Monocular Video with Transformers Christian Homeyer, Christoph Schnörr
On Offline Evaluation of 3D Object Detection for Autonomous Driving Tim Schreier, Katrin Renz, Andreas Geiger, Kashyap Chitta
PDF
On the Adversarial Robustness of Multi-Modal Foundation Models Christian Schlarmann, Matthias Hein
PDF
On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers Thomas De Min, Massimiliano Mancini, Karteek Alahari, Xavier Alameda-Pineda, Elisa Ricci
PDF
On the Interplay of Convolutional Padding and Adversarial Robustness Paul Gavrikov, Janis Keuper
PDF
On the Risk of Manual Annotations in 3D Confocal Microscopy Image Segmentation Justin Sonneck, Shuo Zhao, Jianxu Chen
On the Unreasonable Vulnerability of Transformers for Image Restoration - And an Easy Fix Shashank Agnihotri, Kanchana Vaishnavi Gandikota, Julia Grabinski, Paramanand Chandramouli, Margret Keuper
PDF
On-Device Real-Time Custom Hand Gesture Recognition Esha Uboweja, David Tian, Qifei Wang, Yi-Chun Kuo, Joe Zou, Lu Wang, George Sung, Matthias Grundmann
PDF
Online Detection of AI-Generated Images David C. Epstein, Ishan Jain, Oliver Wang, Richard Zhang
PDF
Open Problems in Computer Vision for Wilderness SAR and the Search for Patricia Wu-Murad Thomas Manzini, Robin R. Murphy
PDF
Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments Ruiping Liu, Jiaming Zhang, Kunyu Peng, Junwei Zheng, Ke Cao, Yufan Chen, Kailun Yang, Rainer Stiefelhagen
PDF
OpenIncrement: A Unified Framework for Open Set Recognition and Deep Class-Incremental Learning Jiawen Xu, Claas Grohnfeldt, Odej Kao
Optical Solutions for Spectral Imaging Inverse Problems with a Shift-Variant System Sergio Urrea, Roman Jacome, M. Salman Asif, Henry Arguello, Hans Garcia
Order-ViT: Order Learning Vision Transformer for Cancer Classification in Pathology Images Ju Cheon Lee, Jin Tae Kwak
Padding Aware Neurons Dario Garcia-Gasulla, Victor Gimenez-Abalos, Pablo Agustin Martin-Torres
PDF
Painter: Teaching Auto-Regressive Language Models to Draw Sketches Reza Pourreza, Apratim Bhattacharyya, Sunny Panchal, Mingu Lee, Pulkit Madan, Roland Memisevic
PDF
PanoStyle: Semantic, Geometry-Aware and Shading Independent Photorealistic Style Transfer for Indoor Panoramic Scenes Muhammad Tukur, Atiq ur Rehman, Giovanni Pintore, Enrico Gobbetti, Jens Schneider, Marco Agus
PARTICLE: Part Discovery and Contrastive Learning for Fine-Grained Recognition Oindrila Saha, Subhransu Maji
PDF
PAT: Position-Aware Transformer for Dense Multi-Label Action Detection Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton
PDF
PatFig: Generating Short and Long Captions for Patent Figures Dana Aubakirova, Kim Gerdes, Lufei Liu
PDF
Pathology-Based Ischemic Stroke Etiology Classification via Clot Composition Guided Multiple Instance Learning Mara Pleasure, Ekaterina Redekop, Jennifer S. Polson, Haoyue Zhang, Naoki Kaneko, William Speier, Corey W. Arnold
PCTrans: Position-Guided Transformer with Query Contrast for Biological Instance Segmentation Qi Chen, Wei Huang, Xiaoyu Liu, Jiacheng Li, Zhiwei Xiong
Personalized 3D Human Pose and Shape Refinement Tom Wehrbein, Bodo Rosenhahn, Iain A. Matthews, Carsten Stoll
PDF
Personalized Monitoring in Home Healthcare: An Assistive System for Post Hip Replacement Rehabilitation Alaa Kryeem, Shmuel Raz, Dana Eluz, Dorit Itah, Hagit Hel-Or, Ilan Shimshoni
Pigment Mapping for Tomb Murals Using Neural Representation and Physics-Based Model Mayuka Tsuji, Yuki Fujimura, Takuya Funatomi, Yasuhiro Mukaigawa, Tetsuro Morimoto, Takeshi Oishi, Jun Takamatsu, Katsushi Ikeuchi
Plant Root Occlusion Inpainting with Generative Adversarial Network Hao Song, Karim Panjvani, Zhigang Liu, Huzaifa Amar, Leon Kochian, Shengjian Ye, Xuan Yang, J. Allan Feurtado, Krunal Chavda, Karina Angela Chimbo Huatatoca, Mark G. Eramian
Pointing Gesture Recognition via Self-Supervised Regularization for ASD Screening Cheol-Hwan Yoo, Jang-Hee Yoo, Ho-Won Kim, ByungOk Han
Pointing Out Human Answer Mistakes in a Goal-Oriented Visual Dialogue Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima
Pollinators as Data Collectors: Estimating Floral Diversity with Bees and Computer Vision Frederic Tausch, Jan Wagner, Simon Klaus
Polygon Detection for Room Layout Estimation Using Heterogeneous Graphs and Wireframes David Gillsjö, Gabrielle Flood, Kalle Åström
PDF
PoseBias: On Dataset Bias and Task Difficulty - Is There an Optimal Camera Position for Facial Image Analysis? Mohit Choithwani, Sneha Almeida, Bernhard Egger
PoseMatcher: One-Shot 6D Object Pose Estimation by Deep Feature Matching Pedro Castro, Tae-Kyun Kim
PDF
Post Training Mixed Precision Quantization of Neural Networks Using First-Order Information Arun Chauhan, Utsav Tiwari, N. R. Vikram
POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition Ce Zheng, Matías Mendieta, Chen Chen
PDF
PRAT: PRofiling Adversarial aTtacks Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh S. Rawat
PDF
Probabilistic MIMO U-Net: Efficient and Accurate Uncertainty Estimation for Pixel-Wise Regression Anton Baumann, Thomas Roßberg, Michael Schmitt
PDF
Progressive Feature Adjustment for Semi-Supervised Learning from Pretrained Models Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix
PDF
ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion Zhizhang Hu, Xinliang Zhu, Son Tran, René Vidal, Arnab Dhua
QBitOpt: Fast and Accurate Bitwidth Reallocation During Training Jorn Peters, Marios Fournarakis, Markus Nagel, Mart van Baalen, Tijmen Blankevoort
PDF
Quantized Generative Models for Solving Inverse Problems Kartheek Kumar Reddy Nareddy, Vinayak Killedar, Chandra Sekhar Seelamantula
PDF
Raising the Bar on the Evaluation of Out-of-Distribution Detection Jishnu Mukhoti, Tsung-Yu Lin, Bor-Chun Chen, Ashish Shah, Philip H. S. Torr, Puneet K. Dokania, Ser-Nam Lim
PDF
Rapid Building Damage Assessment Workflow: An Implementation for the 2023 Rolling Fork, Mississippi Tornado Event Caleb Robinson, Simone Fobi Nsutezo, Anthony Ortiz, Tina Sederholm, Rahul Dodhia, Cameron Birge, Kasie Richards, Kris Pitcher, Paulo Duarte, Juan M. Lavista Ferres
PDF
Rapid Flood Inundation Forecast Using Fourier Neural Operator Alexander Y. Sun, Zhi Li, Wonhyun Lee, Qixing Huang, Bridget R. Scanlon, Clint Dawson
PDF
Rapid Tomato DUS Trait Analysis Using an Optimized Mobile-Based Coarse-to-Fine Instance Segmentation Algorithm Dan Jeric Arcega Rustia, Guido Alexander Jansen, Selwin Hageraats, Joseph Peller, Rick van de Zedde, Cécile Marchennay, Wim Sangster, Gosia Blokker
PDF
Ray-Patch: An Efficient Querying for Light Field Transformers Tomás Berriel Martins, Javier Civera
RCD-SGD: Resource-Constrained Distributed SGD in Heterogeneous Environment via Submodular Partitioning Haoze He, Parijat Dube
PDF
RCV2023 Challenges: Benchmarking Model Training and Inference for Resource-Constrained Deep Learning Rishabh Tiwari, Arnav Chavan, Deepak K. Gupta, Gowreesh Mago, Animesh Gupta, Akash Gupta, Suraj Sharan, Yukun Yang, Shanwei Zhao, Shihao Wang, Youngjun Kwak, Seonghun Jeong, Yunseung Lee, Changick Kim, Subin Kim, Ganzorig Gankhuyag, Ho Jung, Junwhan Ryu, HaeMoon Kim, Byeong Hak Kim, Tu Vo, Sheir Zaheer, Alexander Holston, Chan Y. Park, Dheemant Dixit, Nahush Lele, Kushagra Bhushan, Debjani Bhowmick, Devanshu Arya, Sadaf Gulshad, Amirhossein Habibian, Amir Ghodrati, Babak Ehteshami Bejnordi, Jai Gupta, Zhuang Liu, Jiahui Yu, Dilip K. Prasad, Zhiqiang Shen
Real-Time Optimisation-Based Path Planning for Visually Impaired People in Dynamic Environments Hadeel R. Surougi, Julie A. McCann
Reconstructing Pruned Filters Using Cheap Spatial Transformations Roy Miles, Krystian Mikolajczyk
PDF
Reconstruction of 3D Interaction Models from Images Using Shape Prior Mehrshad Mirmohammadi, Parham Saremi, Yen-Ling Kuo, Xi Wang
Reinforcement Learning for Instance Segmentation with High-Level Priors Paul Hilt, Maedeh Zarvandi, Edgar Kaziakhmedov, Sourabh Bhide, Maria Leptin, Constantin Pape, Anna Kreshuk
Reinforcement Learning with Space Carving for Plant Scanning Antonio Pico Villalpando, Matthias Kubisch, David Colliaux, Peter Hanappe, Verena V. Hafner
Relational Prior Knowledge Graphs for Detection and Instance Segmentation Osman Ülger, Yu Wang, Ysbrand Galama, Sezer Karaoglu, Theo Gevers, Martin R. Oswald
PDF
Repetition-Aware Image Sequence Sampling for Recognizing Repetitive Human Actions Konstantinos Bacharidis, Antonis A. Argyros
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li
PDF
Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation Jaime Corsetti, Davide Boscaini, Fabio Poiesi
PDF
Revisiting Generalizability in Deepfake Detection: Improving Metrics and Stabilizing Transfer Sarthak Kamat, Shruti Agarwal, Trevor Darrell, Anna Rohrbach
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-Form Video Understanding Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Ser-Nam Lim
PDF
RheumaVIT: Transformer-Based Model for Automated Scoring of Hand Joints in Rheumatoid Arthritis Alexander Stolpovsky, Elizaveta Dakhova, Polina Druzhinina, Polina Postnikova, Daniil Kudinsky, Alexander Smirnov, Anastasia Sukhinina, Alexander Lila, Anvar Kurmukov
Robust AMD Stage Grading with Exclusively OCTA Modality Leveraging 3D Volume Haochen Zhang, Anna Heinke, Carlo Miguel B. Galang, Daniel N. Deussen, Bo Wen, Dirk-Uwe G. Bartsch, William R. Freeman, Truong Q. Nguyen, Cheolhong An
PDF
Robust Asymmetric Loss for Multi-Label Long-Tailed Learning Wongi Park, Inhyuk Park, Sungeun Kim, Jongbin Ryu
PDF
Robust MSFM Learning Network for Classification and Weakly Supervised Localization Komal Kumar, Balakrishna Pailla, Kalyan Tadepalli, Sudipta Roy
Rotation-Invariant Hierarchical Segmentation on Poincaré Ball for 3D Point Cloud Pierre Onghena, Leonardo Gigli, Santiago Velasco-Forero
PDF
RRc-UNet 3D for Lung Tumor Segmentation from CT Scans of Non-Small Cell Lung Cancer Patients Van-Linh Le, Olivier Saut
PDF
RV-VAE: Integrating Random Variable Algebra into Variational Autoencoders Vassilis C. Nicodemou, Iason Oikonomidis, Antonis A. Argyros
S2RF: Semantically Stylized Radiance Fields Moneish Kumar, Neeraj Panse, Dishani Lahiri
PDF
SAM-Adapter: Adapting Segment Anything in Underperformed Scenes Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Shangzhan Zhang, Zejian Li, Lingyun Sun, Ying Zang, Papa Mao
SATHUR: Self Augmenting Task Hallucinal Unified Representation for Generalized Class Incremental Learning Sathursan Kanagarajah, Thanuja D. Ambegoda, Ranga Rodrigo
PDF
SC2GAN: Rethinking Entanglement by Self-Correcting Correlated GAN Space Zikun Chen, Han Zhao, Parham Aarabi, Ruowei Jiang
Scalable MAV Indoor Reconstruction with Neural Implicit Surfaces Haoda Li, Puyuan Yi, Yunhao Liu, Avideh Zakhor
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab
PDF
SCoTTi: Save Computation at Training Time with an Adaptive Framework Ziyu Li, Enzo Tartaglione, Van-Tam Nguyen
PDF
ScrollNet: Dynamic Weight Importance for Continual Learning Fei Yang, Kai Wang, Joost van de Weijer
PDF
SCSC: Spatial Cross-Scale Convolution Module to Strengthen Both CNNs and Transformers Xijun Wang, Xiaojie Chu, Chunrui Han, Xiangyu Zhang
PDF
SegDA: Maximum Separable Segment Mask with Pseudo Labels for Domain Adaptive Semantic Segmentation Anant Khandelwal
PDF
Segmentation-Based Assessment of Tumor-Vessel Involvement for Surgical Resectability Prediction of Pancreatic Ductal Adenocarcinoma Christiaan G. A. Viviers, Mark Ramaekers, M. M. Amaan Valiuddin, Terese Hellström, Nick Tasios, John van der Ven, Igor Jacobs, Lotte Ewals, Joost Nederend, Peter H. N. de With, Misha Luyer, Fons van der Sommen
PDF
Selective Freezing for Efficient Continual Learning Amelia Sorrenti, Giovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Concetto Spampinato, Simone Palazzo
SelectNAdapt: Support Set Selection for Few-Shot Domain Adaptation Youssef Dawoud, Gustavo Carneiro, Vasileios Belagiannis
PDF
Self-Supervised Anomaly Detection from Anomalous Training Data via Iterative Latent Token Masking Ashay Patel, Petru-Daniel Tudosiu, Walter H. L. Pinaya, Mark S. Graham, Olusola Adeleke, Gary J. Cook, Vicky Goh, Sébastien Ourselin, M. Jorge Cardoso
PDF
Self-Supervised Hypergraphs for Learning Multiple World Interpretations Alina Marcu, Mihai Cristian Pîrvu, Dragos Costea, Emanuela Haller, Emil Slusanschi, Nabil Belbachir, Rahul Sukthankar, Marius Leordeanu
PDF
Self-Supervised Learning of Contextualized Local Visual Embeddings Thalles Silva, Hélio Pedrini, Adín Ramírez Rivera
PDF
Self-Supervised Semantic Segmentation: Consistency over Transformation Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof
PDF
Self-Training and Multi-Task Learning for Limited Data: Evaluation Study on Object Detection Hoàng-Ân Lê, Minh-Tan Pham
PDF
SelfGraphVQA: A Self-Supervised Graph Neural Network for Scene-Based Question Answering Bruno Souza, Marius Aasan, Hélio Pedrini, Adín Ramírez Rivera
PDF
Semantic Motif Segmentation of Archaeological Fresco Fragments Aref Enayati, Luca Palmieri, Sebastiano Vascon, Marcello Pelillo, Sinem Aslan
PDF
Semantic Parsing of Colonoscopy Videos with Multi-Label Temporal Networks Ori Kelner, Or Weinstein, Ehud Rivlin, Roman Goldenberg
PDF
Semantic RGB-D Image Synthesis Shijie Li, Rong Li, Juergen Gall
PDF
Semantic Segmentation of Crops and Weeds with Probabilistic Modeling and Uncertainty Quantification Ekin Celikkan, Mohammadmehdi Saberioon, Martin Herold, Nadja Klein
Semantic Segmentation Using Foundation Models for Cultural Heritage: An Experimental Study on Notre-Dame de Paris Kévin Réby, Anaïs Guilhelm, Livio De Luca
PDF
Semantically Enhanced Scene Captions with Physical and Weather Condition Changes Hidetomo Sakaino
SeMask: Semantically Masked Transformers for Semantic Segmentation Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi
PDF
Semi-Supervised Quality Evaluation of Colonoscopy Procedures Idan Kligvasser, George Leifman, Roman Goldenberg, Ehud Rivlin, Michael Elad
PDF
Sensitivity Analysis of AI-Based Algorithms for Autonomous Driving on Optical Wavefront Aberrations Induced by the Windshield Dominik Werner Wolf, Markus Ulrich, Nikhil Kapoor
PDF
SEPAL: Spatial Gene Expression Prediction from Local Graphs Gabriel Mejía, Paula Cárdenas, Daniela Ruiz, Angela Castillo, Pablo Arbeláez
PDF
Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes Dana Cohen-Bar, Elad Richardson, Gal Metzer, Raja Giryes, Daniel Cohen-Or
PDF
Shannon Strikes Again! Entropy-Based Pruning in Deep Neural Networks for Transfer Learning Under Extreme Memory and Computation Budgets Gabriele Spadaro, Riccardo Renzulli, Andrea Bragagnolo, Jhony H. Giraldo, Attilio Fiandrotti, Marco Grangetto, Enzo Tartaglione
Shapley Deep Learning: A Consensus for General-Purpose Vision Systems Youcef Djenouri, Ahmed Nabil Belbachir, Tomasz P. Michalak, Anis Yazidi
Sharing Is Caring: Concurrent Interactive Segmentation and Model Training Using a Joint Model Ivan Mikhailov, Benoit Chauveau, Nicolas Bourdel, Adrien Bartoli
SHARP Challenge 2023: Solving CAD History and pArameters Recovery from Point Clouds and 3D Scans. Overview, Datasets, Metrics, and Baselines Dimitrios Mallis, Sk Aziz Ali, Elona Dupont, Kseniya Cherenkova, Ahmet Serdar Karadeniz, Mohammad Sadil Khan, Anis Kacem, Gleb Gusev, Djamila Aouada
PDF
ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty Vanessa Wirth, Anna-Maria Liphardt, Birte Coppers, Johanna Bräunig, Simon Heinrich, Sigrid Leyendecker, Arnd Kleyer, Georg Schett, Martin Vossiek, Bernhard Egger, Marc Stamminger
PDF
SHOWMe: Benchmarking Object-Agnostic Hand-Object 3D Reconstruction Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Brégier, Matthieu Armando, Jean-Sébastien Franco, Grégory Rogez
PDF
Single-Shot Pruning for Pre-Trained Models: Rethinking the Importance of Magnitude Pruning Hirokazu Kohama, Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
SoftMax Bias Correction for Quantized Generative Models Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel
PDF
SortedAP: Rethinking Evaluation Metrics for Instance Segmentation Long Chen, Yuli Wu, Johannes Stegmaier, Dorit Merhof
PDF
SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images Abdullah Hamdi, Bernard Ghanem, Matthias Nießner
PDF
Sparse Linear Concept Discovery Models Konstantinos P. Panousis, Dino Ienco, Diego Marcos
PDF
Spatio-Temporal Analysis of Patient-Derived Organoid Videos Using Deep Learning for the Prediction of Drug Efficacy Leo Fillioux, Emilie Gontran, Jérôme Cartry, Jacques RR Mathieu, Sabrina Bedja, Alice Boilève, Paul-Henry Cournède, Fanny Jaulin, Stergios Christodoulidis, Maria Vakalopoulou
PDF
Spatio-Temporal Convolution-Attention Video Network Ali Diba, Vivek Sharma, Mohammad Mahdi Arzani, Luc Van Gool
SpyroPose: SE(3) Pyramids for Object Pose Distribution Estimation Rasmus Laurvig Haugaard, Frederik Hagelskjær, Thorbjørn Mosekjær Iversen
PDF
SSIG: A Visually-Guided Graph Edit Distance for Floor Plan Similarity Casper C. J. van Engelenburg, Seyran Khademi, Jan C. van Gemert
PDF
STRIDE: Street View-Based Environmental Feature Detection and Pedestrian Collision Prediction Cristina González, Nicolás Ayobi, Felipe Escallón, Laura Baldovino-Chiquillo, Maria Wilches-Mogollón, Donny Pasos, Nicole Ramírez, José Pinzón, Olga L. Sarmiento, D. Alex Quistberg, Pablo Arbeláez
PDF
Studying the Impact of Augmentations on Medical Confidence Calibration Adrit Rao, Joon-Young Lee, Oliver O. Aalami
PDF
Sub-Ensembles for Fast Uncertainty Estimation in Neural Networks Matias Valdenegro-Toro
PDF
Surround the Nonlinearity: Inserting Foldable Convolutional Autoencoders to Reduce Activation Footprint Baptiste Rossigneux, Inna Kucher, Vincent Lorrain, Emmanuel Casseau
PDF
Surround-View Vision-Based 3D Detection for Autonomous Driving: A Survey Apoorv Singh
PDF
SynDrone - Multi-Modal UAV Dataset for Urban Scenarios Giulia Rizzoli, Francesco Barbato, Matteo Caligiuri, Pietro Zanuttigh
PDF
Synthetic Dataset Acquisition for a Specific Target Domain Joshua Niemeijer, Sudhanshu Mittal, Thomas Brox
PDF
T-FFTRadNet: Object Detection with Swin Vision Transformers from Raw ADC Radar Signals James Giroux, Martin Bouchard, Robert Laganière
PDF
Targeted Adversarial Attacks on Generalizable Neural Radiance Fields András Horváth, Csaba Mate Józsa
PDF
TeleViT: Teleconnection-Driven Transformers Improve Subseasonal to Seasonal Wildfire Forecasting Ioannis Prapas, Nikolaos-Ioannis Bountos, Spyros Kondylatos, Dimitrios Michail, Gustau Camps-Valls, Ioannis Papoutsis
PDF
Template-Guided Illumination Correction for Document Images with Imperfect Geometric Reconstruction Felix Hertlein, Alexander Naumann
Temporal DINO: A Self-Supervised Video Strategy to Enhance Action Prediction Izzeddin Teeti, Rongali Sai Bhargav, Vivek Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin
PDF
Temporally Consistent Semantic Segmentation Using Spatially Aware Multi-View Semantic Fusion for Indoor RGB-D Videos Fengyuan Sun, Sezer Karaoglu, Theo Gevers
Tensor Factorization for Leveraging Cross-Modal Knowledge in Data-Constrained Infrared Object Detection Manish Sharma, Moitreya Chatterjee, Kuan-Chuan Peng, Suhas Lohit, Michael J. Jones
PDF
The Change You Want to See (Now in 3D) Ragav Sachdeva, Andrew Zisserman
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo
The Robust Semantic Segmentation UNCV2023 Challenge Results Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi
PDF
The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures Christoph Reich, Tim Prangemeier, Heinz Koeppl
PDF
THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-Conditioned Human Motion Prediction Tiago Rodrigues de Almeida, Andrey Rudenko, Tim Schreiter, Yufei Zhu, Eduardo Gutiérrez-Maestro, Lucas Morillo-Méndez, Tomasz Piotr Kucner, Óscar Martínez Mozos, Martin Magnusson, Luigi Palmieri, Kai O. Arras, Achim J. Lilienthal
Tiny and Efficient Model for the Edge Detection Generalization Xavier Soria, Yachuan Li, Mohammad Rouhani, Angel Domingo Sappa
PDF
TKIL: Tangent Kernel Optimization for Class Balanced Incremental Learning Jinlin Xiang, Eli Shlizerman
Topo-CXR: Chest X-Ray TB and Pneumonia Screening with Topological Machine Learning Faisal Ahmed, Brighton Nuwagira, Furkan Torlak, Baris Coskunuzer
Towards an Exhaustive Evaluation of Vision-Language Foundation Models Emmanuelle Salin, Stéphane Ayache, Benoît Favre
PDF
Towards Automated Regulation of Jacobaea Vulgaris in Grassland Using Deep Neural Networks Moritz Schauer, Renke Hohl, Dennis Vaupel, Diethelm Bienhaus, Seyed Eghbal Ghobadi
Towards Estimation of Human Intent in Assistive Robotic Teleoperation Using Kinaesthetic and Visual Feedback Muneeb Ahmed, Brejesh Lall, Rajesh Kumar, Arzad Alam Kherani
Towards Fixing Clever-Hans Predictors with Counterfactual Knowledge Distillation Sidney Bender, Christopher J. Anders, Pattarawat Chormai, Heike Marxfeld, Jan Herrmann, Grégoire Montavon
PDF
Towards Hierarchical Regional Transformer-Based Multiple Instance Learning Josef Cersovsky, Sadegh Mohammadi, Dagmar Kainmueller, Johannes Höhne
PDF
Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis Thanh-Huy Nguyen, Quang-Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen-Quoc-Khanh Le
PDF
Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang
PDF
TP-NoDe: Topology-Aware Progressive Noising and Denoising of Point Clouds Towards Upsampling Akash Kumbar, Tejas Anvekar, Tulasi Amitha Vikrama, Ramesh Ashok Tabib, Uma Mudenagudi
Tracing the Influence of Predecessors on Trajectory Prediction Mengmeng Liu, Hao Cheng, Michael Ying Yang
PDF
Traffic Mirror Detection and Annotation Methods from Street Images of Open Data for Preventing Accidents at Intersections by Alert Da Li, Hikaru Hagura, Taichi Miyabashira, Yukiko Kawai, Shintaro Ono
TrainFors: A Large Benchmark Training Dataset for Image Manipulation Detection and Localization Soumyaroop Nandi, Prem Natarajan, Wael Abd-Almageed
PDF
Trajectory-Prediction with Vision: A Survey Apoorv Singh
PDF
Transformer-Based Detection of Microorganisms on High-Resolution Petri Dish Images Nikolas Ebert, Didier Stricker, Oliver Wasenmüller
PDF
Transformer-Based Sensor Fusion for Autonomous Driving: A Survey Apoorv Singh
PDF
Transformers Pay Attention to Convolutions Leveraging Emerging Properties of ViTs by Dual Attention-Image Network Yousef Yeganeh, Azade Farshad, Peter Weinberger, Seyed-Ahmad Ahmadi, Ehsan Adeli, Nassir Navab
TransInpaint: Transformer-Based Image Inpainting with Context Adaptation Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger
TSOSVNet: Teacher-Student Collaborative Knowledge Distillation for Online Signature Verification Chandra Sekhar Vorugunti, Avinash Gautam, Viswanath Pulabaigari, Sreeja Sr, Rama Krishna Sai G
UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM Erik Sandström, Kevin Ta, Luc Van Gool, Martin R. Oswald
PDF
Undercover Deepfakes: Detecting Fake Segments in Videos Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman K. Halgamuge
PDF
Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar
PDF
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks Fawaz Sammani, Nikos Deligiannis
PDF
Unified Automatic Plant Cover and Phenology Prediction Matthias Körschens, Solveig Franziska Bucher, Christine Römermann, Joachim Denzler
Unlocking Comparative Plant Scoring with Siamese Neural Networks and Pairwise Pseudo Labelling Zane K. J. Hartley, Rob J. Lind, Nicholas Smith, Bob Collison, Andrew P. French
PDF
Unraveling a Decade: A Comprehensive Survey on Isolated Sign Language Recognition Noha A. Sarhan, Simone Frintrop
Unseen and Adverse Outdoor Scenes Recognition Through Event-Based Captions Hidetomo Sakaino
Unsupervised Camouflaged Object Segmentation as Domain Adaptation Yi Zhang, Chengyi Wu
PDF
Unsupervised Confidence Approximation: Trustworthy Learning from Noisy Labelled Data Navid Rabbani, Adrien Bartoli
Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features Travis Zhang, Katie Luo, Cheng Perng Phoo, Yurong You, Wei-Lun Chao, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger
PDF
UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer Soon Yau Cheong, Armin Mustafa, Andrew Gilbert
PDF
Using and Abusing Equivariance Tom Edixhoven, Attila Lengyel, Jan C. van Gemert
PDF
Using Large Text to Image Models with Structured Prompts for Skin Disease Identification: A Case Study Sajith Rajapaksa, Jean Marie Uwabeza Vianney, Renell Castro, Farzad Khalvati, Shubhra Aich
PDF
VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer Liyang Chen, Zhiyong Wu, Runnan Li, Weihong Bao, Jun Ling, Xu Tan, Sheng Zhao
PDF
Video Action Recognition with Adaptive Zooming Using Motion Residuals Mostafa Shahabinejad, Irina Kezele, Seyed Shahabeddin Nabavi, Wentao Liu, Seel Patel, Yuanhao Yu, Yang Wang, Jin Tang
Video Attribute Prototype Network: A New Perspective for Zero-Shot Video Classification Bo Wang, Kaili Zhao, Hongyang Zhao, Shi Pu, Bo Xiao, Jun Guo
Video BagNet: Short Temporal Receptive Fields Increase Robustness in Long-Term Action Recognition Ombretta Strafforello, Xin Liu, Klamer Schutte, Jan van Gemert
PDF
Video-and-Language (VidL) Models and Their Cognitive Relevance Anne Zonneveld, Albert Gatt, Iacer Calixto
Virtual Perturbations to Assess Explainability of Deep-Learning Based Cell Fate Predictors Christopher J. Soelistyo, Guillaume Charras, Alan R. Lowe
PDF
Vision-Based Monitoring of the Short-Term Dynamic Behaviour of Plants for Automated Phenotyping Nikolaus Wagner, Grzegorz Cielniak
Vision-Based Treatment Localization with Limited Data: Automated Documentation of Military Emergency Medical Procedures Trevor Powers, Elaheh Hatamimajoumerd, William Chu, Vishakk Rajendran, Rishi Shah, Frank Diabour, Marc Vaillant, Richard Fletcher, Sarah Ostadabbas
Vision-Language Models Performing Zero-Shot Tasks Exhibit Disparities Between Gender Groups Melissa Hall, Laura Gustafson, Aaron Adcock, Ishan Misra, Candace Ross
VLMAH: Visual-Linguistic Modeling of Action History for Effective Action Anticipation Victoria Manousaki, Konstantinos Bacharidis, Konstantinos E. Papoutsakis, Antonis A. Argyros
Volumetric Fast Fourier Convolution for Detecting Ink on the Carbonized Herculaneum Papyri Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara
PDF
VSCHH 2023: A Benchmark for the View Synthesis Challenge of Human Heads Youngkyoon Jang, Jiali Zheng, Jifei Song, Helisa Dhamo, Eduardo Pérez-Pellitero, Thomas Tanay, Matteo Maggioni, Richard Shaw, Sibi Catley-Chandar, Yiren Zhou, Jiankang Deng, Ruijie Zhu, Jiahao Chang, Ziyang Song, Jiahuan Yu, Tianzhu Zhang, Khanh-Binh Nguyen, Joon-Sung Yang, Andreea Dogaru, Bernhard Egger, Heng Yu, Aarush Gupta, Joel Julin, László A. Jeni, Hyeseong Kim, Jungbin Cho, Dosik Hwang, Deukhee Lee, Doyeon Kim, Dongseong Seo, SeungJin Jeon, YoungDon Choi, Jun Seok Kang, Ahmet Cagatay Seker, Sang Chul Ahn, Ales Leonardis, Stefanos Zafeiriou
WaterLo: Protect Images from Deepfakes Using Localized Semi-Fragile Watermark Nicolas Beuve, Wassim Hamidouche, Olivier Déforges
Weakly Semi-Supervised Detector-Based Video Classification with Temporal Context for Lung Ultrasound Gary Y. Li, Li Chen, Mohsen Zahiri, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Cynthia Gregory, Kenton W. Gregory, Balasundar Raju, Jochen Kruecker, Alvin Chen
Weed Mapping with Convolutional Neural Networks on High Resolution Whole-Field Images Yuemin Wang, Thuan Ha, Kathryn Aldridge, Hema Sudhakar Duddu, Steve Shirtliffe, Ian Stavness
What Does Really Count? Estimating Relevance of Corner Cases for Semantic Segmentation in Automated Driving Jasmin Breitenstein, Florian Heidecker, Maria Lyssenko, Daniel Bogdoll, Maarten Bieshaar, J. Marius Zöllner, Bernhard Sick, Tim Fingscheidt
What if the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-Modal Language Models Letian Zhang, Xiaotong Zhai, Zhongkai Zhao, Xin Wen, Bingchen Zhao
PDF
When Layers Play the Lottery, All Tickets Win at Initialization Artur Jordão, George Corrêa de Araújo, Helena de Almeida Maia, Hélio Pedrini
PDF
Which Tokens to Use? Investigating Token Reduction in Vision Transformers Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor, Thomas B. Moeslund
PDF
Window-Based Model Averaging Improves Generalization in Heterogeneous Federated Learning Debora Caldarola, Barbara Caputo, Marco Ciccone
PDF
YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems Ivan Lazarevich, Matteo Grimaldi, Ravish Kumar, Saptarshi Mitra, Shahrukh Khan, Sudhakar Sah
PDF
You Can Have Your Ensemble and Run It Too - Deep Ensembles Spread over Time Isak Meding, Alexander Bodin, Adam Tonderski, Joakim Johnander, Christoffer Petersson, Lennart Svensson
PDF
Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts Deniz Engin, Yannis Avrithis
PDF
ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks Kartikeya Bhardwaj, Hsin-Pai Cheng, Sweta Priyadarshi, Zhuojin Li
PDF