ECCVW 2024

552 papers

✉ AirLetters 💨: An Open Video 🎞 Dataset of Characters Drawn in the Air Rishit Dagli, Guillaume Berger, Joanna Materzynska, Ingo Bax, Roland Memisevic
360u-Former: HDR Illumination Estimation with Panoramic Adapted Vision Transformers Jack Oliver Hilliard, Adrian Hilton, Jean-Yves Guillemaut
3D Object Detection and Tracking Refinement with Ensemble Methods and Spatiotemporal Filtering Sandesh Rajendra Jain, Surendrabikram Thapa, Sanjana Bharadwaj, Abhijit Sarkar, A. Lynn Abbott, Jianhua Xuan
3D Phenotyping of Canopy Occupation Volume as a Major Predictor for Canopy Photosynthesis in Rice (Oryza sativa L.) Jiaren Zhou, Man Zhang, Mengqi Zhang, Minjuan Wang
7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition Dimitrios Kollias, Stefanos Zafeiriou, Irene Kotsia, Abhinav Dhall, Shreya Ghosh, Chunchang Shao, Guanyu Hu
A Biologically-Inspired Approach to Biomedical Image Segmentation Luca Ciampi, Gabriele Lagani, Giuseppe Amato, Fabrizio Falchi
A Bottom-up Approach to Class-Agnostic Image Segmentation Sebastian Dille, Ari Blondal, Sylvain Paris, Yagiz Aksoy
A Computer Vision System for Automatic Edge Detection of Magnetic Grain Profile Zhe Liu, Ying Weng, Yiming Zhang
A CycleGAN Model to Synthesize Missing and Unpaired MRI Sequences for Under-Represented Multiple Sclerosis Lesions Flavio D'Amato, Alessia Cipriani, Alessandro Di Matteo, Daniele Lozzi, Enrico Mattei, Matteo Polsinelli, Giuseppe Placidi
A Data-Centric Module for Neural Rendering Emanuele Balloni, Lorenzo Stacchio, Lucrezia Gorgoglione, Marina Paolanti, Roberto Pierdicca, Adriano Mancini, Emanuele Frontoni, Primo Zingaretti
A Disentangled Approach to Predict the Aesthetic Outcomes of Breast Cancer Treatment Helena Montenegro, Maria João Cardoso, Jaime S. Cardoso
A Framework for Critical Evaluation of Text-to-Image Models: Integrating Art Historical Analysis, Artistic Exploration, and Critical Prompt Engineering Amalia F. Foka
A Framework for Enhanced Decision Support in Digital Agriculture Using Explainable Machine Learning Ahmed Emam, Mohamed M. Farag, Jana Kierdorf, Lasse Klingbeil, Uwe Rascher, Ribana Roscher
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision Alexey Magay, Dhurba Tripathi, Yu Hao, Yi Fang
A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-Time Adaptation for Vision-Language Models Mario Döbler, Robert A. Marsden, Tobias Raichle, Bin Yang
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection Carlo Sgaravatti, Roberto Basla, Riccardo Pieroni, Matteo Corno, Sergio M. Savaresi, Luca Magri, Giacomo Boracchi
A Semiotic Methodology for Assessing the Compositional Effectiveness of Generative Text-to-Image Models (Midjourney and DALL·E) Enzo D'Armenio, Adrien Deliège, Maria Giulia Dondero
A Simple Approach to Pavement Cell Segmentation Rostislav Shepel, Andrew Romanowski, Mario Valerio Giuffrida
A Spitting Image: Modular Superpixel Tokenization in Vision Transformers Marius Aasan, Odd Kolbjørnsen, Anne H. Schistad Solberg, Adín Ramírez Rivera
A System 1 and System 2 Perspective on Continual Learning for Practical Implementation Vivek Chavan, Oliver Heimann, Jörg Krüger
A Vision-Based Framework for Human Behavior Understanding in Industrial Assembly Lines Konstantinos E. Papoutsakis, Nikolaos Bakalos, Konstantinos Fragkoulis, Athena Zacharia, Georgia Kapetadimitri, Maria Pateraki
AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data Mirko Zaffaroni, Federico Signoretta, Marco Grangetto, Attilio Fiandrotti
ABAW7 Challenge: A Facial Affect Recognition Approach Based on Transformer Encoder and Multilayer Perceptron Xuxiong Liu, Kang Shen, Jun Yao, Boyan Wang, Yu Wang, Yujie Guan, Xin Liu, Gengchen Li, Liuwei An, Zishun Cui, Minrui Liu, Xiao Sun, Weijie Feng
Accuracy Improvement of Cell Image Segmentation Using Feedback Former Hinako Mitsuoka, Kazuhiro Hotta
Across-Game Engagement Modelling via Few-Shot Learning Kosmas Pinitas, Konstantinos Makantasis, Georgios N. Yannakakis
Adapting Large Language Model for Cross-Subject Semantic Decoding from Video-Stimulated fMRI Ruizhe Zheng, Lichao Sun
Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss Qifan Fu, Xiaohang Yang, Muhammad Asad, Changjae Oh, Shanxin Yuan, Gregory G. Slabaugh
Advancing Few-Shot Novel View Synthesis with Teacher-Student Guided Scene Geometry Refinement Yan Xing, Pan Wang, Yali Guo, Yongxin Wu, Shuangguan Liu, Youcheng Cai, Ligang Liu
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes Bappaditya Dey, Matthias Monden, Víctor Blanco, Sandip Halder, Stefan De Gendt
Adversarial Attacks on Hyperbolic Networks Max van Spengler, Jan Zahálka, Pascal Mettes
AEPnP: A Less-Constrained EPnP Solver for Pose Estimation with Anisotropic Scaling Jiaxin Wei, Stefan Leutenegger, Laurent Kneip
Affective Behavior Analysis Using Task-Adaptive and AU-Assisted Graph Xiaodong Li, Wenchao Du, Hongyu Yang
Affective Behaviour Analysis via Progressive Learning Chen Liu, Wei Zhang, Feng Qiu, Lincheng Li, Dadong Wang, Xin Yu
AgriBench: A Hierarchical Agriculture Benchmark for Multimodal Large Language Models Yutong Zhou, Masahiro Ryo
AHMF: Adaptive Hybrid-Memory-Fusion Model for Driver Attention Prediction Dongyang Xu, Qingfan Wang, Ji Ma, Xiangyun Zeng, Lei Chen
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results Maksim Smirnov, Aleksandr Gushchin, Anastasia Antsiferova, Dmitriy S. Vatolin, Radu Timofte, Ziheng Jia, Zicheng Zhang, Wei Sun, Jiaying Qian, Yuqin Cao, Yinan Sun, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Kanjar De, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Wenhui Meng, Xiaoheng Tan, Haiqiang Wang, Xiaozhong Xu, Shan Liu, Zhenzhong Chen, Zhengxue Cheng, Jiahao Xiao, Jun Xu, Chenlong He, Qi Zheng, Ruoxi Zhu, Min Li, Yibo Fan, Zhengzhong Tu
AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content Marcos V. Conde, Zhijun Lei, Wen Li, Christos G. Bampis, Ioannis Katsavounidis, Radu Timofte, Qing Luo, Jie Song, Linyan Jiang, Haibo Lei, Yaqing Li, Ziqi Luo, Rongkang Dong, Cuixin Yang, Zongqi He, Jun Xiao, Zhe Xiao, Yushen Zuo, Zihang Lyu, Kin-Man Lam, Yuxuan Jiang, Jakub Nawala, Chen Feng, Fan Zhang, Xiaoqing Zhu, Joel Sole, David Bull, Jae-Hyeon Lee, Dong-Hyeop Son, Ui-Jin Choi, Mingjun Zheng, Zhongbao Yang, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang
AIM 2024 Challenge on UHD Blind Photo Quality Assessment Vlad Hosu, Marcos V. Conde, Lorenzo Agnolucci, Nabajeet Barman, Saman Zadtootaghaj, Radu Timofte, Wei Sun, Weixia Zhang, Yuqin Cao, Linhan Cao, Jun Jia, Zijian Chen, Zicheng Zhang, Xiongkuo Min, Guangtao Zhai, Songbai Tan, Lixin Zhang, Guanghui Yue, Daekyu Kwon, Dongyoung Kim, Seon Joo Kim, Yunchen Zhang, Xiangkai Xu, Hong Gao, Yiming Bao, Ji Shi, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu, Zewen Chen, Shunhan Xu, Haochen Guo, Yun Zeng, Shuai Liu, Jian Guo, Juan Wang, Bing Li, Dehua Liu, Hesong Liu, Grigory Malivenko, Asile Gerek, Xingyuan Ma, Cheng Li, Joonhee Lee, Junseo Bang, Se Young Chun
AIM 2024 Challenge on Video Saliency Prediction: Methods and Results Andrey Moskalenko, Alexey Bryncev, Dmitry S. Vatolin, Radu Timofte, Gen Zhan, Li Yang, Yunlong Tang, Yiting Liao, Jiongzhi Lin, Baitao Huang, Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo, Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai, Hao Fang, Runmin Cong, Xiankai Lu, Xiaofei Zhou, Wei Zhang, Chunyu Zhao, Wentao Mu, Tao Deng, Hamed R. Tavakoli
AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results Ivan Molodetskikh, Artem Borisov, Dmitriy S. Vatolin, Radu Timofte, Jianzhao Liu, Tianwu Zhi, Yabin Zhang, Yang Li, Jingwen Xu, Yiting Liao, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Yuqin Cao, Wei Sun, Weixia Zhang, Yinan Sun, Ziheng Jia, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Weihua Luo, Yupeng Zhang, Hong Yi
AIM 2024 Sparse Neural Rendering Challenge: Dataset and Benchmark Michal Nazarczuk, Thomas Tanay, Sibi Catley-Chandar, Richard Shaw, Radu Timofte, Eduardo Pérez-Pellitero
AIM 2024 Sparse Neural Rendering Challenge: Methods and Results Michal Nazarczuk, Sibi Catley-Chandar, Thomas Tanay, Richard Shaw, Eduardo Pérez-Pellitero, Radu Timofte, Xing Yan, Pan Wang, Yali Guo, Yongxin Wu, Youcheng Cai, Yanan Yang, Junting Li, Yanghong Zhou, P. Y. Mok, Zongqi He, Zhe Xiao, Kin-Chung Chan, Hana Lebeta Goshu, Cuixin Yang, Rongkang Dong, Jun Xiao, Kin-Man Lam, Jiayao Hao, Qiong Gao, Yanyan Zu, Junpei Zhang, Licheng Jiao, Xu Liu, Kuldeep Purohit
Alfie: Democratising RGBA Image Generation with No $$$ Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara
Aligning Object Detector Bounding Boxes with Human Preference Ombretta Strafforello, Osman Semih Kayhan, Oana Inel, Klamer Schutte, Jan van Gemert
Aligning Vision Language Models with Contrastive Learning Kenan E. Ak, Jay Mohta, Dimitris Dimitriadis, Saurav Manchanda, Yan Xu, Mingwei Shen
An Approach for Dataset Extension for Object Detection in Artworks Using Open-Vocabulary Models Tetiana Yemelianenko, Iuliia Tkachenko, Tess Masclef, Mihaela Scuturici, Serge Miguet
An Art-Centric Perspective on AI-Based Content Moderation of Nudity Piera Riccio, Georgina Curto, Thomas Hofmann, Nuria Oliver
An Augmentation-Based Model Re-Adaptation Framework for Robust Image Segmentation Zheming Zuo, Joseph Smith, Jonathan Stonehouse, Boguslaw Obara
An Infrastructure-Based Localization Method for Articulated Vehicles Alberto Justo, Iker Pacho, Javier Araluce, Jesús Murgoitio Larrauri, Luis Miguel Bergasa
An Investigation on the Position Encoding in Vision-Based Dynamics Prediction Jiageng Zhu, Hanchen Xie, Jiazhi Li, Mahyar Khayatkhoei, Wael AbdAlmageed
Analysis of Hybrid Compositions in Animation Film with Weakly Supervised Learning Mónica Apellaniz Portos, Roberto Labadie Tamayo, Claudius Stemmler, Erwin Feyersinger, Andreas Babic, Franziska Bruckner, Vrääth Öhner, Matthias Zeppelzauer
AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection Mykhailo Koshil, Tilman Wegener, Detlef Mentrup, Simone Frintrop, Christian Wilms
AnomalyFactory: Regard Anomaly Generation as Unsupervised Anomaly Localization Ying Zhao
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving Daniel Bogdoll, Iramm Hamdard, Lukas Namgyu Rößler, Felix Geisler, Muhammed Bayram, Felix Wang, Jan Imhof, Miguel de Campos, Anushervon Tabarov, Yitian Yang, Martin Gontscharow, Hanno Gottschalk, J. Marius Zöllner
Architecture-Agnostic Unsupervised Gradient Regularization for Parameter-Efficient Transfer Learning Wenjie Zhu, Yabin Zhang, Pengfei Wang, Xin Jin, Wenjun Zeng, Lei Zhang
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting Takuma Nishimura, Andreea Dogaru, Martin Oeggerli, Bernhard Egger
Are CLIP Features All You Need for Universal Synthetic Image Origin Attribution? Dario Cioni, Christos Tzelepis, Lorenzo Seidenari, Ioannis Patras
Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? Charalambos Tzamos, Viktor Kocur, Yaqing Ding, Torsten Sattler, Zuzana Kukelova
Are Visual-Language Models Effective in Action Recognition? A Comparative Study Mahmoud Ali, Di Yang, François Brémond
Are We Friends? End-to-End Prediction of Child Rapport in Guided Play Marc Fraile, Giovanna Varni, Joakim Lindblad, Natasa Sladoje, Ginevra Castellano
Art Forgery Detection Using Kolmogorov Arnold and Convolutional Neural Networks Sandro Boccuzzo, Deborah Desirée Meyer, Ludovica Schaerf
Art2Mus: Bridging Visual Arts and Music Through Cross-Modal Generation Ivan Rinaldi, Nicola Fanelli, Giovanna Castellano, Gennaro Vessio
Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency Wei Sun, Weixia Zhang, Yuqin Cao, Linhan Cao, Jun Jia, Zijian Chen, Zicheng Zhang, Xiongkuo Min, Guangtao Zhai
Assistive Visual Tool: Enhancing Safe Navigation with Video Remapping in AR Headsets Arezoo Sadeghzadeh, Md Baharul Islam, Md. Nur Uddin, Tarkan Aydin
Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification Mahrukh Awan, Asmar Nadeem, Muhammad Junaid Awan, Armin Mustafa, Syed Sameed Husain
Autobiasing Event Cameras Mehdi Sefidgar Dilmaghani, Waseem Shariff, Cian Ryan, Joseph Lemley, Peter Corcoran
Automated Generation of Accurate, Compact and Focused Crop and Weed Segmentation Models Soma Dasgupta, Swarnava Dey
Automatic Die Studies for Ancient Numismatics Clément Cornet, Héloïse Aumaître, Romaric Besançon, Julien Olivier, Thomas Faucher, Hervé Le Borgne
Automatic Generation of Fashion Images Using Prompting in Generative Machine Learning Models Georgia Argyrou, Angeliki Dimitriou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou
Autonomous Drone-Person Tracking and Following in Uniform Appearance Scenarios Mohamad Alansari, Oussama Abdul Hay, Sajid Javed, Hazem Elrefaei, Khaled Alnuaimi, Bilal Hassan, Jorge Dias, Yahya H. Zweiri, Naoufel Werghi
Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications Brayan Monroy, Kebin Contreras, Jorge Bacca
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko
AVSal: Enhancing Video Saliency Prediction Through Audio-Visual Fusion and Temporal Aggregation Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai
BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic Assessment Ombretta Strafforello, Gonzalo Muradas Odriozola, Fatemeh Behrad, Li-Wei Chen, Anne-Sofie Maerten, Derya Soydaner, Johan Wagemans
Backward-Compatible Aligned Representations via an Orthogonal Transformation Layer Simone Ricci, Niccolò Biondi, Federico Pernici, Alberto Del Bimbo
BBD-Polyp: Weakly Supervised Polyp Segmentation via Bounding Box and Depth Map Thao Nguyen Phuong, Vinh Nguyen Duy, Hidetomo Sakaino
BehAVE: Behaviour Alignment of Video Game Encodings Nemanja Rasajski, Chintan Trivedi, Konstantinos Makantasis, Antonios Liapis, Georgios N. Yannakakis
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar, Abdur Razzaq Fayjie, Jurgen Hannaert, Patrick Vandewalle
Better Spanish Emotion Recognition In-the-Wild: Bringing Attention to Deep Spectrum Voice Analysis Elena Ortega-Beltrán, Josep Cabacas-Maso, Ismael Benito-Altamirano, Carles Ventura
Beyond Annotations: Efficient Wheat Head Segmentation Using L-Systems, Game Engines, and Student-Teacher Models Hosein Beheshtifard, Elijah Mickelson, Keyhan Najafian, Farhad Maleki
Beyond the Surface: A Comprehensive Analysis of Implicit Bias in Vision-Language Models Giacomo Capitani, Alice Lucarini, Lorenzo Bonicelli, Federico Bolelli, Simone Calderara, Loris Vezzali, Elisa Ficarra
BodyShapeGPT: SMPL Body Shape Manipulation with LLMs Baldomero R. Árbol, Dan Casas
Boosting Pose Estimators via Cross-Representation Distillation Kang Liu, Zhendong Yang, Jingyun Zhang, Jun Wang, Shaoming Wang, Chun Yuan, Rizen Guo
BootPIG: Bootstrapping Zero-Shot Personalized Image Generation Capabilities in Pretrained Diffusion Models Senthil Purushwalkam, Akash Gokul, Shafiq Joty, Nikhil Naik
Boundary Attention: Learning Curves, Corners, Junctions and Grouping Mia Gaia Polansky, Charles Herrmann, Junhwa Hur, Deqing Sun, Dor Verbin, Todd E. Zickler
Boundary Matching and Refinement Network with Cross-Modal Contrastive Learning for Temporal Moment Localization Jinyoung Moon, Muah Seol, Jonghee Kim
Bridging Text and Image for Artist Style Transfer via Contrastive Learning Zhi-Song Liu, Li-Wen Wang, Jun Xiao, Vicky Kalogeiton
BurnSafe: Automatic Assistive Tool for Burn Severity Assessment by Semantic Segmentation Corneliu Florea, Laura Florea, Constantin Vertan, Andreea Nitu, Silviu Badoiu
CA3D: Convolutional-Attentional 3D Nets for Efficient Video Activity Recognition on the Edge Gabriele Lagani, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
Calibration of Network Confidence for Unsupervised Domain Adaptation Using Estimated Accuracy Coby Penso, Jacob Goldberger
Can Your Generative Model Detect Out-of-Distribution Covariate Shift? Christiaan G. A. Viviers, M. M. Amaan Valiuddin, Francisco Caetano, Lemar Abdi, Lena Filatova, Peter H. N. de With, Fons van der Sommen
Capturing and Modeling Real Cloth Deformations for Virtual Garment Design Pietro Musoni, Simone Melzi, Umberto Castellani
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding Wenhao Xu, Wenming Weng, Yueyi Zhang, Zhiwei Xiong
ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild Arya Farkhondeh, Samy Tafasca, Jean-Marc Odobez
Civiverse: A Dataset for Analyzing User Engagement with Open-Source Text-to-Image Models Maria-Teresa De Rosa Palmini, Laura Wagner, Eva Cetinic
Closer to Ground Truth: Realistic Shape and Appearance Labeled Data Generation for Unsupervised Underwater Image Segmentation Andrei Jelea, Ahmed Nabil Belbachir, Marius Leordeanu
CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling Ruihan Yang, Hannes Gamper, Sebastian Braun
Coarse-to-Fine Human Mesh Recovery with Transformers Vatsal Agarwal, Mara Levy, Max Ehrlich, Youbao Tang, Ning Zhang, Abhinav Shrivastava
Collaborative Control for Geometry-Conditioned PBR Image Generation Shimon Vainer, Mark Boss, Mathias Parger, Konstantin Kutsy, Dante De Nigris, Ciara Rowles, Nicolas Perony, Simon Donné
ColorwAI: Generative Colorways of Textiles Through GAN and Diffusion Disentanglement Ludovica Schaerf, Andrea Alfarano, Eric O. Postma
ComiCap: A VLMs Pipeline for Dense Captioning of Comic Panels Emanuele Vivoli, Niccolò Biondi, Marco Bertini, Dimosthenis Karatzas
Comparative Analysis of Synthetic and Real Melanoma Images in AI-Driven Diagnosis Alessia Auriemma Citarella, Fabiola De Marco, Luigi Di Biasi, Genoveffa Tortora
Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection Ahmet Oguz Saltik, Alicia Allmendinger, Anthony Stein
Compositional Text-to-Image Generation with Feedforward Layout Generation Sifei Liu, Weili Nie, An-Chieh Cheng, Morteza Mardani, Chao Liu, Benjamin Eckart, Arash Vahdat
Compound Expression Recognition via Curriculum Learning Chen Liu, Feng Qiu, Wei Zhang, Lincheng Li, Dadong Wang, Xin Yu
Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results Marcos V. Conde, Florin-Alexandru Vasluianu, Jinhui Xiong, Wei Ye, Rakesh Ranjan, Radu Timofte, Huan Zheng, Wencheng Han, Tianyi Yan, Jianbing Shen, Pihai Sun, Yuanqi Yao, Kui Jiang, Wenbo Zhao, Xianming Liu, Evgeny Burnaev, Junjun Jiang, Woojae Han, Kyeonghyun Lee, Seongmin Hong, Se Young Chun, Jinseong Kim, Dohyeong Kim, Jeahwan Kim, Yubo Wang, Chi Zhang, Huizhen Luo, Yansai Wu, Mengcheng Huang, Chengji Liu, Chongli Yve, Jianhang Sun, Cheng Guo, Yingcai Du, Huang Jianhao, Liu Shuai, Li Chenghua
Compression-RQ-VQA: Leveraging Rich Quality-Aware Features for Compressed Video Quality Assessment Ziheng Jia, Jiaying Qian, Wei Sun, Zicheng Zhang, Yuqin Cao, Yinan Sun, Yuxin Zhu, Guangtao Zhai, Xiongkuo Min
Concept-Based Explanations in Computer Vision: Where Are We and Where Could We Go? Jae Hee Lee, Georgii Mikriukov, Gesina Schwalbe, Stefan Wermter, Diedrich Wolter
Conditional Hand Image Generation Using Latent Space Supervision in Random Variable Variational Autoencoders Vassilis C. Nicodemou, Iason Oikonomidis, Giorgos Karvounas, Antonis A. Argyros
Conditional Unscented Autoencoders for Trajectory Prediction Faris Janjos, Marcel Hallgarten, Anthony Knittel, Maxim Dolgov, Andreas Zell, J. Marius Zöllner
CondSeg: Ellipse Estimation of Pupil and Iris via Conditioned Segmentation Zhuang Jia, Jiangfan Deng, Liying Chi, Xiang Long, Daniel K. Du
Connectivity-Inspired Network for Context-Aware Recognition Gianluca Carloni, Sara Colantonio
Consolidation of Symbolic Instances Using Sensor Data via Tracklet Merging for Long-Term Monitoring of Crops Mark Niemeyer, Joachim Hertzberg, Grzegorz Cielniak
Context-Aware Full Body Anonymization Pascal Zwick, Kevin Rösch, Marvin Klemp, Oliver Bringmann
Context-Infused Visual Grounding for Art Selina Khan, Nanne van Noord
Contextual Knowledge Pursuit for Faithful Visual Synthesis Jinqi Luo, Kwan Ho Ryan Chan, Dimitris Dimos, René Vidal
Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving Qi Deng, Ruyang Li, Qifu Hu, Tengfei Zhang, Heng Zhang
Control+Shift: Generating Controllable Distribution Shifts Roy Friedman, Rhea Chowers
Cultural Heritage 3D Reconstruction with Diffusion Networks Pablo Jaramillo, Ivan Sipiran
CycleBNN: Cyclic Precision Training in Binary Neural Networks Federico Fontana, Romeo Lanzino, Anxhelo Diko, Gian Luca Foresti, Luigi Cinque
DailyMAE: Towards Pretraining Masked Autoencoders in One Day Jiantao Wu, Shentong Mo, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling Kyuheon Jung, Yongdeuk Seo, Seongwoo Cho, Jaeyoung Kim, Hyun-seok Min, Sungchul Choi
DARES: Depth Anything in Robotic Endoscopic Surgery with Self-Supervised Vector-LoRA of the Foundation Model Mona Sheikh Zeinoddin, Chiara Lena, Jiongqi Qu, Luca Carlini, Mattia Magro, Seunghoi Kim, Elena De Momi, Sophia Bano, Matthew Grech-Sollars, Evangelos B. Mazomenos, Daniel C. Alexander, Danail Stoyanov, Matthew J. Clarkson, Mobarakol Islam
DAS3D: Dual-Modality Anomaly Synthesis for 3D Anomaly Detection Kecen Li, Bingquan Dai, Jingjing Fu, Xinwen Hou
Data-Efficient Generation for Dataset Distillation Zhe Li, Weitong Zhang, Sarah Cechnicka, Bernhard Kainz
DAVIDE: Depth-Aware Video Deblurring German F. Torres, Jussi Kalliola, Soumya Tripathy, Erman Acar, Joni-Kristian Kämäräinen
DebiasPI: Inference-Time Debiasing by Prompt Iteration of a Text-to-Image Generative Model Sarah Bonna, Yu-Cheng Huang, Ekaterina Novozhilova, Sejin Paik, Zhengyang Shan, Michelle Yilin Feng, Ge Gao, Yonish Tayal, Rushil Kulkarni, Jialin Yu, Nupur Divekar, Deepti Ghadiyaram, Derry Wijaya, Margrit Betke
Deep Armocromia: A Novel Dataset for Face Seasonal Color Analysis and Classification Lorenzo Stacchio, Marina Paolanti, Francesca Spigarelli, Emanuele Frontoni
Deep Learning Based Growth Modeling of Plant Phenotypes Renke Hohl, Moritz Schauer, Seyed Eghbal Ghobadi
Deep Learning for Automated Shark Detection and Biometrics Without Keypoints Jaden V. Clark, Chinmay K. Lalgudi, Mark E. Leone, Jayson Meribe, Sergio Madrigal-Mora, Mario Espinoza
Deep Learning Meets Satellite Images - An Evaluation on Handcrafted and Learning-Based Features for Multi-Date Satellite Stereo Images Shuang Song, Luca Morelli, Xinyi Wu, Rongjun Qin, Hessah Albanwan, Fabio Remondino
Deep Unsupervised Segmentation of Log Point Clouds Fedor Zolotarev, Tuomas Eerola, Tomi Kauppi
DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights Using the Fisher Diagonal Jialei Shi, Kostis Gourgoulias, John F. Buford, Sean J. Moran, Najah Ghalyan
Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation Daniele Rege Cambrin, Isaac Corley, Paolo Garza
Depth-Based Privileged Information for Boosting 3D Human Pose Estimation on RGB Alessandro Simoni, Francesco Marchetti, Guido Borghi, Federico Becattini, Davide Davoli, Lorenzo Garattoni, Gianpiero Francesca, Lorenzo Seidenari, Roberto Vezzani
Detect Fake with Fake: Leveraging Synthetic Data-Driven Representation for Synthetic Image Detection Hina Otake, Yoshihiro Fukuhara, Yoshiki Kubotani, Shigeo Morishima
Detecting Forged Sentinel-2 Images Through Parallax-Based Cloud Analysis Matthieu Serfaty, Quentin Bammey, Tina Nikoukhah, Rafael Grompone von Gioi, Carlo de Franchis
DIE-VIS: An Automated Visual Inspection System for Cardboard Box Manufacturing Flavia Monti, Matteo Marinacci, Francesco Leotta, Massimo Mecella
DIFF-NST: Diffusion Interleaving for deFormable Neural Style Transfer Dan Ruta, Gemma Canet Tarres, Andrew Gilbert, Eli Shechtman, Nicholas I. Kolkin, John P. Collomosse
DiffAugment: Diffusion Based Long-Tailed Visual Relationship Recognition Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do
DiffSign: AI-Assisted Generation of Customizable Sign Language Videos with Enhanced Realism Sudha Krishnamurthy, Vimal Bhat, Abhinav Jain
Diffusion-Based Light Field Synthesis Ruisheng Gao, Yutong Liu, Zeyu Xiao, Zhiwei Xiong
Diffusion-Based Synthetic Dataset Generation for Egocentric 3D Human Pose Estimation Kyohei Hayakawa, Dong-Hyun Hwang, Chen-Chieh Liao, Hideki Koike
Diffusion-Promoted HDR Video Reconstruction Yuanshen Guan, Ruikang Xu, Mingde Yao, Ruisheng Gao, Lizhi Wang, Zhiwei Xiong
DiM: Distilling Dataset into Generative Model Kai Wang, Jianyang Gu, Hansong Zhang, Daquan Zhou, Zheng Zhu, Wei Jiang, Yang You
Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents Duomin Wang, Bin Dai, Yu Deng, Baoyuan Wang
DIVA: Deep Indic Virtual Apparel Try-on Kuppa Sai Sri Teja, Hrishit Mitra, Rongali Simhachala Venkata Girish, Kaushik Mitra
DiVR: Incorporating Context from Diverse VR Scenes for Human Trajectory Prediction Franz Franco Gallo, Hui-Yin Wu, Lucile Sassatelli
Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation? Kerem Cekmeceli, Meva Himmetoglu, Guney I. Tombak, Anna Susmelj, Ertunc Erdil, Ender Konukoglu
Down-Sampling Inter-Layer Adapter for Parameter and Computation Efficient Ultra-Fine-Grained Image Recognition Edwin Arkel Rios, Femiloye Oyerinde, Min-Chun Hu, Bo-Cheng Lai
DreamTexture: High-Fidelity Synthetic 3D Data Generation Through Decoupled Geometry and Texture Synthesis Jing Li, Yawei Luo, Ying Li, Xueying Li, Xiaoxue Li, Yuwen Hao, Lijun Wang, Zhengping Li
DreamWalk: Style Space Exploration Using Diffusion Guidance Michelle Shu, Charles Herrmann, Richard Strong Bowen, Forrester Cole, Ramin Zabih
Drone Detection Using a Low-Power Neuromorphic Virtual Tripwire Anton Eldeborg Lundin, Rasmus Winzell, Hanna Hamrell, David Gustafsson, Hannes Ovrén
Dynamic Label Injection for Imbalanced Industrial Defect Segmentation Emanuele Caruso, Francesco Pelosin, Alessandro Simoni, Marco Boschetti
Edge-Aware Consistent Stereo Video Depth Estimation Elena Kosheleva, Sunil Prasad Jaiswal, Faranak Shamsafar, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek
Effective Prior Regularized Sparse Learning Junting Li, Yanghong Zhou, Jintu Fan, Dahua Shou, Sa Xu, P. Y. Mok
EMAG: Ego-Motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos Masashi Hatano, Ryo Hachiuma, Hideo Saito
Embedding Geometries of Contrastive Language-Image Pre-Training Jason Chuan-Chih Chou, Nahid Alam
Empowering Autonomous Shuttles with Next-Generation Infrastructure Sven Ochs, Melih Yazgan, Rupert Polley, Albert Schotschneider, Stefan Orf, Marc Uecker, Maximilian Zipfl, Julian Burger, Abhishek Vivekanandan, Jennifer Amritzer, Marc René Zofka, J. Marius Zöllner
Enhanced Action Quality Assessment with Dual-Stream Pose and Video Feature Integration Yanting Zhang, Xia Li, Wenguang Zeng, Shuai Yu, Zijian Wang, Zhijun Fang
Enhancing Dataset Distillation via Label Inconsistency Elimination and Learning Pattern Refinement Chuhao Zhou, Chenxi Jiang, Yi Xie, Haozhi Cao, Jianfei Yang
Enhancing Facial Expression Recognition Through Dual-Direction Attention Mixed Feature Networks: Application to 7th ABAW Challenge Josep Cabacas-Maso, Elena Ortega-Beltrán, Ismael Benito-Altamirano, Carles Ventura
Enhancing Gait Recognition: Data Augmentation via Physics-Based Biomechanical Simulation Mritula Chandrasekaran, Jarek Francik, Dimitrios Makris
Enhancing Human-Robot Collaborative Search Through Efficient Space Sharing with On-Demand Bidirectional Interaction Nicholas Lim Hong Da, Jun Miura, Kotaro Hayashi
Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity Wassim A. El Ahmar, Dhanvin Kolhatkar, Farzan Erlik Nowruzi, Robert Laganière
Enhancing Weed Detection Performance by Means of GenAI-Based Image Augmentation Sourav Modak, Anthony Stein
Enstrect: A Stage-Based Approach to 2.5D Structural Damage Detection Christian Benz, Volker Rodehorst
EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans Nicola Garau, Giulia Martinelli, Niccolò Bisagno, Denis Tomè, Carsten Stoll
EPTQ: Enhanced Post-Training Quantization via Hessian-Guided Network-Wise Optimization Ofir Gordon, Elad Cohen, Hai Victor Habi, Arnon Netzer
ERF-NAS: Efficient Receptive Field-Based Zero-Shot NAS for Object Detection Xinyi Yu, Runan Yin, Zhihao Lin, Yongtao Wang
ES-PTAM: Event-Based Stereo Parallel Tracking and Mapping Suman Ghosh, Valentina Cavinato, Guillermo Gallego
EUFCC-CIR: A Composed Image Retrieval Dataset for GLAM Collections Francesc Net, Lluís Gómez
Evaluating Human Pose Estimation Algorithms for Resource-Constrained Smart Eyewear Device Hao Quan, Francesca Palermo, Simone Mentasti, Diana Trojaniello, Matteo Matteucci
Evaluating Image-Based Face and Eye Tracking with Event Cameras Khadija Iddrisu, Waseem Shariff, Noel E. O'Connor, Joseph Lemley, Suzanne Little
Evaluating Usability and Engagement of Large Language Models in Virtual Reality for Traditional Scottish Curling Ka Hei Carrie Lau, Efe Bozkir, Hong Gao, Enkelejda Kasneci
Evaluation Framework for Feedback Generation Methods in Skeletal Movement Assessment Tal Hakim
Evaluation of Illustration Generators with Domain-Specific Representations Tomoya Sawada, Marie Katsurai
EvDownsampling: A Robust Method for Downsampling Event Camera Data Anindya Ghosh, Thomas Nowotny, James C. Knight
Event Stream Super-Resolution Using Sigma Delta Neural Network Waseem Shariff, Joe Lemley, Peter Corcoran
Event-Based Motion Deblurring with Dual Channel Attention Weiqi Luo, Chi Zhang, Lei Yu
EventSleep: Sleep Activity Recognition with Event Cameras Carlos Plou, Nerea Gallego, Alberto Sabater, Pablo Urcola, Eduardo Montijano, Luis Montesano, Ruben Martinez-Cantin, Ana C. Murillo
Evolution of Detection Performance Throughout the Online Lifespan of Synthetic Images Dimitrios Karageorgiou, Quentin Bammey, Valentin Porcellini, Bertrand Goupil, Denis Teyssou, Symeon Papadopoulos
EVP: Enhanced Visual Perception Using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment Mykola Lavreniuk, Shariq Farooq Bhat, Matthias Müller, Peter Wonka
ExeChecker: Where Did I Go Wrong? Yiwen Gu, Mahir Patel, Margrit Betke
Explanation Alignment: Quantifying the Correctness of Model Reasoning at Scale Hyemin Bang, Angie W. Boggust, Arvind Satyanarayan
Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves Madeleine Darbyshire, Elizabeth Sklar, Simon Parsons
Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance Simone Maurizio La Cava, Sara Concas, Ruben Tolosana, Roberto Casula, Giulia Orrù, Martin Drahanský, Julian Fierrez, Gian Luca Marcialis
Exploring Multi-Modal Neural Scene Representations with Applications on Thermal Imaging Mert Özer, Maximilian Weiherer, Martin Hundhausen, Bernhard Egger
PDF
Exploring Strengths and Weaknesses of Super-Resolution Attack in Deepfake Detection Davide Alessandro Coccomini, Roberto Caldelli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
PDF
Exploring the Boundaries of Content Moderation in Text-to-Image Generation Piera Riccio, Georgina Curto, Nuria Oliver
PDF
FABRIC: Personalizing Diffusion Models with Iterative Feedback Dimitri von Rütte, Elisabetta Fedele, Jonathan Thomm, Lukas Wolf
PDF
FaceOracle: Chat with a Face Image Oracle Wassim Kabbani, Kiran B. Raja, Raghavendra Ramachandra, Christoph Busch
PDF
Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech Yunji Chu, Yunseob Shim, Unsang Park
PDF
Fairness of AI Systems in the Legal Context Veronica Paternolli, Mila Dalla Preda, Roberto Giacobazzi
PDF
Fairness Under Cover: Evaluating the Impact of Occlusions on Demographic Bias in Facial Recognition Rafael M. Mamede, Pedro C. Neto, Ana Filipa Sequeira
PDF
Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets Patrick Grommelt, Louis Weiss, Franz-Josef Pfreundt, Janis Keuper
PDF
FALCON: Fair Active Learning for Content Moderation Zuhui Wang, Sandra Sajeev, Gaurav Mittal, Matthew Hall, Ye Yu, Zhaozheng Yin, Mei Chen
PDF
Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion Hui Shen, Zhongwei Wan, Xin Wang, Mi Zhang
PDF
Fashion Attribute Extraction Under an Evolving Ontology Aditya Kanade, Manasi Patwardhan, Mayur Patidar, Lovekesh Vig, Bagyalakshmi Vasudevan
PDF
FastTalker: Jointly Generating Speech and Conversational Gestures from Text Zixin Guo, Jian Zhang
PDF
Feature Contribution in Monocular Depth Estimation Hui Yu Lau, Srinandan Dasmahapatra, Hansung Kim
PDF
Few-Shot Novel View Synthesis Using Depth Aware 3D Gaussian Splatting Raja Kumar, Vanshika Vats
PDF
Find the Assembly Mistakes: Error Segmentation for Industrial Applications Dan Lehman, Tim J. Schoonbeek, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen
PDF
Fine-Tuning for Bird Sound Classification: An Empirical Study David Stein, Bjoern Andres
PDF
FlexControl: Flexible and Efficient Full-Body Controllable Text-to-Motion Generation Qingyuan Liu, Ke Lu, Zehai Niu, Kun Dong, Jian Xue, Xiaoyu Qin, Jinbao Wang
PDF
Foreground-Aware Knowledge Distillation for Enhanced Damage Detection Pantelis Menteidis, Christos Papaioannidis, Ioannis Pitas
PDF
Foundation Model or Finetune? Evaluation of Few-Shot Semantic Segmentation for River Pollution Marga Don, Stijn Pinson, Blanca Guillen Cebrian, Yuki M. Asano
PDF
Frequency Matters: Explaining Biases of Face Recognition in the Frequency Domain Marco Huber, Fadi Boutros, Naser Damer
PDF
Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models Jun Xiao, Zihang Lyu, Hao Xie, Cong Zhang, Yakun Ju, Changjian Shui, Kin-Man Lam
PDF
From Flexibility to Manipulation: The Slippery Slope of XAI Evaluation Kristoffer Wickstrøm, Marina M.-C. Höhne, Anna Hedström
PDF
FruitBin: A Tunable Large-Scale Dataset for Advancing 6D Pose Estimation in Fruit Bin-Picking Automation Guillaume Duret, Mahmoud Ali, Nicolas Cazin, Danylo Mazurak, Anna Samsonenko, Alexandre Chapin, Florence Zara, Emmanuel Dellandréa, Liming Chen, Jan Peters
PDF
Garment Attribute Manipulation with Multi-Level Attention Vittorio Casula, Lorenzo Berlincioni, Luca Cultrera, Federico Becattini, Chiara Pero, Carmen Bisogni, Marco Bertini, Alberto Del Bimbo
PDF
GECO: GPT-Driven Estimation of 3D Human-Scene Contact in the Wild Chaehong Lee, Simranjit Singh, Michael Fore, Georgios Pavlakos, Dimitrios Stamoulis
PDF
Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones Carlos Plou, Pablo Pueyo, Ruben Martinez-Cantin, Mac Schwager, Ana C. Murillo, Eduardo Montijano
PDF
Generalizability Analysis of Deep Learning Predictions of Human Brain Responses to Augmented and Semantically Novel Visual Stimuli Valentyn Piskovskyi, Riccardo Chimisso, Sabrina Patania, Tom Foulsham, Giuseppe Vizzari, Dimitri Ognibene
PDF
Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image Sizes Sota Kato, Hinako Mitsuoka, Kazuhiro Hotta
PDF
Generalizing Fairness to Generative Language Models via Reformulation of Non-Discrimination Criteria Sara Sterlie, Nina Weng, Aasa Feragen
PDF
Generated Bias: Auditing Internal Bias Dynamics of Text-to-Image Generative Models Abhishek Mandal, Susan Leavy, Suzanne Little
PDF
Generating Binary Species Range Maps Filip Dorm, Christian Lange, Scott Loarie, Oisin Mac Aodha
PDF
Generative Dataset Distillation Based on Diffusion Model Duo Su, Junjie Hou, Guang Li, Ren Togo, Rui Song, Takahiro Ogawa, Miki Haseyama
PDF
Generative Dataset Distillation Using Min-Max Diffusion Model Junqiao Fan, Yunjiao Zhou, Min Chang Jordan Ren, Jianfei Yang
PDF
Generative Hierarchical Temporal Transformer for Hand Pose and Action Modeling Yilin Wen, Hao Pan, Takehiko Ohkawa, Lei Yang, Jia Pan, Yoichi Sato, Taku Komura, Wenping Wang
PDF
GeoTransfer: Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning Shubhendu Jena, Franck Multon, Adnane Boukhayma
PDF
Giving Each Task What It Needs: Leveraging Structured Sparsity for Tailored Multi-Task Learning Richa Upadhyay, Ronald Phlypo, Rajkumar Saini, Marcus Liwicki
PDF
Glia Cell Inspired Reinforcement Learning Agent for Neural Network Optimization Alessio Fagioli, Luigi Cinque, Damiano Distante, Gian Luca Foresti, Marco Cascio
PDF
GLoFool: Global Enhancements and Local Perturbations to Craft Adversarial Images Mirko Agarla, Andrea Cavallaro
PDF
Good Data Is All Imitation Learning Needs Amir Samadi, Konstantinos Koufos, Kurt Debattista, Mehrdad Dianati
PDF
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Yu Liu, Kai Chen, Ping Luo
PDF
Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints Keisuke Toida, Naoki Kato, Osamu Segawa, Takeshi Nakamura, Kazuhiro Hotta
PDF
Growing Deep Neural Network Considering with Similarity Between Neurons Taigo Sakai, Kazuhiro Hotta
PDF
GSK-C2F: Graph Skeleton Modelization for Action Segmentation and Recognition Using a Coarse-to-Fine Strategy Toufik Benmessabih, Rim Slama, Vincent Havard, David Baudry
PDF
GSTAM: Efficient Graph Distillation with Structural Attention-Matching Arash Rasti-Meymandi, Ahmad Sajedi, Zhaopan Xu, Konstantinos N. Plataniotis
PDF
Guidelines for Query and Gallery Image Extraction in Person Re-Identification Systems Rita Delussu, Lorenzo Putzu, Giorgio Fumera
PDF
Hand Gesture Recognition Using Dual Graph Hierarchical Edges Representation and Graph Transformer Network Mohamed Youssef Memmi, Rim Slama, Stefano Berretti
PDF
Hand2Any: Hand-to-Any Motion Mapping with Few-Shot User Adaptation for Avatar Manipulation Riku Shinohara, Atsushi Hashimoto, Tadashi Kozuno, Shigeo Yoshida, Yutaro Hirao, Monica Perusquía-Hernández, Hideaki Uchiyama, Kiyoshi Kiyokawa
PDF
HAVANA: Hierarchical Stochastic Neighbor Embedding for Accelerated Video ANnotAtions Alexandru Bobe, Jan C. van Gemert
PDF
HEAD: A Bandwidth-Efficient Cooperative Perception Approach for Heterogeneous Connected and Autonomous Vehicles Deyuan Qu, Qi Chen, Yongqi Zhu, Yihao Zhu, Sergei S. Avedisov, Song Fu, Qing Yang
PDF
Helios: An Extremely Low Power Event-Based Gesture Recognition for Always-on Smart Eyewear Prarthana Bhattacharyya, Joshua Mitton, Ryan Page, Owen Morgan, Ben Menzies, Gabriel Homewood, Kemi Jacobs, Paolo Baesso, Dave Trickett, Chris Mair, Taru Muhonen, Rory Clark, Louis Berridge, Richard Vigars, Iain Wallace
PDF
High Dynamic Range Modulo Imaging for Robust Object Detection in Autonomous Driving Kebin Contreras, Brayan Monroy, Jorge Bacca
PDF
High-Frequency Near-Eye Ground Truth for Event-Based Eye Tracking Andrea Simpsi, Andrea Aspesi, Simone Mentasti, Luca Merigo, Tommaso Ongarello, Matteo Matteucci
PDF
Higher Fidelity Perceptual Image and Video Compression with a Latent Conditioned Residual Denoising Diffusion Model Jonas Brenig, Radu Timofte
PDF
How Green Is Continual Learning, Really? Analyzing the Energy Consumption in Continual Training of Vision Foundation Models Tomaso Trinci, Simone Magistri, Roberto Verdecchia, Andrew D. Bagdanov
PDF
How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition Pedro C. Neto, Ivona Colakovic, Saso Karakatic, Ana Filipa Sequeira
PDF
How to Squeeze an Explanation Out of Your Model Tiago Roxo, Joana Cabral Costa, Pedro R. M. Inácio, Hugo Proença
PDF
How Were You Created? Explaining Synthetic Face Images Generated by Diffusion Models Bhushan Atote, Victor Sanchez
PDF
HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision Burak Ercan, Onur Eker, Aykut Erdem, Erkut Erdem
PDF
Human-Based Low-Level Visual Processing Neural Network for Image Segmentation Alessio Fagioli, Luigi Cinque, Damiano Distante, Gian Luca Foresti, Marco Cascio
PDF
HumanSim: Human-like Multi-Agent Novel Driving Simulation for Corner Case Generation Lingfeng Zhou, Mohan Jiang, Dequan Wang
PDF
Hybrid Spatial-Spectral Neural Network for Hyperspectral Image Denoising Hao Liang, Chengjie Ke, Kun Li
PDF
HybridFormer: Bridging Local and Global Spatio-Temporal Dynamics for Efficient Skeleton-Based Action Recognition Zeyun Zhong, Tianrui Li, Manuel Martin, Mickael Cormier, Chengzhi Wu, Frederik Diederichs, Juergen Beyerer
PDF
Hyperbolic Learning with Multimodal Large Language Models Paolo Mandica, Luca Franco, Konstantinos Kallidromitis, Suzanne Petryk, Fabio Galasso
PDF
Hyperbolic Metric Learning for Visual Outlier Detection Álvaro González-Jiménez, Simone Lionetti, Dena Bazazian, Philippe Gottfrois, Fabian Gröger, Alexander A. Navarini, Marc Pouly
PDF
Hyperspectral Imaging and Computer Vision Based Remote Monitoring of SO2 Emissions in Maritime Vessels Arnoud Jochemsen, Hege Indresand, Martin Chamberland, Etienne Drouin, Jan Robert Fiksdal, Xuan Zhang, Nabil Belbachir
PDF
I-Design: Personalized LLM Interior Designer Ata Çelen, Guo Han, Konrad Schindler, Luc Van Gool, Iro Armeni, Anton Obukhov, Xi Wang
PDF
Ig3D: Integrating 3D Face Representations in Facial Expression Inference Lu Dong, Xiao Wang, Srirangaraj Setlur, Venu Govindaraju, Ifeoma Nwogu
PDF
iIPPC-V2X: Multi-Modality Fusion Perception System for Cooperative Vehicle Infrastructure System with Self-Supervised Learning Guoyu Zhang, Rongjie Yu, Jian Sun, Peng Hang
PDF
Image Color Consistency in Datasets: The Smooth-TPS3D Method Ismael Benito-Altamirano, David Martínez-Carpena, Hanna Lizarzaburu-Aguilar, Carles Ventura, Cristian Fàbrega, Joan Daniel Prades
PDF
Image Translation with Kernel Prediction Networks for Semantic Segmentation Cristina Mata, Michael S. Ryoo, Henrik Turbell
PDF
Image-Guided Topic Modeling for Interpretable Privacy Classification Alina Elena Baia, Andrea Cavallaro
PDF
Improved Baselines for Data-Efficient Perceptual Augmentation of LLMs Théophane Vallaeys, Mustafa Shukor, Matthieu Cord, Jakob Verbeek
PDF
Improving Face Generation Quality and Prompt Following with Synthetic Captions Michail Tarasiou, Stylianos Moschoglou, Jiankang Deng, Stefanos Zafeiriou
PDF
Improving Generalization in Visual Reasoning via Self-Ensemble Tien-Huy Nguyen, Quang-Khai Tran, Anh-Tuan Quang-Hoang
PDF
Improving Hyperparameter Optimization with Checkpointed Model Weights Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat, James Lucas, Arun George Zachariah
PDF
Improving in Situ Real-Time Classification of Long-Tail Marine Plankton Images for Ecosystem Studies Noushin Eftekhari, Sophie Pitois, Mojtaba Masoudi, Robert E. Blackwell, James Scott, Sarah L. C. Giering, Matthew Fry
PDF
Improving Online Source-Free Domain Adaptation for Object Detection by Unsupervised Data Acquisition Xiangyu Shi, Yanyuan Qiao, Qi Wu, Lingqiao Liu, Feras Dayoub
PDF
Improving Post-Earthquake Crack Detection Using Semi-Synthetic Generated Images Piercarlo Dondi, Alessio Gullotti, Michele Inchingolo, Ilaria Senaldi, Chiara Casarotti, Luca Lombardi, Marco Piastra
PDF
Incremental and Decremental Continual Learning for Privacy-Preserving Video Recognition Lorenzo Caselli, Simone Magistri, Tommaso Bianconcini, Andrea Benericetti, Douglas Coimbra de Andrade, Andrew D. Bagdanov
PDF
Integrating Local and Global Interpretability for Deep Concept-Based Reasoning Models David Debot, Giuseppe Marra
PDF
Interactive Explainable Anomaly Detection for Industrial Settings Daniel Gramelt, Timon Höfer, Ute Schmid
PDF
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective Tim Bader, Leon Eisemann, Adrian Pogorzelski, Namrata Jangid, Attila-Balazs Kis
PDF
Introducing Gating and Context into Temporal Action Detection Aglind Reka, Diana Laura Borza, Dominick Reilly, Michal Balazia, François Brémond
PDF
IPAdapter-Instruct: Resolving Ambiguity in Image-Based Conditioning Using Instruct Prompts Ciara Rowles, Shimon Vainer, Dante De Nigris, Slava Elizarov, Konstantin Kutsy, Simon Donné
PDF
KAN You See It? KANs and Sentinel for Effective and Explainable Crop Field Segmentation Daniele Rege Cambrin, Eleonora Poeta, Eliana Pastor, Tania Cerquitelli, Elena Baralis, Paolo Garza
PDF
KAN-Mixer: Kolmogorov-Arnold Networks for Gene Expression Prediction in Plant Species Jin Gao, Juntu Zhao, Keyu Li, Dequan Wang
PDF
Khattat: Enhancing Readability and Concept Representation of Semantic Typography Ahmed Hussein, Alaa Elsetohy, Sama Hadhoud, Tameem Bakr, Yasser Rohaim, Badr AlKhamissi
PDF
KRONC: Keypoint-Based Robust Camera Optimization for 3D Car Reconstruction Davide Di Nucci, Alessandro Simoni, Matteo Tomei, Luca Ciuffreda, Roberto Vezzani, Rita Cucchiara
PDF
Landmark-Based Screening: Femoral Head Coverage and Graf Classification in Infant Developmental Dysplasia of the Hip Allison Clement, Abhinav Singh, Irina Voiculescu
PDF
LanPose: Language-Instructed 6D Object Pose Estimation for Robotic Assembly Bowen Fu, Sek Kun Leong, Yan Di, Gu Wang, Jiwen Tang, Federico Tombari, Xiangyang Ji
PDF
LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model Nasim Jamshidi Avanaki, Abhijay Ghildyal, Nabajeet Barman, Saman Zadtootaghaj
PDF
Larval Hostplant Prediction from Luehdorfia Japonica Image Using Multi-Label ABN Tsubasa Hirakawa, Takaaki Arai, Takayoshi Yamashita, Hironobu Fujiyoshi, Yuichi Oba, Hiromichi Fukui, Masaya Yago
PDF
Latent Distillation for Continual Object Detection at the Edge Francesco Pasti, Marina Ceccon, Davide Dalle Pezze, Francesco Paissan, Elisabetta Farella, Gian Antonio Susto, Nicola Bellotto
PDF
Learning from Strong to Weak: An Enhanced Quality Comparison Network via Efficient Transfer Learning Yunchen Zhang, Xiangkai Xu, Hong Gao, Ji Shi, Yiming Bao, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu
PDF
Learning Multi-Manifold Embedding for Out-of-Distribution Detection Jeng-Lin Li, Ming-Ching Chang, Wei-Chao Chen
PDF
Level up Your Tutorials: VLMs for Game Tutorials Quality Assessment Daniele Rege Cambrin, Gabriele Scaffidi Militone, Luca Colomba, Giovanni Malnati, Daniele Apiletti, Paolo Garza
PDF
Leveraging FINCH and K-Means for Enhanced Cluster-Based Instance Selection Panagiota Zotou, Konstantinos Bacharidis, Antonis A. Argyros
PDF
Leveraging Key-Points Encoded Human Pose Images for Human Activity Recognition Gaia Virginia Dobici, Luca Minutillo, Ermanno Cordelli, Francesco Chirico, Goffredo Foglia, Paolo Soda
PDF
Leveraging Object Priors for Point Tracking Bikram Boote, Anh Thai, Wenqi Jia, Ozgur Kara, Stefan Stojanov, James M. Rehg, Sangmin Lee
PDF
LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field Huan Wang, Feitong Tan, Ziqian Bai, Yinda Zhang, Shichen Liu, Qiangeng Xu, Menglei Chai, Anish Prabhu, Rohit Pandey, Sean Fanello, Zeng Huang, Yun Fu
PDF
Lightweight Deep Learning Model for Defective Pixel Detection and Recovery from the Image Sensors Ganzorig Gankhuyag, Byoung-Il Mun, Changyun Cho, Jinman Park, Haengseon Son, Kyoungwon Min
PDF
Limited but Consistent Gains in Adversarial Robustness by Co-Training Object Recognition Models with Human EEG Manshan Guo, Bhavin Choksi, Sari Sadiya, Alessandro T. Gifford, Martina G. Vilas, Radoslaw Martin Cichy, Gemma Roig
PDF
Lincoln's Annotated Spatio-Temporal Strawberry Dataset (LAST-Straw) Katherine Margaret Frances James, Karoline Heiwolt, Daniel James Sargent, Grzegorz Cielniak
PDF
Llama-NAS: Efficient Neural Architecture Search for Large Language Models Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan
PDF
LLaMAPed: Multi-Modal Pedestrian Crossing Intention Prediction Je-Seok Ham, Sunghun Kim, Jia Huang, Peng Jiang, Jinyoung Moon, Srikanth Saripalli, Changick Kim
PDF
Localization-Guided Supervision for Robust Medical Image Classification by Vision Transformers Sagi Ben Itzhak, Nahum Kiryati, Orith Portnoy, Arnaldo Mayer
PDF
LocalMamba: Visual State Space Model with Windowed Selective Scan Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu
PDF
Logit Disagreement: OoD Detection with Bayesian Neural Networks Kevin Raina
PDF
Loop Mining Large-Scale Unlabeled Data for Corner Case Detection in Autonomous Driving Jiawei Zhao, Yiting Duan, Jinming Su, Wangwang Yang, Tingyi Guo, Xingyue Chen, Junfeng Luo
PDF
Lossy Encoding of Time-Aggregated Neuromorphic Vision Sensor Data Based on Point Cloud Compression Jayasingam Adhuran, Nabeel Khan, Maria G. Martini
PDF
Low-Cost Stereoscopic Optical-Coding Design for Depth Estimation Using End-to-End Optimization Jhon Lopez, Edwin Vargas, Andrés Jerez, Henry Arguello
PDF
LSVOS Challenge Report: Large-Scale Complex and Long Video Object Segmentation Henghui Ding, Lingyi Hong, Chang Liu, Ning Xu, Linjie Yang, Yuchen Fan, Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Jinming Chai, Qin Ma, Junpei Zhang, Licheng Jiao, Fang Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Xu Liu, Lingling Li, Hao Fang, Feiyu Pan, Xiankai Lu, Wei Zhang, Runmin Cong, Tuyen Tran, Bin Cao, Yisi Zhang, Hanyi Wang, Xingjian He, Jing Liu
PDF
LucidDreaming: Controllable Object-Centric 3D Generation Zhaoning Wang, Ming Li, Chen Chen
PDF
LVG-SfM: Learning-Based View-Graph Generation for Robust On-the-Fly SfM Wentian Gan, Yifei Yu, Giulio Perda, Luca Morelli, Rui Xia, Zongqian Zhan, Xin Wang, Fabio Remondino
PDF
MACGaussian: Robust 3D Gaussian Splatting from Sparse Input Views Using High-Precision Measurement-Arm-Camera (MAC) Capture Saptarshi Neil Sinha, Muhammad Ali Shahid, Michael Weinmann
PDF
Machine Learning Approaches for Analyzing Physiological Data in Remote Patient Monitoring Anuradha Banerjee, Abu Sufian, Marco Leo
PDF
Machine Learning-Driven Marketing Personas for the Luxury Fashion Market Rocco Pietrini, Alessandro Galdelli, Adriano Mancini, Emanuele Frontoni, Primo Zingaretti
PDF
Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma, Daquan Zhou, Xue-She Wang, Chun-Hsiao Yeh, Xiuyu Li, Huanrui Yang, Zhen Dong, Kurt Keutzer, Jiashi Feng
PDF
Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors Fahad Shamshad, Muzammal Naseer, Karthik Nandakumar
PDF
Making Images from Images: Tightly Constrained Parallel Denoising Shumeet Baluja, David Marwood, Ashwin Baluja
PDF
Manipulating and Mitigating Generative Model Biases Without Retraining Jordan Vice, Naveed Akhtar, Richard I. Hartley, Ajmal Mian
PDF
MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs' Cooperative Decision-Making Yicheng Guo, Jiaqi Liu, Rongjie Yu, Peng Hang, Jian Sun
PDF
MaskSDM: Adaptive Species Distribution Modeling Through Data Masking Robin Zbinden, Nina Van Tiel, Gencer Sumbul, Benjamin Kellenberger, Devis Tuia
PDF
Massively Multi-Person 3D Human Motion Forecasting with Scene Context Felix B. Mueller, Julian Tanke, Juergen Gall
PDF
Maximally Separated Active Learning Tejaswi Kasarla, Abhishek Jha, Faye Tervoort, Rita Cucchiara, Pascal Mettes
PDF
MCRE: Multimodal Conditional Representation and Editing for Text-Motion Generation Tengjiao Sun, Xiang Li, Tianyu Shi, Jiahui Peng, Sheng Zheng, Hansung Kim
PDF
MCUBench: A Benchmark of Tiny Object Detectors on MCUs Sudhakar Sah, Darshan C. Ganji, Matteo Grimaldi, Ravish Kumar, Alexander Hoffman, Honnesh Rohmetra, Ehsan Saboori
PDF
MDiFF: Exploiting Multimodal Score-Based Diffusion Models for New Fashion Product Performance Forecasting Andrea Avogaro, Luigi Capogrosso, Franco Fummi, Marco Cristani
PDF
MEDCO: Medical Education Copilots Based on a Multi-Agent Framework Hao Wei, Jianing Qiu, Haibao Yu, Wu Yuan
PDF
Medical Image Segmentation with SAM-Generated Annotations Iira Häkkinen, Iaroslav Melekhov, Erik Englesson, Hossein Azizpour, Juho Kannala
PDF
Memory-Efficient Vision Transformers: An Activation-Aware Mixed-Rank Compression Strategy Seyedarmin Azizi, Mahdi Nazemi, Massoud Pedram
PDF
Memory-Optimized Once-for-All Network Maxime Girard, Victor Quétu, Samuel Tardieu, Van-Tam Nguyen, Enzo Tartaglione
PDF
Meta Learning-Driven Iterative Refinement for Robust Anomaly Detection in Industrial Inspection Muhammad Aqeel, Shakiba Sharifi, Marco Cristani, Francesco Setti
PDF
MI-NeRF: Learning a Single NeRF for Multiple Identities Aggelina Chatziagapi, Grigorios G. Chrysos, Dimitris Samaras
PDF
Millisecond-Latency Visual Fault-Buttons Using Event-Cameras Stefano Chiavazza, Chiara Bartolozzi, Arren Glover
PDF
Mining Field Data for Tree Species Recognition at Scale Dimitri Gominski, Daniel Ortiz-Gonzalo, Martin Brandt, Maurice Mugabowindekwe, Rasmus Fensholt
PDF
Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla, Chiara Di Vece, Rema Daher, Xinwei Ju, Danail Stoyanov, Francisco Vasconcelos, Sophia Bano
PDF
Mixed Non-Linear Quantization for Vision Transformers Gihwan Kim, Jemin Lee, Sihyeong Park, Yongin Kwon, Hyungshin Kim
PDF
MM2Latent: Text-to-Facial Image Generation and Editing in GANs with Multimodal Assistance Debin Meng, Christos Tzelepis, Ioannis Patras, Georgios Tzimiropoulos
PDF
MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation Dimitrios Kollias, Andreas Psaroudakis, Anastasios Arsenos, Paraskevi Theofilou, Chunchang Shao, Guanyu Hu, Ioannis Patras
PDF
MobileIQA: Exploiting Mobile-Level Diverse Opinion Network for No-Reference Image Quality Assessment Using Knowledge Distillation Zewen Chen, Sunhan Xu, Yun Zeng, Haochen Guo, Jian Guo, Shuai Liu, Juan Wang, Bing Li, Weiming Hu, Dehua Liu, Hesong Li
PDF
Modelling the Distribution of Human Motion for Sign Language Assessment Oliver Cory, Ozge Mercanoglu Sincan, Matthew J. Vowels, Alessia Battisti, Franz Holzknecht, Katja Tissi, Sandra Sidler-Miserez, Tobias Haug, Sarah Ebling, Richard Bowden
PDF
Monitoring Viewer Attention During Online Ads Mina Bishay, Graham Page, Waleed Emad, Mohammad Mavadati
PDF
MOSAIC: Skeleton-Based Human Motion Recognition with Compositional Representations Federico Figari Tomenotti, Nicoletta Noceti
PDF
Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking Zehai Niu, Ke Lu, Kun Dong, Jian Xue, Xiaoyu Qin, Jinbao Wang
PDF
MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of Mice Friedhelm Hamann, Hanxiong Li, Paul Mieske, Lars Lewejohann, Guillermo Gallego
PDF
MPL: Lifting 3D Human Pose from Multi-View 2D Poses Seyed Abolfazl Ghasemzadeh, Alexandre Alahi, Christophe De Vleeschouwer
PDF
MPVO: Motion-Prior Based Visual Odometry for PointGoal Navigation Sayan Paul, Ruddra Dev Roychoudhury, Brojeshwar Bhowmick
PDF
MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition Eduarda Caldeira, Jaime S. Cardoso, Ana Filipa Sequeira, Pedro C. Neto
PDF
Multi-Agent Collaborative Perception for Robotic Fleet: A Systematic Review Apoorv Singh, Gaurav Raut, Alka Choudhary
PDF
Multi-Camera Industrial Open-Set Person Re-Identification and Tracking Federico Cunico, Marco Cristani
PDF
Multi-Label Out-of-Distribution Detection via Evidential Learning Eduardo Aguilar, Bogdan Raducanu, Petia Radeva
PDF
Multi-Scale and Multimodal Species Distribution Modeling Nina Van Tiel, Robin Zbinden, Emanuele Dalsasso, Benjamin Kellenberger, Loïc Pellissier, Devis Tuia
PDF
Multi-Task Affective Behaviour Analysis Based on MT-EmotiNet Models Andrey V. Savchenko
PDF
Multi-View Pose Fusion for Occlusion-Aware 3D Human Pose Estimation Laura Bragagnolo, Matteo Terreran, Davide Allegro, Stefano Ghidoni
PDF
Multimodal Computer Vision Techniques for Wooden Utility Pole Density Estimation with Contact-Free Sensing Luis Gonzalez-Naharro, Arnoud Jochemsen, Nabil Belbachir, Erik T. Hauge
PDF
Multimodal Fusion Strategies for Mapping Biophysical Landscape Features Lucia Gordon, Nico Lang, Catherine Ressijac, Andrew Davies
PDF
MVP: Multimodal Emotion Recognition Based on Video and Physiological Signals Valeriya Strizhkova, Hadi Kachmar, Hava Chaptoukaev, Raphael Kalandadze, Natia Kukhilava, Tatia Tsmindashvili, Nibras Abo-Alzahab, Maria A. Zuluaga, Michal Balazia, Antitza Dantcheva, François Brémond, Laura M. Ferrari
PDF
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition Mallika Garg, Debashis Ghosh, Pyari Mohan Pradhan
PDF
N Heads Are Better than One: Exploring Theoretical Performance Bounds of 3D Face Reconstruction Methods Will Rowan, Patrik Huber, Nick E. Pears, Andrew Keeling
PDF
NeAT: Neural Artistic Tracing for High Resolution Style Transfer Dan Ruta, Andrew Gilbert, John P. Collomosse, Eli Shechtman, Nicholas I. Kolkin
PDF
NeRF-Supervised Feature Point Detection and Description Ali Youssef, Francisco Vasconcelos
PDF
NeRFmentation: Improving Monocular Depth Estimation with NeRF-Based Data Augmentation Casimir Feldmann, Niall Siegenheim, Nikolas Hars, Lovro Rabuzin, Mert Ertugrul, Luca Wolfart, Marc Pollefeys, Zuria Bauer, Martin R. Oswald
PDF
Neural Transcoding Vision Transformers for EEG-to-fMRI Synthesis Romeo Lanzino, Federico Fontana, Luigi Cinque, Francesco Scarcello, Atsuto Maki
PDF
Neuromorphic Drone Detection: An Event-RGB Multimodal Approach Gabriele Magrini, Federico Becattini, Pietro Pala, Alberto Del Bimbo, Antonio Porta
PDF
Neuromorphic Facial Analysis with Cross-Modal Supervision Federico Becattini, Luca Cultrera, Lorenzo Berlincioni, Claudio Ferrari, Andrea Leonardo, Alberto Del Bimbo
PDF
Neurosymbolic Visual Transform Based on Logic Tensor Network for Defect Detection Youcef Djenouri, Ahmed Nabil Belbachir, Asma Belhadi, Tomasz P. Michalak
PDF
NIGHT - Non-Line-of-Sight Imaging from Indirect Time of Flight Data Matteo Caligiuri, Adriano Simonetto, Pietro Zanuttigh
PDF
NimbleD: Enhancing Self-Supervised Monocular Depth Estimation with Pseudo-Labels and Large-Scale Video Pre-Training Albert Luginov, Muhammad Shahzad
PDF
Non-Verbal Interaction and Interface with a Quadruped Robot Using Body and Hand Gestures: Design and User Experience Evaluation Soohyun Shin, Trevor Evetts, Hunter Saylor, Hyunji Kim, Soojin Woo, Wonhwha Rhee, Seong-Woo Kim
PDF
Normalized Validity Scores for DNNs in Regression Based Eye Feature Extraction and Real-Time Models for the Raspberry Pi Wolfgang Fuhl
PDF
Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces Shumei Liu, Haiting Huang, Mathias Zinnen, Andreas K. Maier, Vincent Christlein
PDF
NToP: NeRF-Powered Large-Scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images Jingrui Yu, Dipankar Nandi, Roman Seidel, Gangolf Hirtz
PDF
Object Pose Estimation Using Implicit Representation for Transparent Objects Varun Burde, Artem Moroz, Vit Zeman, Pavel Burget
PDF
On Camera and LiDAR Positions in End-to-End Autonomous Driving Malte Stelzer, Jan Pirklbauer, Jan Bickerdt, Volker Schomerus, Jan Piewek, Thorsten Bagdonat, Tim Fingscheidt
PDF
On Scaling up 3D Gaussian Splatting Training Hexu Zhao, Haoyang Weng, Daohan Lu, Ang Li, Jinyang Li, Aurojit Panda, Saining Xie
PDF
On the Application of Egocentric Computer Vision to Industrial Inspection Vivek Chavan, Oliver Heimann, Jörg Krüger
PDF
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes Sadia Ilyas, Ido Freeman, Matthias Rottmann
PDF
On the Relationship Between Visual Anomaly-Free and Anomalous Representations Riya Sadrani, Ayush Bachan, Hrishikesh Sharma
PDF
One-Shot Image Restoration Deborah Pereg
PDF
Online Learning via Memory: Retrieval-Augmented Detector Adaptation Yanan Jian, Fuxun Yu, Qi Zhang, William Levine, Brandon Dubbs, Nikolaos Karianakis
PDF
Online Stochastic Optimization for Data with Temporal Dependencies Shivang Patel, Ram J. Zaveri, Samuel Chambers, Zaigham A. Randhawa, Gianfranco Doretto
PDF
Open-Set Object Detection: Towards Unified Problem Formulation and Benchmarking Hejer Ammar, Nikita Kiselov, Guillaume Lapouge, Romaric Audigier
PDF
Open-Set Plankton Recognition Joona Kareinen, Annaliina Skyttä, Tuomas Eerola, Kaisa Kraft, Lasse Lensu, Sanna Suikkanen, Maiju Lehtiniemi, Heikki Kälviäinen
PDF
Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts Prakash Chandra Chhipa, Kanjar De, Meenakshi Subhash Chippa, Rajkumar Saini, Marcus Liwicki
PDF
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez Ur Rahman, Piero Simonetto, Anna Polato, Francesco Pasti, Luca Tonin, Sebastiano Vascon
PDF
OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare Longfei Chen, Subramanian Ramamoorthy, Robert B. Fisher
PDF
Optimal OnTheFly Feedback Control of Event Sensors Valery Vishnevskiy, Greg Burman, Sebastian Kozerke, Diederik Paul Moeys
PDF
Optimization of Layer Skipping and Frequency Scaling for Convolutional Neural Networks Under Latency Constraint Minh David Thao Chan, Ruoyu Zhao, Yukuan Jia, Ruiqing Mao, Sheng Zhou
PDF
Optimizing Dataset Distillation Using DATM: Adjusting Learning Rate and Upper Bound Minjun Kim, Junhee Cho, Junseok Kwon
PDF
Optimizing Resource Consumption in Diffusion Models Through Hallucination Early Detection Federico Betti, Lorenzo Baraldi, Lorenzo Baraldi, Rita Cucchiara, Nicu Sebe
PDF
Ordinal-Meta Learning for Fine-Grained Fruit Quality Prediction Aayush Mishra, Manasi Patwardhan, Parijat Deshpande, Beena Rai
PDF
OSSA: Unsupervised One-Shot Style Adaptation Robin Gerster, Holger Caesar, Matthias Rapp, Alexander Wolpert, Michael Teutsch
PDF
PackMamba: Efficient Processing of Variable-Length Sequences in Mamba Training Haoran Xu, Ziqian Liu, Rong Fu, Zhongling Su, Zerui Wang, Zheng Cai, Zhilin Pei, Xingcheng Zhang
PDF
PAFUSE: Part-Based Diffusion for 3D Whole-Body Pose Estimation Nermin Samet, Cédric Rommel, David Picard, Eduardo Valle
PDF
PCR-99: A Practical Method for Point Cloud Registration with 99 Percent Outliers Seong Hun Lee, Javier Civera, Patrick Vandewalle
PDF
PDB UNet: A Spatio Temporal Video Fixed Pattern Noise Removal Network Hortensia Barral, Pablo Arias, Axel Davy
PDF
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis Davide Bucciarelli, Nicholas Moratelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
PDF
Perspective-Equivariance for Unsupervised Imaging with Camera Geometry Andrew Wang, Mike Davies
PDF
Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance Yuanyou Xu, Zongxin Yang, Yi Yang
PDF
Pixels of Faith: Exploiting Visual Saliency to Detect Religious Image Manipulation Giuseppe Cartella, Vittorio Cuculo, Marcella Cornia, Marco Papasidero, Federico Ruozzi, Rita Cucchiara
PDF
PlaMo: Plan and Move in Rich 3D Physical Environments Assaf Hallak, Gal Dalal, Chen Tessler, Kelly Guo, Shie Mannor, Gal Chechik
PDF
POLO - Point-Based, Multi-Class Animal Detection Giacomo May, Emanuele Dalsasso, Benjamin Kellenberger, Devis Tuia
PDF
Pose-Independent 3D Anthropometry from Sparse Data David Bojanic, Stefanie Wuhrer, Tomislav Petkovic, Tomislav Pribanic
PDF
PoTATO: A Dataset for Analyzing Polarimetric Traces of Afloat Trash Objects Luis F. W. Batista, Salim Khazem, Mehran Adibi, Seth Hutchinson, Cédric Pradalier
PDF
Practical Dataset Distillation Based on Deep Support Vectors Hyunho Lee, Junhoo Lee, Nojun Kwak
PDF
Predicting Emotions in Interpersonal Interaction Videos: I Know What You Feel Hajer Guerdelli, Claudio Ferrari, Stefano Berretti, Walid Barhoumi, Alberto Del Bimbo
PDF
PRISM: Progressive Restoration for Scene Graph-Based Image Manipulation Pavel Jahoda, Yousef Yeganeh, Ehsan Adeli, Nassir Navab, Azade Farshad
PDF
ProGBA: Prompt Guided Bayesian Augmentation for Zero-Shot Domain Adaptation Jian Zou, Guanglei Yang, Tao Luo, Chun-Mei Feng, Wangmeng Zuo
PDF
Prompt and Prejudice Lorenzo Berlincioni, Luca Cultrera, Federico Becattini, Marco Bertini, Alberto Del Bimbo
PDF
Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models Deepak Sridhar, Nuno Vasconcelos
PDF
ProxyDR: Deep Hyperspherical Metric Learning with Distance Ratio-Based Formulation Hyeongji Kim, Changkyu Choi, Michael Kampffmeyer, Terje Berge, Pekka Parviainen, Ketil Malde
PDF
Pruning by Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin
PDF
Pushing Joint Image Denoising and Classification to the Edge Thomas C. Markhorst, Jan C. van Gemert, Osman Semih Kayhan
PDF
Pushing the Boundaries of Event Subsampling in Event-Based Video Classification Using CNNs Hesam Araghi, Jan van Gemert, Nergis Tomen
PDF
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo, Jinyu Yang, Jungong Han, Feng Zheng, Bin Cao, Yisi Zhang, Xuanxu Lin, Xingjian He, Bo Zhao, Jing Liu, Feiyu Pan, Hao Fang, Xiankai Lu
PDF
QSD: Query-Selection Denoising Score for Image Editing in Latent Diffusion Model Jungmin Hwang, Changwon Lim, Wonsook Lee
PDF
Real-Time 2nd-Order Gaze Metrics Andrew T. Duchowski, Krzysztof Krejtz, Izabela Krejtz
PDF
Real-Time Neural Cloth Deformation Using a Compact Latent Space and a Latent Vector Predictor Chanhaeng Lee, Maksym Perepichka, Saeed Ghorbani, Sudhir P. Mudur, Eric Paquette, Tiberiu Popa
PDF
Recent Event Camera Innovations: A Survey Bharatesh Chakravarthi, Aayush Atul Verma, Kostas Daniilidis, Cornelia Fermüller, Yezhou Yang
PDF
Reducing Catastrophic Forgetting in Online Class Incremental Learning Using Self-Distillation Kotaro Nagata, Hiromu Ono, Kazuhiro Hotta
PDF
ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable Yuan Yin, Pegah Khayatan, Éloi Zablocki, Alexandre Boulch, Matthieu Cord
PDF
RegionGrasp: A Novel Task for Contact Region Controllable Hand Grasp Generation Yilin Wang, Chuan Guo, Li Cheng, Hai Jiang
PDF
Reliable Probabilistic Human Trajectory Prediction for Autonomous Applications Manuel Hetzel, Hannes Reichert, Konrad Doll, Bernhard Sick
PDF
RenDetNet: Weakly-Supervised Shadow Detection with Shadow Caster Verification Nikolina Kubiak, Elliot Wortman, Armin Mustafa, Graeme Phillipson, Stephen Jolly, Simon Hadfield
PDF
Representation Learning in a Decomposed Encoder Design for Bio-Inspired Hebbian Learning Achref Jaziri, Sina Ditzel, Iuliia Pliushch, Visvanathan Ramesh
PDF
REST-HANDS: Rehabilitation with Egocentric Vision Using Smartglasses for Treatment of Hands After Surviving Stroke Wiktor Mucha, Kentaro Tanaka, Martin Kampel
PDF
Rethinking HTG Evaluation: Bridging Generation and Recognition Konstantina Nikolaidou, George Retsinas, Giorgos Sfikas, Marcus Liwicki
PDF
Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models Kengo Nakata, Daisuke Miyashita, Youyang Ng, Yasuto Hoshi, Jun Deguchi
PDF
Rethinking the Role of Infrastructure in Collaborative Perception Hyunchul Bae, Minhee Kang, Minwoo Song, Heejin Ahn
PDF
Retrieval of Sun-Induced Plant Fluorescence in the O2-A Absorption Band from DESIS Imagery Jim Buffat, Miguel Pato, Kevin Alonso, Stefan Auer, Emiliano Carmona, Stefan W. Maier, Rupert Müller, Patrick Rademske, Uwe Rascher, Hanno Scharr
PDF
Reversible and Cascaded Lightweight Colour Constancy: Jointly Addressing Illumination Correction and White Balance Zihao Guo, Fei Li, Rujie Liu, Arisu Endo, Takashi Kikuchi, Shun Takeuchi
PDF
Revisiting Relevance Feedback for CLIP-Based Interactive Image Retrieval Ryoya Nara, Yu-Chieh Lin, Yuji Nozawa, Youyang Ng, Goh Itoh, Osamu Torii, Yusuke Matsui
PDF
RGMIM: Region-Guided Masked Image Modeling for Learning Meaningful Representations from X-Ray Images Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama
PDF
RLNet: Adaptive Fusion of 4D Radar and LiDAR for 3D Object Detection Ruoyu Xu, Zhiyu Xiang
PDF
RMT-BVQA: Recurrent Memory Transformer Based Blind Video Quality Assessment for Enhanced Video Content Tianhao Peng, Chen Feng, Duolikun Danier, Fan Zhang, Benoit Quentin Arthur Vallade, Alex Mackin, David Bull
PDF
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (Early Version) Yao Mu, Tianxing Chen, Shijia Peng, Zanxin Chen, Zeyu Gao, Yude Zou, Lunkai Lin, Zhiqiang Xie, Ping Luo
PDF
Robust Fine-Tuning and Adaptation of Zero-Shot Models via Adaptive Weight-Space Ensembling Mario Döbler, Michael Feil, Robert A. Marsden, Bin Yang
PDF
Robust Single Rotation Averaging Revisited Seong Hun Lee, Javier Civera
PDF
Robust UDA for Crop and Weed Segmentation: Multi-Scale Attention and Style-Adaptive Techniques Numair Nadeem, Muhammad Hamza Asad, Abdul Bais
PDF
Robustness to Spurious Correlation: A Comprehensive Review Mohammadjavad Maheronnaghsh, Taha Akbari Alvanagh
PDF
RoCOCO: Robustness Benchmark of MS-COCO to Stress-Test Image-Text Matching Models Seulki Park, Daeho Um, Hajung Yoon, Sanghyuk Chun, Sangdoo Yun
PDF
ROMEO: Revisiting Optimization Methods for Reconstructing 3D Human-Object Interaction Models from Images Alexey Gavryushin, Yifei Liu, Daoji Huang, Yen-Ling Kuo, Julien Valentin, Luc Van Gool, Otmar Hilliges, Xi Wang
PDF
RoSA Dataset: Road Construct Zone Segmentation for Autonomous Driving Jinwoo Kim, Kyounghwan An, Donghwan Lee
PDF
RoWeeder: Unsupervised Weed Mapping Through Crop-Row Detection Pasquale De Marinis, Gennaro Vessio, Giovanna Castellano
PDF
RP3D: A Roadside Perception Framework for 3D Object Detection via Multi-View Sensor Fusion Shaowu Zheng, Ruyi Huang, Yuan Ji, Ming Ye, Weihua Li
PDF
rPPG-SysDiaGAN: Systolic-Diastolic Feature Localization in rPPG Using Generative Adversarial Network with Multi-Domain Discriminator Banafsheh Adami, Nima Karimian
PDF
S-ROPE: Spectral Frame Representation of Periodic Events Luis Garcia Rodriguez, Jonas Konrad, Dominik Drees, Benjamin Risse
PDF
SABER-6D: Shape Representation Based Implicit Object Pose Estimation Shishir Reddy Vutukur, Mengkejiergeli Ba, Benjamin Busam, Matthias Kayser, Gurprit Singh
PDF
Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents Tristan Gottwald, Maximilian Schier, Bodo Rosenhahn
PDF
San Vitale Challenge: Automatic Reconstruction of Ancient Colored Glass Windows Nicolò Di Domenico, Guido Borghi, Annalisa Franco, Marco Boschetti, Federica Giacomini, Sebastian Barzaghi, Silvia Ferucci, Simone Zambruno, Lorenzo Mularoni, Qiong Gao, Chenyue Che, Guoxin Li, Yanyan Zu, Jiayao Hao, Junpei Zhang, Ákos Dúcz, Levente Gego, Klevis Imeri, Viktória Nemkin, Azam Rakhmatillaev, Soma Szatmári, William Rowan
PDF
Sanity Checks for Explanation Uncertainty Matias Valdenegro-Toro, Mihir Mulye
PDF
Satellite Image Dehazing via Masked Image Modeling and Jigsaw Transformation Guisik Kim, Choongsang Cho, Junseok Kwon
PDF
SC-Track: State Transition and Constrained Non-Negative Matrix Factorization for Multi-Camera Multi-Target Tracking Xiaolong Yang, Xuting Duan, Jianshan Zhou, Chunmian Lin, Xu Han
PDF
Scalable Indoor Novel-View Synthesis Using Drone-Captured 360 Imagery with 3D Gaussian Splatting Yuanbo Chen, Chengyu Zhang, Jason Wang, Xuefan Gao, Avideh Zakhor
PDF
Scaling up Resonate-and-Fire Networks for Fast Deep Learning Thomas E. Huber, Jules Lecomte, Borislav Polovnikov, Axel von Arnim
PDF
ScanDDM: Generalised Zero-Shot Neuro-Dynamical Modelling of Goal-Directed Attention Alessandro D'Amelio, Manuele Lucchi, Giuseppe Boccignone
PDF
Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation Francisco Eiras, Kemal Oksuz, Adel Bibi, Philip H. S. Torr, Puneet K. Dokania
PDF
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Tommaso Apicella, Alessio Xompero, Paolo Gastaldo, Andrea Cavallaro
PDF
Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method Hongjun Choi, Dongbin Na, Kyungjin Cho, Byunguk Bae, Seo Taek Kong, Hyunjoon Ahn, Sungchul Choi, Jaeyoung Kim
PDF
Self-Supervised Disentangled Representation Learning of Artistic Style Through Neural Style Transfer Dan Ruta, Gemma Canet Tarres, Alexander Black, Andrew Gilbert, John P. Collomosse
PDF
Self-Supervised HDR Imaging from Motion and Exposure Cues Michal Nazarczuk, Sibi Catley-Chandar, Ales Leonardis, Eduardo Pérez-Pellitero
PDF
Self-Supervised Models Are Strong Industrial Few-Shot Defect Classification Learners Teng Yang, Pengcheng Gao, Jinbao Wang, Yongliang Tang
PDF
Self-Supervised Road Accident Anticipation with Non-Decreasing Danger Aurel Pjetri, Davide Abbondandolo, Douglas Coimbra de Andrade, Stefano Caprasecca, Francesco Sambo, Andrew D. Bagdanov
PDF
Semantic Segmentation of Benthic Classes in Reef Environments Using a Large Vision Transformer Charlotte Sertic, Jonathan Sauder, Devis Tuia
PDF
Sequential PatchCore: Anomaly Detection for Surface Inspection Using Synthetic Impurities Runzhou Mao, Juraj Fulir, Christoph Garth, Petra Gospodnetic
PDF
Similar Paintings Retrieval from Individual and Multiple Poses Adrien Deliège, Maria Giulia Dondero
PDF
Single Image 3D Human Pose Estimation Using Sequential Joint Group Generation Szymon Lisowski, Peter Hardy, Hansung Kim
PDF
Skeleton-Aware Motion Retargeting Using Masked Pose Modeling Giulia Martinelli, Nicola Garau, Niccolò Bisagno, Nicola Conci
PDF
SkelFormer: Markerless 3D Pose and Shape Estimation Using Skeletal Transformers Vandad Davoodnia, Saeed Ghorbani, Alexandre Messier, Ali Etemad
PDF
Sketch & Paint: Stroke-by-Stroke Evolution of Visual Artworks Jeripothula Prudviraj, Vikram Jamwal
PDF
Smoothing Predictions of Multi-Task EmotiNet Models for Compound Facial Expression Recognition Andrey V. Savchenko
PDF
Solving Inverse Problem with Unspecified Forward Operator Using Diffusion Models Jialing Zhang, Chongxuan Li, Dequan Wang
PDF
SOOD-ImageNet: A Large-Scale Dataset for Semantic Out-of-Distribution Image Classification and Semantic Segmentation Alberto Bacchin, Davide Allegro, Stefano Ghidoni, Emanuele Menegatti
PDF
Source-Free Domain Adaptation for YOLO Object Detection Simon Varailhon, Masih Aminbeidokhti, Marco Pedersoli, Eric Granger
PDF
Sources of Uncertainty in 3D Scene Reconstruction Marcus Klasson, Riccardo Mereu, Juho Kannala, Arno Solin
PDF
Soybean Pod and Seed Counting in Both Outdoor Fields and Indoor Laboratories Using Unions of Deep Neural Networks Tianyou Jiang, Mingshun Shao, Tianyi Zhang, Xiaoyu Liu, Qun Yu
PDF
Space3D-Bench: Spatial 3D Question Answering Benchmark Emilia Szymanska, Mihai Dusmanu, Jan-Willem Buurlage, Mahdi Rad, Marc Pollefeys
PDF
SplatPose+: Real-Time Image-Based Pose-Agnostic 3D Anomaly Detection Yizhe Liu, Yan Song Hu, Yuhao Chen, John S. Zelek
PDF
SQUAD: Scalar Quantized Representation Learning for Unsupervised Anomaly Detection and Localization Shih-Chih Lin, Shang-Hong Lai
PDF
SR-VQA: Super-Resolution Video Quality Assessment Model Yuqin Cao, Wei Sun, Weixia Zhang, Yinan Sun, Ziheng Jia, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai
PDF
Storytelling Video Generation with Retrieval Augmentation and Character Consistency Yingqing He, Menghan Xia, Haoxin Chen, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen
PDF
StreamLTS: Query-Based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection Yunshuang Yuan, Monika Sester
PDF
Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers Martín Méndez, Pau Torras, Adrià Molina, Jialuo Chen, Oriol Ramos Terrades, Alicia Fornés
PDF
SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Stefanie Speidel
PDF
Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization Rohan Reddy Mekala, Frederik Pahde, Simon Baur, Sneha Chandrashekar, Madeline Diep, Markus Wenzel, Eric L. Wisotzky, Galip Ümit Yolcu, Sebastian Lapuschkin, Jackie Ma, Peter Eisert, Mikael Lindvall, Adam A. Porter, Wojciech Samek
PDF
Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition Parsa Rahimi, Behrooz Razeghi, Sébastien Marcel
PDF
SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl, Mauricio Murillo, Keyhan Najafian, Farhad Maleki
PDF
TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection Xixi Liu, Christopher Zach
PDF
Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding Yuhang Liu, Boyi Sun, Yishuo Wang, Jing Yang, Xingxia Wang, Fei-Yue Wang
PDF
TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans Aggelina Chatziagapi, Bindita Chaudhuri, Amit Kumar, Rakesh Ranjan, Dimitris Samaras, Nikolaos Sarafianos
PDF
Target-Oriented Object Grasping via Multimodal Human Guidance Pengwei Xie, Siang Chen, Yixiang Dai, Dingchang Hu, Kaiqin Yang, Guijin Wang
PDF
Task-Specific Adaptation of Segmentation Foundation Model via Prompt Learning Hyung-Il Kim, Kimin Yun, Jun-Seok Yun, Yuseok Bae
PDF
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection Hanning Chen, Wenjun Huang, Yang Ni, Sanggeon Yun, Yezi Liu, Fei Wen, Alvaro Velasquez, Hugo Latapie, Mohsen Imani
PDF
TASOD: A Data Collection for Tiny and Small Object Detection Lars Fichtel, Dominik Erbacher, Dennis Grünwald, Leon Heller, Christian Bachmeir, Radu Timofte
PDF
Temporal-Consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting Andrea Marelli, Luca Magri, Federica Arrigoni, Giacomo Boracchi
PDF
Textualized and Feature-Based Models for Compound Multimodal Emotion Recognition in the Wild Nicolas Richet, Soufiane Belharbi, Haseeb Aslam, Meike Emilie Schadt, Manuela González-González, Gustave Cortal, Alessandro Lameiras Koerich, Marco Pedersoli, Alain Finkel, Simon Bacon, Eric Granger
PDF
TF-OCM: Training-Free Optimal Community Matching for Domain Generalized Few-Shot Learning Ahmed Radwan, Mohamed Shehata
PDF
The BRAVO Semantic Segmentation Challenge Results in UNCV2024 Tuan-Hung Vu, Eduardo Valle, Andrei Bursuc, Tommie Kerssies, Daan de Geus, Gijs Dubbelman, Long Qian, Bingke Zhu, Yingying Chen, Ming Tang, Jinqiao Wang, Tomás Vojír, Jan Sochman, Jirí Matas, Michael Smith, Frank P. Ferrie, Shamik Basu, Christos Sakaridis, Luc Van Gool
PDF
The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition Andrea Atzori, Pietro Cosseddu, Gianni Fenu, Mirko Marras
PDF
The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models Simone Caldarella, Massimiliano Mancini, Elisa Ricci, Rahaf Aljundi
PDF
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives Èric Sanchez, Adrià Molina, Oriol Ramos Terrades
PDF
The Second Visual Object Tracking Segmentation VOTS2024 Challenge Results Matej Kristan, Jirí Matas, Pavel Tokmakov, Michael Felsberg, Luka Cehovin Zajc, Alan Lukezic, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Hyung Jin Chang, Gustavo Fernández, Minasadat Attari, Antoni B. Chan, Liang Chen, Xin Chen, Jaired Collins, Yutao Cui, Ganesh Sai Manas Devarapu, Yinglong Du, Heng Fan, Wan-Cyuan Fan, Zhenhua Feng, Mingqi Gao, Rama Krishna Gorthi, Raghav Goyal, Jungong Han, Bijaya Kumar Hatuwal, Zhenyu He, Xiantao Hu, Xingsen Huang, Yuqing Huang, Dongmei Jiang, Ben Kang, Kannappan Palaniappan, Josef Kittler, Simiao Lai, Ning Li, Xiaohai Li, Xin Li, Cheng Liang, Liting Lin, Haibin Ling, Ting Liu, Ziquan Liu, Huchuan Lu, Yifei Luo, Deshui Miao, Juan David Mogollon, Ziqi Pang, Jaswanth Reddy Pochimireddy, Viktor Prutyanov, Gani Rahmon, Aleksandr Romanov, Liangtao Shi, Mennatullah Siam, Leonid Sigal, Arun Kumar Sivapuram, Roman A. Solovyev, Elham Soltani Kazemi, Imad Eddine Toubal, Jia Wan, Limin Wang, Xinying Wang, Yaowei Wang, Yu-Xiong Wang, Zhiquan Wang, Gangshan Wu, Qiangqiang Wu, Xiaojun Wu, Zihao Xia, Jinxia Xie, Chenlong Xu, Tianyang Xu, Yong Xu, Chaocan Xue, Chao Yang, Jinyu Yang, Ming-Hsuan Yang, Chenyang Yu, Ke Yu, Chunhui Zhang, Jiaming Zhang, Zhipeng Zhang, Feng Zheng, Yaozong Zheng, Bineng Zhong, Jinglin Zhou, Junbao Zhou, Yong Zhou, Zikun Zhou, Guibo Zhu, Jiawen Zhu, Xuefeng Zhu, Vladimir V. Zunin
PDF
THP3D: Text-Driven Multi-Granularity 3D Human Parsing Keito Suzuki, Bang Du, Kunyao Chen, Runfa Blark Li, Truong Q. Nguyen
PDF
Time-Resolved MNIST Dataset for Single-Photon Recognition Aleksi Suonsivu, Lauri Salmela, Edoardo Peretti, Leevi Uosukainen, Radu Ciprian Bilcu, Giacomo Boracchi
PDF
ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment Hsiang-Wei Huang, Jiacheng Sun, Cheng-Yen Yang, Zhongyu Jiang, Li-Yu Huang, Jenq-Neng Hwang, Yu-Ching Yeh
PDF
TONO: A Synthetic Dataset for Face Image Compliance to ISO/ICAO Standard Guido Borghi, Annalisa Franco, Nicolò Di Domenico, Davide Maltoni
PDF
Top-GAP: Integrating Size Priors in CNNs for More Interpretability, Robustness, and Bias Mitigation Lars Nieradzik, Henrike Stephani, Janis Keuper
PDF
Towards Auto-Generated Ground Truth for Evaluation of Perception Systems in Agriculture Jan Christoph Krause, Mark Niemeyer, Janosch Bajorath, Naeem Iqbal, Joachim Hertzberg
PDF
Towards Low-Power, High-Frequency Gaze Direction Tracking with an Event-Camera Yvonne Vullers, Luna Gava, Arren Glover, Chiara Bartolozzi
PDF
Towards Motion from Video Diffusion Models Paul Janson, Tiberiu Popa, Eugene Belilovsky
PDF
Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning Yushen Zuo, Jun Xiao, Kin-Chung Chan, Rongkang Dong, Cuixin Yang, Zongqi He, Hao Xie, Kin-Man Lam
PDF
Towards Multimodal In-Context Learning for Vision and Language Models Sivan Doveh, Shaked Perek, Muhammad Jehanzeb Mirza, Wei Lin, Amit Alfassy, Assaf Arbelle, Shimon Ullman, Leonid Karlinsky
PDF
Towards Real-Time Online Egocentric Action Recognition on Smart Eyewear Riccardo Santambrogio, Federico Caspani, Greta Corti, Francesca Palermo, Simone Mentasti, Diana Trojaniello, Matteo Matteucci
PDF
Towards Resource-Aware Visual Inertial SLAM Giovanni Affatato, Marco Paracchini, Francesca Palermo, Diana Trojaniello, Tommaso Ongarello, Marco Marcon, Stefano Tubaro
PDF
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces Junrui Zhang, Jiaqi Li, Yachuan Huang, Yiran Wang, Jinghong Zheng, Liao Shen, Zhiguo Cao
PDF
Towards Unsupervised Eye-Region Segmentation for Eye Tracking Jiangfan Deng, Zhuang Jia, Zhaoxue Wang, Xiang Long, Daniel K. Du
PDF
Towards Wearable Multi-Modal Human Activity Recognition with Deep Fusion Networks Lennart Schiweck, Alaa Saleh, Cristóbal Curio
PDF
Towards Zero-Shot Camera Trap Image Categorization Jirí Vyskocil, Lukás Picek
PDF
Tracking Virtual Meetings in the Wild: Re-Identification in Multi-Participant Virtual Meetings Oriel Perl, Ido Leshem, Uria Franko, Yuval Goldman
PDF
Tracking-Assisted Object Detection with Event Cameras Ting-Kang Yen, Igor Morawski, Shusil Dangi, Kai He, Chung-Yi Lin, Jia-Fong Yeh, Hung-Ting Su, Winston H. Hsu
PDF
TrackLidFormer: A Transformer-Based Approach for Occluded Object Tracking Leon Eisemann, Kushal Narasimha, Johannes Maucher
PDF
Training and Benchmarking Leukocyte Sub-Types Classification Methods with Synthetic Images Luca Zedda, Lorenzo Putzu, Andrea Loddo, Cecilia Di Ruberto
PDF
Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection Sondos Mohamed, Walter Zimmer, Ross Greer, Ahmed Alaaeldin Ghita, Modesto Castrillón Santana, Mohan M. Trivedi, Alois Knoll, Salvatore M. Carta, Mirko Marras
PDF
TRICKY 2024 Challenge on Monocular Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez, Alex Costanzino, Fabio Tosi, Matteo Poggi, Luigi Di Stefano, Jean-Baptiste Weibel, Dominik Bauer, Doris Antensteiner, Markus Vincze, Jiaqi Li, Yachuan Huang, Junrui Zhang, Yiran Wang, Jinghong Zheng, Liao Shen, Zhiguo Cao, Ziyang Song, Zerong Wang, Ruijie Zhu, Hao Zhang, Rui Li, Jiang Wu, Xian Li, Yu Zhu, Jinqiu Sun, Yanning Zhang, Pihai Sun, Yuanqi Yao, Wenbo Zhao, Kui Jiang, Junjun Jiang, Mykola Lavreniuk, Pengzhi Li, Jui-Lin Wang
PDF
UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment Vlad Hosu, Lorenzo Agnolucci, Oliver Wiedemann, Daisuke Iso, Dietmar Saupe
PDF
Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO Julian Moosmann, Pietro Bonazzi, Yawei Li, Sizhen Bian, Philipp Mayer, Luca Benini, Michele Magno
PDF
Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Hanieh Shojaei Miandashti, Qianqian Zou, Max Mehltretter
PDF
Underwater Uncertainty: A Multi-Annotator Image Dataset for Benthic Habitat Classification Galadrielle Humblot-Renaux, Anders Skaarup Johansen, Jonathan Eichild Schmidt, Amanda Frederikke Irlind, Niels Madsen, Thomas B. Moeslund, Malte Pedersen
PDF
Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist, Deblina Bhattacharjee, Bahar Aydemir, Baran Ozaydin, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk
PDF
Unsupervised Anomaly Segmentation at High Resolution with Patch-Divide-and-Conquer and Self-Ensembling Hendrik Meininger, Radu Timofte
PDF
Unsupervised Tomato Split Anomaly Detection Using Hyperspectral Imaging and Variational Autoencoders Mahmoud Abdulsalam, Usman A. Zahidi, Bradley Hurst, Simon Pearson, Grzegorz Cielniak, James Brown
PDF
Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods Anali Alfaro, Ivan Sipiran
PDF
Unveiling Visual Biases in Audio-Visual Localization Benchmarks Liangyu Chen, Zihao Yue, Boshen Xu, Qin Jin
PDF
Upper-Body Pose-Based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection Andrea Toaiari, Vittorio Murino, Marco Cristani, Cigdem Beyan
PDF
Utilizing Class-Agnostic Point-to-Box Regressors as Object Proposal Generators Gulin Tufekci Dogan, Ramazan Gokberk Cinbis, Ilkay Ulusoy
PDF
UTrack: Multi-Object Tracking with Uncertain Detections Edgardo Solano-Carrillo, Felix Sattler, Antje Alex, Alexander Klein, Bruno Pereira Costa, Ángel Bueno Rodríguez, Jannis Stoppe
PDF
V2X-Based Decentralized Singular Value Decomposition in Dynamic Vehicular Environment Jianxin Zhao, Min-Bin Lin, Alexey Vinel
PDF
Valeo4Cast: A Modular Approach to End-to-End Forecasting Yihong Xu, Éloi Zablocki, Alexandre Boulch, Gilles Puy, Mickaël Chen, Florent Bartoccioni, Nermin Samet, Oriane Siméoni, Spyros Gidaris, Tuan-Hung Vu, Andrei Bursuc, Eduardo Valle, Renaud Marlet, Matthieu Cord
PDF
Variable Resolution Improves Visual Question Answering Under a Limited Pixel Budget Andrey Gizdov, Shimon Ullman, Daniel Harari
PDF
VATE: A Large Scale Multimodal Spontaneous Dataset for Affective Evaluation Francesco Agnelli, Giuliano Grossi, Alessandro D'Amelio, Marco De Paoli, Raffaella Lanzarotti
PDF
Vibration Vision: Real-Time Machinery Fault Diagnosis with Event Cameras Muhammad Aitsam, Gaurvi Goyal, Chiara Bartolozzi, Alessandro G. Di Nuovo
PDF
VICooper: A Practical Vehicle-Infrastructure Cooperative Perception Framework for Autonomous Driving Shaowu Zheng, Ming Ye, Yuan Ji, Ruyi Huang, Weihua Li
PDF
Video Editing for Video Retrieval Bin Zhu, Kevin Flanagan, Adriano Fragomeni, Michael Wray, Dima Damen
PDF
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet Soon Yau Cheong, Armin Mustafa, Andrew Gilbert
PDF
Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods Adam Phillips, Daniel Grandes Rodriguez, Miriam Sánchez-Manzano, Alan Salvadó, Manuel Garin, Gloria Haro, Coloma Ballester
PDF
VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis Donggoo Kang, Dasol Jeong, Hyunmin Lee, Sangwoo Park, Hasil Park, Sunkyu Kwon, Yeong-Joon Kim, Joonki Paik
PDF
VQA-Driven Facet-Level Texture Segmentation in 3D Surfaces Iyyakutti Iyappan Ganapathi, Sajid Javed, Syed Sadaf Ali, Mohamad Alansari, Said Boumaraf, Naoufel Werghi
PDF
VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla
PDF
Watt for What: Rethinking Deep Learning's Energy-Performance Relationship Shreyank N. Gowda, Xinyue Hao, Gen Li, Shashank Narayana Gowda, Xiaobo Jin, Laura Sevilla-Lara
PDF
Well Begun Is Half Done: The Importance of Initialization in Dataset Distillation Yiran Guan, Zhu Chen, Xingkui Zhu, Dingkang Liang, Yuliang Liu, Xiang Bai
PDF
What Could Go Wrong? Discovering and Describing Failure Modes in Computer Vision Gabriela Csurka, Tyler L. Hayes, Diane Larlus, Riccardo Volpi
PDF
What Makes a Face Look like a Hat: Decoupling Low-Level and High-Level Visual Properties with Image Triplets Maytus Piriyajitakonkij, Sirawaj Itthipuripat, Ian C. Ballard, Ioannis Pappas
PDF
What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon Utkarsh Tiwari, Snehashis Majhi, Michal Balazia, François Brémond
PDF
What's Wrong with the Absolute Trajectory Error? Seong Hun Lee, Javier Civera
PDF
When the Small-Loss Trick Is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections Keryan Chelouche, Marie Lachaize, Marine Bernard, Louise Olgiati, Rémi Cuingnet
PDF
Wild Berry Image Dataset Collected in Finnish Forests and Peatlands Using Drones Luigi Riz, Sergio Povoli, Andrea Caraffa, Davide Boscaini, Mohamed Lamine Mekhalfi, Paul Chippendale, Marjut Turtiainen, Birgitta Partanen, Laura Smith Ballester, Juan Fco. Blanes Noguera, Alessio Franchi, Elisa Castelli, Giacomo Piccinini, Luca Marchesotti, Micael Santos Couceiro, Fabio Poiesi
PDF
WildFusion: Individual Animal Identification with Calibrated Similarity Fusion Vojtech Cermák, Lukás Picek, Lukás Adam, Lukás Neumann, Jirí Matas
PDF
XAI-Guided Insulator Anomaly Detection for Imbalanced Datasets Maximilian Andreas Hoefler, Karsten Müller, Wojciech Samek
PDF
xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong
PDF
YCB-Ev 1.1: Event-Vision Dataset for 6DoF Object Pose Estimation Pavel Rojtberg, Thomas Pöllabauer
PDF
Your Diffusion Model Is an Implicit Synthetic Image Detector Xi Wang, Vicky Kalogeiton
PDF
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer Hiroki Azuma, Yusuke Matsui, Atsuto Maki
PDF
μgat: Improving Single-Page Document Parsing by Providing Multi-Page Context Fabio Quattrini, Carmine Zaccagnino, Silvia Cascianelli, Laura Righi, Rita Cucchiara
PDF