CVPRW 2025 659 papers
3D Face Reconstruction from Radar Images
Valentin Braeutigam, Vanessa Wirth, Ingrid Ullmann, Christian Schüßler, Martin Vossiek, Matthias Berking, Bernhard Egger 3rd Multi-Modal Aerial View Image Challenge: Sensor Domain Translation - PBVS 2025
Dylan Bowald, Justice Wheelwright, Oliver Nina, Ángel D. Sappa, Riad I. Hammoud, Erik Blasch, Nathan Inkawhich 4th Multi-Modal Aerial View Image Challenge: SAR Classification - PBVS 2025
Nathan Inkawhich, Claire Thorp, Justice Wheelwright, Oliver Nina, Dylan Bowald, Ángel D. Sappa, Erik Blasch A Lightweight Moment Retrieval System with Global Re-Ranking and Robust Adaptive Bidirectional Temporal Search
Tinh-Anh Nguyen-Nhu, Huu-Loc Tran, Nguyen-Khang Le, Minh-Nhat Nguyen, Tien-Huy Nguyen, Hoang-Long Nguyen-Huu, Huu-Phong Phan-Nguyen, Huy-Thach Pham, Quan Nguyen, Hoang M. Le, Quang-Vinh Dinh A Simple Detector with Frame Dynamics Is a Strong Tracker
Chenxu Peng, Chenxu Wang, Minrui Zou, Danyang Li, Zhengpeng Yang, Yimian Dai, Ming-Ming Cheng, Xiang Li A True Hyperspectral Image Super-Resolution Dataset
Alexander Ulrichsen, Thomas De Kerf, David Dunphy, Paul Murray, Steve Vanlanduit, Stephen Marshall Action Anticipation from SoccerNet Football Video Broadcasts
Mohamad Dalal, Artur Xarles, Anthony Cioppa, Silvio Giancola, Marc Van Droogenbroeck, Bernard Ghanem, Albert Clapés, Sergio Escalera, Thomas B. Moeslund Action Valuation in Sports: A Survey
Artur Xarles, Sergio Escalera, Thomas B. Moeslund, Albert Clapés ADAPTOR: Adaptive Token Reduction for Video Diffusion Transformers
Elia Peruzzo, Adil Karjauv, Nicu Sebe, Amir Ghodrati, AmirHossein Habibian AdaVid: Adaptive Video-Language Pretraining
Chaitanya Patel, Juan Carlos Niebles, Ehsan Adeli Advancements in Affective and Behavior Analysis: The 8th ABAW Workshop and Competition
Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Stefanos Zafeiriou, Irene Kotsia, Eric Granger, Marco Pedersoli, Simon Bacon, Alice Baird, Chris Gagne, Chunchang Shao, Guanyu Hu, Soufiane Belharbi, Muhammad Haseeb Aslam Advancing Ambient Lighting Normalization via Diffusion Shadow Generation
Xin Lu, Jiarong Yang, Yuanfei Bao, Zihao Fan, Anya Hu, Kunyu Wang, Jie Xiao, Xi Wang, Hongjian Liu, Xueyang Fu, Zheng-Jun Zha Aerial Infrared Health Monitoring of Solar Photovoltaic Farms at Scale
Isaac Corley, Conor Wallace, Sourav Agrawal, Burton Putrah, Jonathan Lwowski An Empirical Study for Efficient Video Quality Assessment
Wei Sun, Kang Fu, Linhan Cao, Dandan Zhu, Kaiwei Zhang, Yucheng Zhu, Zicheng Zhang, Menghan Hu, Xiongkuo Min, Guangtao Zhai An End-to-End Pipeline for Virtual Banner Replacement in Football Broadcasts
Victor Gaspar, Anthony Cioppa, Jan Held, Silvio Giancola, Marc Braham, Adrien Deliège, Bernard Ghanem, Marc Van Droogenbroeck An Interactive Agent Foundation Model
Zane Durante, Ran Gong, Bidipta Sarkar, Naoki Wake, Rohan Taori, Paul Tang, Shrinidhi Kowshika Lakshmikanth, Kevin A. Schulman, Arnold Milstein, Hoi Vo, Ehsan Adeli, Demetri Terzopoulos, Li Fei-Fei, Jianfeng Gao An LLM-Enabled Multi-Agent Autonomous Mechatronics Design Framework
Zeyu Wang, Frank Po Wen Lo, Qian Chen, Yongqi Zhang, Chen Lin, Xu Chen, Zhenhua Yu, Alexander J. Thompson, Eric M. Yeatman, Benny P. L. Lo Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders
Matthew Lyle Olson, Musashi Hinck, Neale Ratzlaff, Changbai Li, Phillip Howard, Vasudev Lal, Shao-Yen Tseng AppleGrowthVision: A Large-Scale Stereo Dataset for Phenological Analysis, Fruit Detection, and 3D Reconstruction in Apple Orchards
Laura von Hirschhausen, Jannes S. Magnusson, Mykyta Kovalenko, Fredrik Boye, Tanay Rawat, Peter Eisert, Anna Hilsmann, Sebastian Pretzsch, Sebastian Bosse ARDGen: Augmentation Regularization for Domain-Generalized Medical Report Generation
Syed Bilal Ahsan, Muhammad Ikhalas, Muhammad Muzamil Khan, Sana Ullah, Muhammad Zaigham Zaheer Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition
Sergio Romero-Tapiador, Ruben Tolosana, Blanca Lacruz-Pleguezuelos, Laura Judith Marcos-Zambrano, Guadalupe X. Bazán, Isabel Espinosa-Salinas, Julian Fierrez, Javier Ortega-Garcia, Enrique Carrillo de Santa Pau, Aythami Morales Attacking Attention of Foundation Models Disrupts Downstream Tasks
Hondamunige Prasanna Silva, Federico Becattini, Lorenzo Seidenari Attention-Aware Temporal Adversarial Shadows on Traffic Sign Sequences
Pedram MohajerAnsari, Amir Salarpour, David Fernandez, Cigdem Kokenoz, Bing Li, Mert D. Pesé Autonomous Multimodal Reasoning via Implicit Chain-of-Vision
Yiqiao Huang, Qi He, Zhaorun Chen, Haopeng Zhang, Hanchao Yu, Zhuokai Zhao Benchmarking Multi-Modal Semantic Segmentation Under Sensor Failures: Missing and Noisy Modality Robustness
Chenfei Liao, Kaiyu Lei, Xu Zheng, Junha Moon, Zhixiong Wang, Yixuan Wang, Danda Pani Paudel, Luc Van Gool, Xuming Hu Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model
Lu Xu, Sijie Zhu, Chunyuan Li, Chia-Wen Kuo, Fan Chen, Xinyao Wang, Guang Chen, Dawei Du, Ye Yuan, Longyin Wen BiasBench: A Reproducible Benchmark for Tuning the Biases of Event Cameras
Andreas Ziegler, David Joseph, Thomas Gossard, Emil Moldovan, Andreas Zell CaddieSet: A Golf Swing Dataset with Human Joint Features and Ball Information
Seunghyeon Jung, Seoyoung Hong, Jiwoo Jeong, Seungwon Jeong, Jaerim Choi, Hoki Kim, Woojin Lee Can Geometry Save Central Views for Sports Field Registration?
Floriane Magera, Thomas Hoyoux, Martin Castin, Olivier Barnich, Anthony Cioppa, Marc Van Droogenbroeck CityGen: Infinite and Controllable City Layout Generation
Jie Deng, Wenhao Chai, Jianshu Guo, Qixuan Huang, Junsheng Huang, Wenhao Hu, Shengyu Hao, Jenq-Neng Hwang, Gaoang Wang Classification Drives Geographic Bias in Street Scene Segmentation
Rahul Nair, Bhanu Tokas, Gabriel Tseng, Esther Rolf, Hannah Kerner Cite
CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD mAP Updates
Ankit Kumar Shaw, Kun Jiang, Tuopu Wen, Chandan Kumar Sah, Yining Shi, Mengmeng Yang, Diange Yang, Xiaoli Lian Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation
Li Zhong, Ahmed Ghazal, Jun-Jun Wan, Frederik Zilly, Patrick Mackens, Joachim E. Vollrath, Bogdan Sorin Coseriu CLIPDraw++: Text-to-Sketch Synthesis with Simple Primitives
Nityanand Mathur, Shyam Marjit, Abhra Chaudhuri, Anjan Dutta Comparison Visual Instruction Tuning
Wei Lin, Muhammad Jehanzeb Mirza, Sivan Doveh, Rogério Feris, Raja Giryes, Sepp Hochreiter, Leonid Karlinsky Compressed Domain Multiframe Processing
Chengyu Wang, Jing Li, Saurabh Kumar, Seok-Jun Lee, Hamid R. Sheikh CondiMen: Conditional Multi-Person Mesh Recovery
Romain Brégier, Fabien Baradel, Thomas Lucas, Salma Galaaoui, Matthieu Armando, Philippe Weinzaepfel, Grégory Rogez conSAMme: Achieving Consistent Segmentations with SAM
Josh Myers-Dean, Kangning Liu, Brian L. Price, Yifei Fan, Jason Kuen, Danna Gurari Coordinated Robustness Evaluation Framework for Vision-Language Models
Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Sahand Ghorbanpour, Avisek Naug, Antonio Guillen, Ricardo Luna, Soumyendu Sarkar COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails
Miguel Espinosa, Valerio Marsocci, Yuru Jia, Elliot Crowley, Mikolaj Czerkawski Cross-Modal Consistency Learning for Sign Language Recognition
Kepeng Wu, Zecheng Li, Weichao Zhao, Hezhen Hu, Wengang Zhou, Houqiang Li Cycle Training with Semi-Supervised Domain Adaptation: Bridging Accuracy and Efficiency for Real-Time Mobile Scene Detection
Huu-Phong Phan-Nguyen, Anh Dao, Tien-Huy Nguyen, Tuan Quang, Huu-Loc Tran, Tinh-Anh Nguyen-Nhu, Huy-Thach Pham, Quan Nguyen, Hoang M. Le, Quang-Vinh Dinh CYFLOD: Cyclic Filtering and Loss Damping for Alleviating Noisy Labels in Fine-Grained Visual Classification
Nauman Ullah Gilal, Khaled A. Al-Thelaya, Fahad Majeed, Zhihe Lu, Sabri Boughorbel, Jens Schneider, Marco Agus CytoFM: The First Cytology Foundation Model
Vedrana Ivezic, Ashwath Radhachandran, Ekaterina Redekop, Shreeram Athreya, Dongwoo Lee, Vivek Sant, Corey W. Arnold, William Speier Data Scaling Laws for End-to-End Autonomous Driving
Alexander Naumann, Xunjiang Gu, Tolga Dimlioglu, Mariusz Bojarski, Alperen Degirmenci, Alexander Popov, Devansh Bisla, Marco Pavone, Urs Muller, Boris Ivanovic Datasets for Valence and Arousal Inference: A Survey
Helen Schneider, Svetlana Pavlitska, Helen Gremmelmaier, Marius Zöllner Decoding Vision Transformers: The Diffusion Steering Lens
Ryota Takatsuki, Sonia Joseph, Ippei Fujisawa, Ryota Kanai Decomposing Food Images for Better Nutrition Analysis: A Nutritionist-Inspired Two-Step Multimodal LLM Approach
Pitikorn Khlaisamniang, Kun Kerdthaisong, Supasate Vorathammathorn, Nutchanon Yongsatianchot, Hirunkul Phimsiri, Amrest Chinkamol, Teermade Thitseesaeng, Kanyakorn Veerakanjana, Kaisorn Kachai, Piyalitt Ittichaiwong, Tossaporn Saengja Defurnishing with X-Ray Vision: Joint Removal of Furniture from Panoramas and Mesh
Alan Dolhasz, Chen Ma, Dave Gausebeck, Kevin Chen, Gregor Miller, Lucas Hayne, Gunnar Hovden, Azwad Sabik, Olaf Brandt, Mira Slavcheva Detecting Looted Archaeological Sites from Satellite Image Time Series
Elliot Vincent, Mehraïl Saroufim, Jonathan Chemla, Yves Ubelmann, Philippe Marquis, Jean Ponce, Mathieu Aubry Detection and Localization of Drones and UAVs Using Sound and Vision
Erik Tegler, Max Modig, Per Skarin, Kalle Åström, Magnus Oskarsson, Gabrielle Flood Dist-Tracker: A Small Object-Aware Detector and Tracker for UAV Tracking
Wenzhen Wang, Jing Fu, Jiayi Song, Kaiyu Li, Hui Qiao, Jiang Liu, Hao Sun, Xiangyong Cao Distilling Normalizing Flows
Steven Walton, Valeriy Klyukin, Maksim Artemev, Denis Derkach, Nikita Orlov, Humphrey Shi Distribution Shifts at Scale: Out-of-Distribution Detection in Earth Observation
Burak Ekim, Girmaw Abebe Tadesse, Caleb Robinson, Gilles Quentin Hacheme, Michael Schmitt, Rahul Dodhia, Juan M. Lavista Ferres Domain Adaptation of VLM for Soccer Video Understanding
Tiancheng Jiang, Henry Wang, Md Sirajus Salekin, Parmida Atighehchian, Shinan Zhang Drive4C: A Closed-Loop Benchmark on What Foundation Models Really Need to Be Capable of for Language-Guided Autonomous Driving
Tin Stribor Sohn, Maximilian Dillitzer, Johannes Bach, Jason J. Corso, Tim Brühl, Robin Schwager, Tim Dieter Eberhardt, Eric Sax Dyadic Mamba: Long-Term Dyadic Human Motion Synthesis
Julian Tanke, Takashi Shibuya, Kengo Uchida, Koichi Saito, Yuki Mitsufuji Dynamic Watermarks in Images Generated by Diffusion Models
Yunzhuo Chen, Jordan Vice, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian E-BARF: Bundle Adjusting Neural Radiance Fields from a Moving Event Camera
Zhipeng Tang, Shifan Zhu, Zezhou Cheng, Donghyun Kim, Erik G. Learned-Miller Efficient 2D to Full 3D Human Pose Uplifting Including Joint Rotations
Katja Ludwig, Yuliia Oksymets, Robin Schön, Daniel Kienzle, Rainer Lienhart Efficient Burst Super-Resolution with One-Step Diffusion
Kento Kawai, Takeru Oba, Kyotaro Tokoro, Kazutoshi Akita, Norimichi Ukita Efficient Image Generation with Variadic Attention Heads
Steven Walton, Ali Hassani, Xingqian Xu, Zhangyang Wang, Humphrey Shi Efficient VideoMAE via Temporal Progressive Training
Xianhang Li, Peng Wang, Xinyu Li, Heng Wang, Hongru Zhu, Cihang Xie Emotions in LatAm: A New Dataset and Benchmark for Emotion Recognition in Latin America
Pooja Kishore Kumar, Willams de Lima Costa, Renato Nogueira Ferraz e Oliveira, Veronica Teichrieb, Estefania Talavera Martínez Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution
Yiwen Wang, Ying Liang, Yuxuan Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song Enhancing Vision Transformer Explainability Using Artificial Astrocytes
Nicolas Echevarrieta-Catalan, Ana Ribas-Rodriguez, Francisco Cedron, Odelia Schwartz, Vanessa Aguiar-Pulido ePBR: Extended PBR Materials in Image Synthesis
Yu Guo, Zhiqiang Lao, Xiyun Song, Yubin Zhou, Zongfang Lin, Heather Yu EV-LayerSegNet: Self-Supervised Motion Segmentation Using Event Cameras
Youssef Farah, Federico Paredes-Vallés, Guido de Croon, Muhammad Ahmed Humais, Hussain M. Sajwani, Yahya H. Zweiri EvenFormer: Dynamic Even Transformer for Real-World Image Restoration
Xin Lu, Yuanfei Bao, Jiarong Yang, Anya Hu, Jie Xiao, Kunyu Wang, Dong Li, Senyan Xu, Kean Liu, Xueyang Fu, Zheng-Jun Zha Event-Based Continuous Color Video Decompression from Single Frames
Ziyun Wang, Friedhelm Hamann, Kenneth Chaney, Wen Jiang, Guillermo Gallego, Kostas Daniilidis Event-Based Eye Tracking. Even-Based Vision Workshop 2025
Qinyu Chen, Chang Gao, Min Liu, Daniele Perrone, Yan Ru Pei, Zuowen Wang, Zhuo Zou, Shihang Tan, Tao Han, Guorui Lu, Zhen Xu, Junyuan Ding, Ziteng Wang, Zongwei Wu, Han Han, Yuliang Wu, Jinze Chen, Wei Zhai, Yang Cao, Zhengjun Zha, Nuwan Bandara, Thivya Kandappu, Archan Misra, Xiaopeng Lin, Hongxiang Huang, Hongwei Ren, Bojun Cheng, Hoang M. Truong, Vinh-Thuan Ly, Huy G. Tran, Thuan-Phat Nguyen, Tram T. Doan Event-Conditioned Dual-Modal Fusion for Motion Deblurring
Kean Liu, Mingchen Zhong, Senyan Xu, Zhijing Sun, Jiaying Zhu, Chengjie Ge, Xingbo Wang, Xin Lu, Xueyang Fu, Zheng-Jun Zha ExaM: Unsupervised Concept-Based Representation Learning to Better Explain Models in Vision Tasks
Maguelonne Heritier, Djebril Mekhazni, Cédric Leblond-Ménard, Benoit Godbout, Nathan Guilbaud, Mahdi Alehdaghi, Eric Granger Exemplar Masking for Multimodal Incremental Learning
Yi-Lun Lee, Chen-Yu Lee, Wei-Chen Chiu, Yi-Hsuan Tsai Expanded SPAN for Efficient Super-Resolution
Qing Wang, Yang Wang, Hongyu An, Yi Liu, Liou Zhang, Shijie Zhao Explaining 3D Point Cloud Semantic Segmentation Models Through Adversarial Attacks
Jorge Francisco Ciprián-Sánchez, Josafat-Mattias Burmeister, Rico Richter, Jürgen Döllner Exploring Missing Modality in Multimodal Egocentric Datasets
Merey Ramazanova, Alejandro Pardo, Humam Alwassel, Bernard Ghanem Exploring Modality Guidance to Enhance VFM-Based Feature Fusion for UDA in 3D Semantic Segmentation
Johannes Spöcklberger, Wei Lin, Pedro Hermosilla, Sivan Doveh, Horst Possegger, Muhammad Jehanzeb Mirza Exploring Semi-Supervised Learning for Online Mapping
Adam Lilja, Erik Wallin, Junsheng Fu, Lars Hammarstrand Exploring Temporal Dynamics in Event-Based Eye Tracker
Hongxiang Huang, Xiaopeng Lin, Hongwei Ren, Yue Zhou, Bojun Cheng Extra-Lightweight AI-Based Privacy Preserving Framework for Egocentric Wearable Cameras
Long Li, Fengqing Zhu, Heather A. Eicher-Miller, J. Graham Thomas, Yuning Huang, Edward Sazonov Eyes Tell the Truth: GazeVal Highlights Shortcomings of Generative AI in Medical Imaging
David C. Wong, Bin Wang, Gorkem Durak, Marouane Tliba, Akshay Chaudhari, Aladine Chetouani, Ahmet Enis Çetin, Cagdas Topel, Nicolo Gennaro, Camila Lopes Vendrami, Tugce Agirlar Trabzonlu, Amir Ali Rahsepar, Laetitia Perronne, Matthew Antalek, Onural Ozturk, Gokcan Okur, Andrew C. Gordon, Ayis Pyrros, Frank H. Miller, Amir Borhani, Hatice Savas, Eric M. Hart, Drew A. Torigian, Jayaram K. Udupa, Elizabeth A. Krupinski, Ulas Bagci Fast Sphericity and Roundness Approximation in 2D and 3D Using Local Thickness
Pawel Tomasz Pieta, Peter Winkel Rasmussen, Anders Bjorholm Dahl, Anders Nymark Christensen FCTFANet: A Fused CNN-Transformer Feature Aggregator Network for Image Restoration
Amit Monga, Hemkant Nehete, Partha Kaushik, Tharun Kumar Reddy Bollu, Balasubramanian Raman, Gaurav Sharma Few-Shot Adaptation of Grounding DINO for Agricultural Domain
Rajhans Singh, Rafael Bidese-Puhl, Kshitiz Dhakal, Sudhir Sornapudi FieldMOT: A Field-Registered Multi-Object Tracking for Sports Videos
Hong-Qi Chen, Chao-Chi Liao, Yuan-Heng Sun, Cheng-Kuan Lin, Yu-Chee Tseng FLAR-SVD: Fast and Latency-Aware Singular Value Decomposition for Model Compression
Moritz Thoma, Jorge Villasante, Emad Aghajanzadeh, Shambhavi Balamuthu Sampath, Pierpaolo Morì, Maximilian Groetzinger, Daniil Dylkin, Manoj Rohit Vemparala, Nael Fasfous, Alexander Frickenstein, Daniel Mueller-Gritschneder, Ulf Schlichtmann Food Degradation Analysis Using Multimodal Fuzzy Clustering
Julio J. Valdés, Stephie Liu, Shawn Yang, Yuhao Chen, Alexander Wong, Pengcheng Xi FoodVideoQA: A Novel Baseline Framework for Dietary Monitoring
Krish Shah, Siddharth Viswanath, Pengcheng Xi, Alexander Wong, Yuhao Chen ForesightNav: Learning Scene Imagination for Efficient Exploration
Hardik Shah, Jiaxu Xing, Nico Messikommer, Boyang Sun, Marc Pollefeys, Davide Scaramuzza Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization
Darryl Hannan, John Cooper, Dylan White, Timothy Doster, Henry Kvinge, Yijing Watkins Frequency-Prior Enhanced Ambient Lighting Normalization via Visual Perceptual Refinement
Yuanfei Bao, Xin Lu, Xingbo Wang, Jiarong Yang, Anya Hu, Kunyu Wang, Jie Xiao, Dong Li, Xueyang Fu, Zheng-Jun Zha From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction
Vladimir Golovkin, Nikolay Nemtsev, Vasyl Shandyba, Oleg Udin, Nikita Kasatkin, Pavel Kononov, Anton Afanasiev, Sergey Ulasen, Andrei Boiarov FullCycle: Full Stage Adversarial Attack for Reinforcement Learning Robustness Evaluation
Zhenshu Ma, Xuan Cai, Changhang Tian, Yuqi Fan, Kemou Jiang, Gangfu Liu, Xuesong Bai, Aoyong Li, Yilong Ren, Haiyang Yu FusedVision: A Knowledge-Infusing Approach for Practical Anomaly Detection in Real-World Surveillance Videos
Khaled Waleed Dawoud, Zaigham Zaheer, Mustaqeem Khan, Karthik Nandakumar, Abdulmotaleb Elsaddik, Muhammad Haris Khan Fusion or Confusion? a Look at Dataset Pooling for Infrared Object Detection
Stefan Becker, Ann-Kristin Grosselfinger, Jens Bayer, David Münch, Wolfgang Hübner, Michael Arens FusionNet: Multi-Model Linear Fusion Framework for Low-Light Image Enhancement
Kangbiao Shi, Yixu Feng, Tao Hu, Yu Cao, Peng Wu, Yijin Liang, Yanning Zhang, Qingsen Yan Generative AI for Film Creation: A Survey of Recent Advances
Ruihan Zhang, Borou Yu, Jiajian Min, Yetong Xin, Zheng Wei, Juncheng Nemo Shi, Mingzhen Huang, Xianghao Kong, Nix Liu Xin, Shanshan Jiang, Praagya Bahuguna, Mark Chan, Khushi Hora, Lijian Yang, Yongqi Liang, Runhe Bian, Yunlei Liu, Isabela Campillo Valencia, Patricia Morales Tredinick, Ilia Kozlov, Sijia Jiang, Peiwen Huang, Na Chen, Xuanxuan Liu, Anyi Rao Geometry-Aware Texture Generation for 3D Head Modeling with Artist-Driven Control
Amin Fadaeinejad, Abdallah Dib, Luiz Gustavo Hafemann, Emeline Got, Trevor Anderson, Amaury Depierre, Nikolaus F. Troje, Marcus A. Brubaker, Marc-André Carbonneau gMINT: Gradiant-Based Membership Inference Test Applied to Image Models
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Gonzalo Mancera, Ruben Tolosana Goal-Driven Human Motion Synthesis in Diverse Task
Inwoo Hwang, Jinseok Bae, Donggeun Lim, Young Min Kim GPT-FL: Generative Pre-Trained Model-Assisted Federated Learning
Tuo Zhang, Tiantian Feng, Samiul Alam, Dimitrios Dimitriadis, Sunwoo Lee, Mi Zhang, Shrikanth S. Narayanan, Salman Avestimehr GRS: Generating Robotic Simulation Tasks from Real-World Images
Alex Zook, Fan-Yun Sun, Josef B. Spjut, Valts Blukis, Stan Birchfield, Jonathan Tremblay HDC: Hierarchical Distillation for Multi-Level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation
Tran Quoc Khanh Le, Nguyen Lan Vi Vu, Ha-Hieu Pham, Xuan-Loc Huynh, Tien-Huy Nguyen, Minh Huu Nhat Le, Quan Nguyen, Hien D. Nguyen How Good Is My Video-LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Jameel Hassan, Muzammal Naseer, Federico Tombari, Fahad Shahbaz Khan, Salman Khan How Much Noise Is There in Labels Generated by Humans? a Method to Validate Automatically Generated Bounding Boxes
Mariusz Karol Nowak, Jacek Cyranka, Natalia Maslany, Aleksander Kostuch, Jakub Derbisz, Mateusz Komorkiewicz, Patryk Siwek, Mateusz Wójcik, Dariusz Marchewka, Pawel Skruch Human vs. Machine Minds: Ego-Centric Action Recognition Compared
Sadegh Rahmani-Boldaji, Filip Rybansky, Quoc Vuong, Frank Guerin, Andrew Gilbert IAUNet: Instance-Aware U-Net
Yaroslav Prytula, Illia Tsiporenko, Ali Zeynalli, Dmytro Fishman Ice Hockey Puck Localization Using Contextual Cues
Liam Salass, Jerrin Bright, Amir Nazemi, Yuhao Chen, John S. Zelek, David A. Clausi Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions
Mohammadmostafa Rostamkhani, Baktash Ansari, Hoorieh Sabzevari, Farzan Rahmani, Sauleh Eetemadi IMC: A Benchmark for Invariant Learning Under Multiple Causes
Taero Kim, Seonggyun Lee, Joonseong Kang, Youngjun Choi, Wonsang Yun, Nicole Hee-Yeon Kim, Ziyu Chen, Lexing Xie, Kyungwoo Song Improving Open-World Object Localization by Discovering Background
Ashish Singh, Michael Jones, Kuan-Chuan Peng, Anoop Cherian, Moitreya Chatterjee, Erik G. Learned-Miller Instance Feature Caching for Cross-Domain Few-Shot Object Detection
Yali Huang, Jie Mei, Yiming Yang, Mi Guo, Mingyuan Jiu, Mingliang Xu Instruction-Augmented Multimodal Alignment for Image-Text and Element Matching
Xinli Yue, Jianhui Sun, Junda Lu, Liangchao Yao, Fan Xia, Tianyi Wang, Fengyun Rao, Jing Lyu, Yuetang Deng Jump-Aware: Player Position Rectification and Identification in Dynamic Sports Using Jump Event Spotting
Yin May Oo, Ankhzaya Jamsrandorj, Vanyi Chao, Hoang Quoc Nguyen, Yewon Hwang, Kyung-Ryoul Mun, Jinwook Kim LangCoop: Collaborative Driving with Language
Xiangbo Gao, Yuheng Wu, Rujia Wang, Chenxi Liu, Yang Zhou, Zhengzhong Tu LAPIS: A Novel Dataset for Personalized Image Aesthetic Assessment
Anne-Sofie Maerten, Li-Wei Chen, Stefanie De Winter, Christophe Bossens, Johan Wagemans Learned Smartphone ISP on Mobile GPUs, Mobile AI 2025 Challenge: Report
Andrey Ignatov, Georgy Perevozchikov, Radu Timofte, Cheng Li, Lian Liu, Jun Cao, Heng Sun, Wu Pan, Song Wang, Keqiang Yu, Shuo Liu, Hongqin He, Zhenhao Dong, Jianke Chen, Dejun Hao, Keqiang Yu, Tingniao Wang, Xiaoqing Zhou, Dong Zhang, Chunxia Zhang, Jianguang He, Hailong Yan, Ao Li, Xiangtao Zhang, Zhe Liu, Ce Zhu, Le Zhang, Andrei Arhire, Shuo Liu, Junpyo Seo, Fen Xie, Xiuzhi Fang, Chen Wu, Zhangsheng Wang, Pengbo Zhang, Jiazi Huang Learning Optical Flow Field via Neural Ordinary Differential Equation
Leyla Mirvakhabova, Hong Cai, Jisoo Jeong, Hanno Ackermann, Farhad G. Zanjani, Fatih Porikli Learning to Drive from a World Model
Mitchell Goff, Greg Hogan, George Hotz, Armand du Parc Locmaria, Kacper Raczy, Harald Schäfer, Adeeb Shihadeh, Weixing Zhang, Yassine Yousfi Leveraging Synthetic Adult Datasets for Unsupervised Infant Pose Estimation
Sarosij Bose, Hannah Dela Cruz, Arindam Dutta, Elena Kokkoni, Konstantinos Karydis, Amit K. Roy-Chowdhury LMFormer: Lane Based Motion Prediction Transformer
Harsh Yadav, Maximilian Schäfer, Kun Zhao, Tobias Meisen LNTransformer: Lung Nodule Transformer for Sparse CT Segmentation
Hooman Ramezani, Charlotte Vedrines, Dionne M. Aleman, Daniel Létourneau Location-Free Scene Graph Generation
Ege Özsoy, Felix Holm, Chantal Pellegrini, Tobias Czempiel, Mahdi Saleh, Nassir Navab, Benjamin Busam Looking into the Shadow: Recording a Total Solar Eclipse with High-Resolution Event Cameras
Fernando Cladera, Kenneth Chaney, Caroline Pritchard, M. Ani Hsieh, Vijay Kumar, Camillo J. Taylor, Kostas Daniilidis Low-Frame-Rate Cell Tracking: Unmet Needs and Future Directions
Mina Gachloo, Akhila Nangineedi, Mahsa Partovi, Fardifa Fathmiul Alam, Tzu-Yu Chu, James Schvaneveldt, Xiaoming Lu, Tirthankar Biswas, Marc R. Birtwistle, Federico Iuricich LVP-CLIP: Revisiting CLIP for Continual Learning with Label Vector Pool
Yue Ma, Huantao Ren, Boyu Wang, Jingang Jin, Senem Velipasalar, Qinru Qiu Maize Ear Sensing for On-Farm Yield Predictions
Pedro Cisdeli, Gustavo Nocera Santiago, German Mandrini, Ignacio Antonio Ciampitti Mapping Biodiversity at Very-High Resolution in Europe
César Leblanc, Lukás Picek, Rémi Palard, Benjamin Deneu, Maximilien Servajean, Pierre Bonnet, Alexis Joly MAVEN: Multi-Modal Attention for Valence-Arousal Emotion Network
Vrushank Ahire, Kunal Shah, Mudasir Nazir Khan, Nikhil Pakhale, Lownish Rai Sookha, Mudasir Ahmad Ganaie, Abhinav Dhall MerCulture: A Comprehensive Benchmark to Evaluate Vision-Language Models on Cultural Understanding in Singapore
Tushar Pranav, Eshan Pandey, Lyka Diane Bala Austria, Yin Yin Loo, Jing Hao Lim, Indriyati Atmosukarto, Donny Cheng Lock Soh MObI: Multimodal Object Inpainting Using Diffusion Models
Alexandru Buburuzan, Anuj Sharma, John Redford, Puneet K. Dokania, Romain Mueller MoCLIP Motion-Aware Fine-Tuning and Distillation of CLIP for Human Motion Generation
Gabriel Maldonado, Armin Danesh Pazho, Ghazal Alinezhad Noghre, Vinit Katariya, Hamed Tabkhi MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Kengo Uchida, Takashi Shibuya, Yuhta Takida, Naoki Murata, Julian Tanke, Shusuke Takahashi, Yuki Mitsufuji MTA-VPS: A Large-Scale Benchmark for Video-Based Person Search
Ding Qi, Shuguang Dou, Jian Liu, Huaixuan Cao, Hao Zhang, Dongsheng Jiang, Cairong Zhao Multi-Agent Systems for Robotic Autonomy with LLMs
Junhong Chen, Ziqi Yang, Haoyuan G. Xu, Dandan Zhang, George P. Mylonas Multi-Entity Video Transformers for Fine-Grained Video Representation Learning
Matthew Walmer, Rose Catherine Kanjirathinkal, Kai Sheng Tai, Keyur Muzumdar, Tai-Peng Tian, Abhinav Shrivastava Multi-Person Physics-Based Pose Estimation for Combat Sports
Hossein Feizollah Zadeh Khoiee, David R. Labbé, Thomas Romeas, Jocelyn Faubert, Sheldon Andrews Cite
Multi-Spectral Imaging and Data Fusion for Real-Time Bleeding Detection
Ghazal Rouhafzay, Stephen Rowlands, Angel J. Valencia, Shengsong Yang, Pierre Payeur, Haitao Tian, James Dickens Multimodal 3D Object Detection on Unseen Domains
Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael Jones, Vishal Patel Multimodal Generalized Category Discovery
Yuchang Su, Renping Zhou, Siyu Huang, Xingjian Li, Tianyang Wang, Ziyue Wang, Min Xu Nanoparticle Diameter Measurements with Event Camera Tracking
Michael C. Daugherty, Matthew DiSalvo, Aaron Goldfain, Alexander Peterson, Edward Kwee, Thomas Germer, Gregory Cooksey, Jagat Budhathoki, Peter Bajcsy NeIn: Telling What You Don't Want
Nhat-Tan Bui, Dinh-Hieu Hoang, Quoc-Huy Trinh, Minh-Triet Tran, Truong Nguyen, Susan Gauch NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds
Mahan Rafidashti, Ji Lan, Maryam Fatemi, Junsheng Fu, Lars Hammarstrand, Lennart Svensson NTIRE 2025 Ambient Lighting Normalization Challenge Report
Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Zongwei Wu, Radu Timofte, Yuanfei Bao, Xingbo Wang, Xin Lu, Jiarong Yang, Anya Hu, Kunyu Wang, Jie Xiao, Dong Li, Xueyang Fu, Zheng-Jun Zha, Zihao Fan, Xi Wang, Yurui Zhu, Kean Liu, Senyan Xu, Hongjian Liu, Yupeng Xiao, David Serrano-Lozano, Francisco A. Molina-Bakhos, Danna Xue, Yixiong Yang, Maria Pilligua, Ramon Baldrich, María Vanrell, Javier Vazquez-Corral, Xuan Sun, Zijie Lou, Ting Liu, Kuldeep Purohit, Jameer Babu Pinjari, Yilin Zhang, Huan Zheng, Yanyan Wei, Suiyi Zhao, Shengeng Tang, Zhao Zhang, Yushen Zuo, Zongqi He, Zhe Xiao, Cuixin Yang, Rongkang Dong, Jun Xiao, Kin-Man Lam, Nikhil Akalwadi, Vijayalaxmi Ashok Aralikatti, Dheeraj Damodhar Hegde, Ramesh Ashok Tabib, Uma Mudenagudi, Anas M. Ali, Bilel Benjdira, Wadii Boulila NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results
Yuqian Fu, Xingyu Qiu, Bin Ren, Yanwei Fu, Radu Timofte, Nicu Sebe, Ming-Hsuan Yang, Luc Van Gool, Kaijin Zhang, Qingpeng Nong, Xiugang Dong, Hong Gao, Xiangsheng Zhou, Jiancheng Pan, Yanxing Liu, Xiao He, Jiahao Li, Yuze Sun, Xiaomeng Huang, Zhenyu Zhang, Ran Ma, Yuhan Liu, Zijian Zhuang, Shuai Yi, Yixiong Zou, Lingyi Hong, Mingxi Chen, Runze Li, Xingdong Sheng, Wenqiang Zhang, Weisen Chen, Yongxin Yan, Xinguo Chen, Yuanjie Shao, Zhengrong Zuo, Nong Sang, Hao Wu, Haoran Sun, Shuming Hu, Yan Zhang, Zhiguang Shi, Yu Zhang, Chao Chen, Tao Wang, Da Feng, Linhai Zhuo, Ziming Lin, Yali Huang, Jie Me, Yiming Yang, Mi Guo, Mingyuan Jiu, Mingliang Xu, Maomao Xiong, Qunshu Zhang, Xinyu Cao, Yuqing Yang, Dianmo Sheng, Xuanpu Zhao, Zhiyu Li, Xuyang Ding, Wenqian Li NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, Yubo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, Xin Cao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun, Subhajit Paul, Ni Tang, Junhao Huang, Zihan Cheng, Hongyun Zhu, Yuehan Wu, Kaixin Deng, Huang Ouyang, Tianxin Xiao, Fan Yang, Zhizun Luo, Zeyu Xiao, Zhuoyuan Li, Pham Hoang Le Nguyen, Dinh Thien An, Luu Thanh Son, Kiet Van Nguyen, Ronghua Xu, Xianmin Tian, Weijian Zhou, Jiacheng Zhang, Yuqian Chen, Yihang Duan, Yujie Wu, Suresh Raikwar, Arsh Garg, Kritika Kritika, Jianhua Zheng, Xiaoshan Ma, Ruolin Zhao, Yongyu Yang, Yongsheng Liang, Guiming Huang, Qiang Li, Hongbin Zhang, Xiangyu Zheng, A. N. Rajagopalan NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results
Sangmin Lee, Eunpil Park, Angel Canelo, Hyunhee Park, Youngjo Kim, Hyung-Ju Chun, Xin Jin, Chongyi Li, Chun-Le Guo, Radu Timofte, Qi Wu, Tianheng Qiu, Yuchun Dong, Shenglin Ding, Guanghua Pan, Weiyu Zhou, Tao Hu, Yixu Feng, Duwei Dai, Yu Cao, Peng Wu, Wei Dong, Yanning Zhang, Qingsen Yan, Simon J. Larsen, Senyan Xu, Xingbo Wang, Ruixuan Jiang, Xin Lu, Marcos V. Conde, Javier Abad-Hernández, Álvaro García-Lara, Daniel Feijoo, Álvaro García, Zeyu Xiao, Zhuoyuan Li NTIRE 2025 Challenge on Event-Based Image Deblurring: Methods and Results
Lei Sun, Andrea Alfarano, Peiqi Duan, Shaolin Su, Kaiwei Wang, Boxin Shi, Radu Timofte, Danda Pani Paudel, Luc Van Gool, Qinglin Liu, Wei Yu, Xiaoqian Lv, Lu Yang, Shuigen Wang, Shengping Zhang, Xiangyang Ji, Long Bao, Yuqiang Yang, Jinao Song, Ziyi Wang, Shuang Wen, Heng Sun, Kean Liu, Mingchen Zhong, Senyan Xu, Zhijing Sun, Jiaying Zhu, Chengjie Ge, Xingbo Wang, Yidi Liu, Xin Lu, Xueyang Fu, Zheng-Jun Zha, Dawei Fan, Dafeng Zhang, Yong Yang, Siru Zhang, Qinghua Yang, Hao Kang, Huiyuan Fu, Heng Zhang, Hongyuan Yu, Zhijuan Huang, Shouyan Wei, Feng Li, Runmin Cong, Weiqi Luo, Mingyun Lin, Chenxu Jiang, Hongyi Liu, Lei Yu, Weilun Li, Jiajun Zhai, Tingting Lin, Shuang Ma, Sai Zhou, Zhanwen Liu, Yang Wang, Eiffel Chong, Nuwan Bandara, Thivya Kandappu, Archan Misra, Yihang Chen, Zhan Li, Weijun Yuan, Wenzhuo Wang, Boyang Yao, Zhanglu Chen, Yijing Sun, Tianjiao Wan, Zijian Gao, Qisheng Xu, Kele Xu, Yukun Zhang, Yu He, Xiaoyan Xie, Tao Fu, Yashu Guatamkumar Patel, Vihar Ramesh Jain, Divesh Basina, Rishik Ashili, Manish Kumar Manjhi, Sourav Kumar, Prinon Benny, Himanshu Ghunawat, B. Sri Sairam Gautam, Anett Varghese, Abhishek Yadav NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces
Pierluigi Zama Ramirez, Fabio Tosi, Luigi Di Stefano, Radu Timofte, Alex Costanzino, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Zhe Zhang, Yang Yang, Wu Chen, Anlong Ming, Mingshuai Zhao, Mengying Yu, Shida Gao, Xiangfeng Wang, Feng Xue, Jun Shi, Yong Yang, Yong A, Yixiang Jin, Dingzhe Li, Aryan Shukla, Liam Frija-Altarac, Matthew Toews, Hui Geng, Tianjiao Wan, Zijian Gao, Qisheng Xu, Kele Xu, Zijian Zang, Jameer Babu Pinjari, Kuldeep Purohit, Mykola Lavreniuk, Jing Cao, Shenyi Li, Kui Jiang, Junjun Jiang, Yong Huang NTIRE 2025 Challenge on Image Super-Resolution (x4): Methods and Results
Zheng Chen, Kai Liu, Jue Gong, Jingkai Wang, Lei Sun, Zongwei Wu, Radu Timofte, Yulun Zhang, Xiangyu Kong, Xiaoxuan Yu, Hyunhee Park, Suejin Han, Hakjae Jeon, Dafeng Zhang, Hyung-Ju Chun, Donghun Ryou, Inju Ha, Bohyung Han, Lu Zhao, Yuyi Zhang, Pengyu Yan, Jiawei Hu, Pengwei Liu, Fengjun Guo, Hongyuan Yu, Pufan Xu, Zhijuan Huang, Shuyuan Cui, Peng Guo, Jiahui Liu, Dongkai Zhang, Heng Zhang, Huiyuan Fu, Huadong Ma, Yanhui Guo, Sisi Tian, Xin Li, Jinwen Liang, Jie Liu, Jie Tang, Gangshan Wu, Zeyu Xiao, Zhuoyuan Li, Yinxiang Zhang, Wenxuan Cai, Vijayalaxmi Ashok Aralikatti, Nikhil Akalwadi, G. Gyaneshwar Rao, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Marcos V. Conde, Alejandro Merino, Bruno Longarela, Javier Abad, Weijun Yuan, Zhan Li, Zhanglu Chen, Boyang Yao, Aagam Jain, Milan Kumar Singh, Ankit Kumar, Shubh Kawa, Divyavardhan Singh, Anjali Sarvaiya, Kishor P. Upla, Raghavendra Ramachandra, Chia-Ming Lee, Yu-Fan Lin, Chih-Chung Hsu, Risheek V. Hiremath, Palani Yashaswini, Yuxuan Jiang, Qiang Zhu, Siyue Teng, Fan Zhang, Shuyuan Zhu, Bing Zeng, David Bull, Jingwei Liao, Yuqing Yang, Wenda Shao, Junyi Zhao, Qisheng Xu, Kele Xu, Sunder Ali Khowaja, Ik Hyun Lee, Snehal Singh Tomar, Rajarshi Ray, Klaus Mueller, Sachin Chaudhary, Surya Vashisth, Akshay Dudhane, Praful Hambarde, Satya Naryan Tazi, Prashant W. Patil, Santosh Kumar Vipparthi, Subrahmanyam Murala, Bilel Benjdira, Anas M. Ali, Wadii Boulila, Zahra Moammeri, Ahmad Mahmoudi-Aznaveh, Ali Karbasi, Hossein Motamednia, Liangyan Li, Guanhua Zhao, Kevin Le, Yimo Ning, Haoxuan Huang, Jun Chen NTIRE 2025 Challenge on Light Field Image Super-Resolution: Methods and Results
Yingqian Wang, Zhengyu Liang, Fengyuan Zhang, Lvli Tian, Longguang Wang, Juncheng Li, Jungang Yang, Radu Timofte, Yulan Guo, Kai Jin, Zeqiang Wei, Angulia Yang, Di Wu, Mingzhi Gao, Xiuzhuang Zhou, Yue Yan, Yuaho Wang, Shuang Chen, Zeping Tian, Yizhi Hu, Yao Lu, Haosong Liu, Xiancheng Zhu, Huanqiang Zeng, Jianqing Zhu, Yifan Shi, Junhui Hou, Mingyang Yu, Zhijian Wu, Dingjiang Huang, Wenli Zheng, Zekai Xu, Huiyuan Fu, Heng Zhang, Zhijuan Huang, Hongyuan Yu, Zeke Zexi Hu, Haodong Chen, Vera Yuk Ying Chung, Xiaoming Chen, Zean Chen, Yeyao Chen, Gangyi Jiang, Haiyong Xu, Ting Luo, Guanglong Liao, Danhao Zhang, Siyu Zhang, Wendong Mao, Zhongfeng Wang, Sunita Arya, Abhishek Kumar Sinha, S. Manthira Moorthi, Hao Zhang, Hao Sheng, Da Yang, Zhenglong Cui, Shuai Wang, Haotian Zhang, Xingzheng Wang, Yuanbo Huang, Jiahao Lin, Yuhang Lin, Ahmed Salem, Ebrahem Elkady, Hatem Ibrahem, Jae-Won Suh, Hyun-Soo Kang, Changguang Wu, Hao Hou, Pengpeng Li, Peng Huang, Jiangxin Dong, Jinhui Tang NTIRE 2025 Challenge on Low Light Image Enhancement: Methods and Results
Xiaoning Liu, Zongwei Wu, Florin-Alexandru Vasluianu, Hailong Yan, Bin Ren, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Kangbiao Shi, Yixu Feng, Tao Hu, Yu Cao, Peng Wu, Yijin Liang, Yanning Zhang, Qingsen Yan, Han Zhou, Wei Dong, Yan Min, Mohab Kishawy, Jun Chen, Pengpeng Yu, Anjin Park, Seung-Soo Lee, Young-Joon Park, Zixiao Hu, Junyv Liu, Huilin Zhang, Jun Zhang, Fei Wan, Bingxin Xu, Hongzhe Liu, Cheng Xu, Weiguo Pan, Songyin Dai, Xunpeng Yi, Qinglong Yan, Yibing Zhang, Jiayi Ma, Changhui Hu, Kerui Hu, Donghang Jing, Tiesheng Chen, Zhi Jin, Hongjun Wu, Biao Huang, Haitao Ling, Jiahao Wu, Dandan Zhan, G. Gyaneshwar Rao, Vijayalaxmi Ashok Aralikatti, Nikhil Akalwadi, Ramesh Ashok Tabib, Uma Mudenagudi, Ruirui Lin, Guoxi Huang, Nantheera Anantrasirichai, Qirui Yang, Alexandru Brateanu, Ciprian Orhei, Cosmin Ancuti, Daniel Feijoo, Juan C. Benito, Álvaro García, Marcos V. Conde, Yang Qin, Raul Balmez, Anas M. Ali, Bilel Benjdira, Wadii Boulila, Tianyi Mao, Huan Zheng, Yanyan Wei, Shengeng Tang, Dan Guo, Zhao Zhang, Sabari Nathan, K. Uma, A. Sasithradevi, B. Sathya Bama, S. Mohamed Mansoor Roomi, Ao Li, Xiangtao Zhang, Zhe Liu, Yijie Tang, Jialong Tang, Zhicheng Fu, Gong Chen, Joe Nasti, John Nicholson, Zeyu Xiao, Zhuoyuan Li, Ashutosh Kulkarni, Prashant W. Patil, Santosh Kumar Vipparthi, Subrahmanyam Murala, Duan Liu, Weile Li, Hangyuan Lu, Rixian Liu, Tengfeng Wang, Jinxing Liang, Chenxin Yu NTIRE 2025 Challenge on Night Photography Rendering
Egor I. Ershov, Sergey Korchagin, Aleksei Khalin, Artyom Panshin, Arseniy P. Terekhin, Ekaterina Zaychenkova, Georgiy Lobarev, Vsevolod Plokhotnyuk, Denis Abramov, Elisey Zhdanov, Sofia Dorogova, Yasin Mamedov, Nikola Banic, Georgy Perevozchikov, Radu Timofte, Lize Zhang, Yuqian Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Yibin Huang, Guangqi Shao, Xiaotao Wang, Lei Lei, Sishun Pan, Zhiqiang Zhong, Yang Yang, Anas M. Ali, Hamad Aloqayli, Bilel Benjdira, Wadii Boulila, Xiaoyang Ma, Zijun Gao, Leyi Xing, Zongqi He, Yushen Zuo, Zhe Xiao, Kin-Chung Chan, Hanmin Li, Jun Xiao, Kin-Man Lam, Yunpeng Wu, Dmitrij Manzura, Daniil Storonkin, Weixin Guo, Kele Xu, Qisheng Xu, Zijian Gao, Tianjiao Wan, Buda Vampilov, Furkan Kinli, Furkan Kiraç NTIRE 2025 Challenge on RAW Image Restoration and Super-Resolution
Marcos V. Conde, Radu Timofte, Zihao Lu, Xiangyu Kong, Xiaoxia Xing, Fan Wang, Suejin Han, MinKyu Park, Tianyu Hao, Yuhong He, Ruoqi Li, Yueqi Yang, Jianyang Yu, Kele Xu, Zisheng Xu, Yong Dou, Watchara Ruangsang, Ruixuan Jiang, Senyan Xu, Siyuan Jiang, Xueyang Fu, Zheng-Jun Zha, Jiajie Lu, Xiang Yu, Minmin Yi, Yuanjia Chen, Liwen Zhang, Zijie Jin, Tianyu Zhang, Xin Lu, Yeda Chen, Dong Liu, Li Pang, Yuhang Yang, Hongzhong Wang, Xiangyong Cao, Cheng Li, Lian Liu, Wei Song, Heng Sun, Yubo Wang, Jinghua Wang, Guanlan Hong NTIRE 2025 Challenge on Real-World Face Restoration: Methods and Results
Zheng Chen, Jingkai Wang, Kai Liu, Jue Gong, Lei Sun, Zongwei Wu, Radu Timofte, Yulun Zhang, Jianxing Zhang, Jinlong Wu, Jun Wang, Zheng Xie, Hakjae Jeon, Suejin Han, Hyung-Ju Chun, Hyunhee Park, Zhicun Yin, Junjie Chen, Ming Liu, Xiaoming Li, Chao Zhou, Wangmeng Zuo, Weixia Zhang, Dingquan Li, Kede Ma, Yun Zhang, Zhuofan Zheng, Yuyue Liu, Shizhen Tang, Zihao Zhang, Yi Ning, Hao Jiang, Wenjie An, Kangmeng Yu, Chenyang Wang, Kui Jiang, Xianming Liu, Junjun Jiang, Yingfu Zhang, Gang He, Siqi Wang, Kepeng Xu, Zhenyang Liu, Changxin Zhou, Shanlan Shen, Yubo Duan, Yiang Chen, Jin Guo, Mengru Yang, Jen-Wei Lee, Chia-Ming Lee, Chih-Chung Hsu, Hu Peng, Chunming He NTIRE 2025 Challenge on Short-Form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study
Xin Li, Xijun Wang, Bingchen Li, Kun Yuan, Yizhen Shao, Suhang Yao, Ming Sun, Chao Zhou, Radu Timofte, Zhibo Chen NTIRE 2025 Challenge on Short-Form UGC Video Quality Assessment and Enhancement: Methods and Results
Xin Li, Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Yabin Zhang, Ao-Xiang Zhang, Tianwu Zhi, Jianzhao Liu, Yang Li, Jingwen Xu, Yiting Liao, Yushen Zuo, Mingyang Wu, Renjie Li, Shengyun Zhong, Zhengzhong Tu, Yufan Liu, Xiangguang Chen, Zuowei Cao, Minhao Tang, Shan Liu, Kexin Zhang, Jingfen Xie, Yan Wang, Kai Chen, Shijie Zhao, Yunchen Zhang, Xiangkai Xu, Hong Gao, Ji Shi, Yiming Bao, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu, Ying Liang, Yiwen Wang, Xinning Chai, Yuxuan Zhang, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song, Wei Sun, Kang Fu, Linhan Cao, Dandan Zhu, Kaiwei Zhang, Yucheng Zhu, Zicheng Zhang, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Zhi Jin, Jiawei Wu, Wei Wang, Wenjian Zhang, Yuhai Lan, Gaoxiong Yi, Hengyuan Na, Wang Luo, Di Wu, Mingyin Bai, Jiawang Du, Zilong Lu, Zhenyu Jiang, Hui Zeng, Ziguan Cui, Zongliang Gan, Guijin Tang, Xinglin Xie, Kehuan Song, Xiaoqiang Lu, Licheng Jiao, Fang Liu, Xu Liu, Puhua Chen, Ha Thu Nguyen, Katrien De Moor, Seyed Ali Amirshahi, Mohamed-Chaker Larabi, Qi Tang, Linfeng He, Zhiyong Gao, Zixuan Gao, Guohua Zhang, Zhiye Huang, Yi Deng, Qingmiao Jiang, Lu Chen, Yi Yang, Xi Liao, Nourine Mohammed Nadir, Yuxuan Jiang, Qiang Zhu, Siyue Teng, Fan Zhang, Shuyuan Zhu, Bing Zeng, David Bull, Meiqin Liu, Chao Yao, Yao Zhao NTIRE 2025 Challenge on Single Image Reflection Removal in the Wild: Datasets, Methods and Results
Kangning Yang, Jie Cai, Ling Ouyang, Florin-Alexandru Vasluianu, Radu Timofte, Jiaming Ding, Huiming Sun, Lan Fu, Jinlong Li, Chiu Man Ho, Zibo Meng, Mingjia Li, Hainuo Wang, Qiming Hu, Jiarui Wang, Hao Zhao, Jin Hu, Xiaojie Guo, Mengru Yang, Jing He, Yiqing Wang, Zhiyang Chen, Hao Fang, Wei Zhang, Runmin Cong, Dheeraj Damodhar Hegde, Jatin Kalal, Nikhil Akalwadi, Ramesh Ashok Tabib, Uma Mudenagudi, Yu-Fan Lin, Chia-Ming Lee, Chih-Chung Hsu, Mengxin Zhang, Sabari Nathan, K. Uma, A. Sasithradevi, B. Sathya Bama, S. Mohamed Mansoor Roomi, Bilel Benjdira, Anas M. Ali, Wadii Boulila, Wei Dong, Yunzhe Li, Ali Hussein, Han Zhou, Jun Chen, Zeyu Xiao, Zhuoyuan Li NTIRE 2025 Challenge on Text to Image Generation Model Quality Assessment
Shuhao Han, Haotian Fan, Fangyuan Kong, Wenjie Liao, Chunle Guo, Chongyi Li, Radu Timofte, Liang Li, Tao Li, Junhui Cui, Yunqiu Wang, Yang Tai, Jingwei Sun, Jianhui Sun, Xinli Yue, Tianyi Wang, Huan Hou, Junda Lu, Xinyang Huang, Zitang Zhou, Zijian Zhang, Xuhui Zheng, Xuecheng Wu, Chong Peng, Xuezhi Cao, Trong-Hieu Nguyen-Mau, Minh-Hoang Le, Minh-Khoa Le-Phan, Duy-Nam Ly, Hai-Dang Nguyen, Minh-Triet Tran, Yukang Lin, Yan Hong, Chuanbiao Song, Siyuan Li, Jun Lan, Zhichao Zhang, Xinyue Li, Wei Sun, Zicheng Zhang, Yunhao Li, Xiaohong Liu, Guangtao Zhai, Zitong Xu, Huiyu Duan, Jiarui Wang, Guangji Ma, Liu Yang, Lu Liu, Qiang Hu, Xiongkuo Min, Zichuan Wang, Zhenchen Tang, Bo Peng, Jing Dong, Fengbin Guan, Zihao Yu, Yiting Lu, Wei Luo, Xin Li, Minhao Lin, Haofeng Chen, Xuanxuan He, Kele Xu, Qisheng Xu, Zijian Gao, Tianjiao Wan, Bo-Cheng Qiu, Chih-Chung Hsu, Chia-Ming Lee, Yu-Fan Lin, Bo Yu, Zehao Wang, Da Mu, Mingxiu Chen, Junkang Fang, Huamei Sun, Wending Zhao, Zhiyu Wang, Wang Liu, Weikang Yu, Puhong Duan, Bin Sun, Xudong Kang, Shutao Li, Shuai He, Lingzhi Fu, Heng Cong, Rongyu Zhang, Jiarong He, Zhishan Qiao, Yongqing Huang, Zewen Chen, Zhe Pang, Juan Wang, Jian Guo, Zhizhuo Shao, Ziyu Feng, Bing Li, Weiming Hu, Hesong Li, Dehua Liu, Zeming Liu, Qingsong Xie, Ruichen Wang, Zhihao Li, Yuqi Liang, Jianqi Bi, Jun Luo, Junfeng Yang, Can Li, Jing Fu, Hongwei Xu, Mingrui Long, Lulin Tang NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results
Nickolay Safonov, Alexey Bryntsev, Andrey Moskalenko, Dmitry Kulikov, Dmitriy S. Vatolin, Radu Timofte, Haibo Lei, Qifan Gao, Qing Luo, Yaqing Li, Jie Song, Shaozhe Hao, Meisong Zheng, Jingyi Xu, Chengbin Wu, Jiahui Liu, Ying Chen, Xin Deng, Mai Xu, Peipei Liang, Jie Ma, Junjie Jin, Yingxue Pang, Fangzhou Luo, Kai Chen, Shijie Zhao, Mingyang Wu, Renjie Li, Yushen Zuo, Zhengzhong Tu, Shengyun Zhong NTIRE 2025 Challenge on Video Quality Enhancement for Video Conferencing: Datasets, Methods and Results
Varun Jain, Zongwei Wu, Quan Zou, Louis Florentin, Henrik Turbell, Sandeep Siddhartha, Radu Timofte, Qifan Gao, Linyan Jiang, Qing Luo, Jie Song, Yaqing Li, Summer Luo, Mae Chen, Stefan Liu, Danie Song, Huimin Zeng, Qi Chen, Ajeet Kumar Verma, Shweta Tripathi, Vinit Jakhetiya, Badri N. Subhdhi, Sunil Jaiswal NTIRE 2025 Image Shadow Removal Challenge Report
Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Cailian Chen, Zongwei Wu, Radu Timofte, Mingjia Li, Jin Hu, Hainuo Wang, Hengxing Liu, Jiarui Wang, Qiming Hu, Xiaojie Guo, Xin Lu, Jiarong Yang, Yuanfei Bao, Anya Hu, Zihao Fan, Kunyu Wang, Jie Xiao, Xi Wang, Xueyang Fu, Zheng-Jun Zha, Yu-Fan Lin, Chia-Ming Lee, Chih-Chung Hsu, Xingbo Wang, Dong Li, Yuxu Chen, Bin Chen, Yuanbo Zhou, Yuanbin Chen, Hongwei Wang, Jiannan Lin, Qinquan Gao, Tong Tong, Zhao Zhang, Yanyan Wei, Wei Dong, Han Zhou, Seyed Amirreza Mousavi, Jun Chen, Haobo Liang, Jiajie Jing, Junyu Li, Yan Yang, Seoyeon Lee, Chaewon Kim, Ziyu Feng, Shidi Chen, Bowen Luan, Zewen Chen, Vijayalaxmi Ashok Aralikatti, G. Gyaneshwar Rao, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Anas M. Ali, Bilel Benjdira, Wadii Boulila, Alexandru Brateanu, Cosmin Ancuti, Tanmay Chaturvedi, Manish Kumar, Anmol Srivastav, Daksh Trivedi, Shashwat Thakur, Kishor P. Upla, Zeyu Xiao, Zhuoyuan Li, Boda Zhou, Shashank Shekhar, Kele Xu, Qisheng Xu, Zijian Gao, Tianjiao Wan, Suiyi Zhao, Bo Wang, Yan Luo, Mingshen Wang, Yilin Zhang NTIRE 2025 the 2nd Restore Any Image Model (RAIM) in the Wild Challenge
Jie Liang, Radu Timofte, Qiaosi Yi, Zhengqiang Zhang, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang, Tianyu Hao, Lin Wang, Zhe Xiao, Pengzhou Ji, Shupeng Zhong, Xiangming Wang, Jiaqi Yan, Sishun Pan, Ce Wang, Yibin Huang, Zhang Sheng Wang, Haobo Liang, Zhenghao Pan, Jinjian Wu, Yushen Zuo, Yuanbo Zhou NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results
Xiaohong Liu, Xiongkuo Min, Qiang Hu, Xiaoyun Zhang, Jie Guo, Guangtao Zhai, Shushi Wang, Yingjie Zhou, Lu Liu, Jingxin Li, Liu Yang, Farong Wen, Li Xu, Yanwei Jiang, Xilei Zhu, Chunyi Li, Zicheng Zhang, Huiyu Duan, Xiele Wu, Yixuan Gao, Yuqin Cao, Jun Jia, Wei Sun, Jiezhang Cao, Radu Timofte, Baojun Li, Jiamian Huang, Dan Luo, Tao Liu, Weixia Zhang, Bingkun Zheng, Junlin Chen, Ruikai Zhou, Meiya Chen, Yu Wang, Hao Jiang, Xiantao Li, Yuxiang Jiang, Jun Tang, Yimeng Zhao, Bo Hu, Zelu Qi, Chaoyang Zhang, Fei Zhao, Ping Shi, Lingzhi Fu, Heng Cong, Shuai He, Rongyu Zhang, Jiarong He, Zongyao Hu, Wei Luo, Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen, Mengjing Su, Yi Wang, Tuo Chen, Chunxiao Li, Shuaiyu Zhao, Jiaxin Wen, Chuyi Lin, Sitong Liu, Ningxin Chu, Jing Wan, Yu Zhou, Baoying Chen, Jishen Zeng, Jiarui Liu, Xianjin Liu, Xin Chen, Lanzhi Zhou, Hangyu Li, You Han, Bibo Xiang, Zhenjie Liu, Jianzhang Lu, Jialin Gui, Renjie Lu, Shangfei Wang, Donghao Zhou, Jingyu Lin, Quanjian Song, Jiancheng Huang, Yufeng Yang, Changwei Wang, Shupeng Zhong, Yang Yang, Lihuo He, Jia Liu, Yuting Xing, Tida Fang, Yuchun Jin OccludeNeRF: Geometry-Aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF
Jingyu Shi, Achleshwar Luthra, Jiazhi Li, Xiang Gao, Xiyun Song, Zongfang Lin, Xianfeng David Gu, Heather Yu On the Suitability of Reinforcement Fine-Tuning to Visual Tasks
Xiaxu Chen, Wei Li, Chunxu Liu, Chi Xie, Xiaoyan Hu, Chengqian Ma, Feng Zhu, Rui Zhao OpenSplat3D: Open-Vocabulary 3D Instance Segmentation Using Gaussian Splatting
Jens Piekenbrinck, Christian Schmidt, Alexander Hermans, Narunas Vaskevicius, Timm Linder, Bastian Leibe OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu, Chen Zhao, Fatimah Zohra, Mattia Soldan, Alejandro Pardo, Mengmeng Xu, Lama Alssum, Merey Ramazanova, Juan León Alcázar, Anthony Cioppa, Silvio Giancola, Carlos Hinojosa, Bernard Ghanem Outlier-Robust Multi-Model Fitting on Quantum Annealers
Saurabh Pandey, Luca Magri, Federica Arrigoni, Vladislav Golyanik Overview of the 1st International Workshop on Interactive Video Search and Exploration
Luca Rossetto, George Awad, Werner Bailer, Cathal Gurrin, Björn Þór Jónsson, Jakub Lokoc, Stevan Rudinac, Klaus Schoeffmann Panopticon: Advancing Any-Sensor Foundation Models for Earth Observation
Leonard Waldmann, Ando Shah, Yi Wang, Nils Lehmann, Adam J. Stewart, Zhitong Xiong, Xiao Xiang Zhu, Stefan Bauer, John Chuang PaSTe: Improving the Efficiency of Visual Anomaly Detection at the Edge
Manuel Barusco, Francesco Borsatti, Davide Dalle Pezze, Francesco Paissan, Elisabetta Farella, Gian Antonio Susto PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
Jongseo Lee, Wooil Lee, Gyeong-Moon Park, Seong Tae Kim, Jinwoo Choi PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers
Maximilian Augustin, Syed Shakib Sarwar, Mostafa Elhoushi, Yuecheng Li, Sai Qian Zhang, Barbara De Salvo Physics-Based Human Pose Estimation from a Single Moving RGB Camera
Ayce Idil Aytekin, Chuqiao Li, Diogo C. Luvizon, Rishabh Dabral, Martin R. Oswald, Marc Habermann, Christian Theobalt Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Qi Gan, Sao Mai Nguyen, Eric Fenaux, Stéphan Clémençon, Mounim A. El-Yacoubi Pose-Aware Weakly-Supervised Action Segmentation
Zhihao Zhao, Reza Ghoddoosian, Isht Dwivedi, Nakul Agarwal, Behzad Dariush PPTracker: Tracking UAV Swarms with Prior Prompt
Haolin Qin, Tianhao Li, Tingfa Xu, Jingxuan Xu, Yuqiang Fang, Jianan Li Probabilistic Online Event Downsampling
Andreu Girbau-Xalabarder, Jun Nagata, Shinichi Sumiyoshi Probing Vulnerabilities of Vision-LiDAR Based Autonomous Driving Systems
Siwei Yang, Zeyu Wang, Diego Ortiz Barbosa, Luis Burbano, Murat Kantarcioglu, Alvaro A. Cárdenas, Cihang Xie Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians
Yixuan Li, Xingjian Ran, Linning Xu, Tao Lu, Mulin Yu, Zhenzhi Wang, Yuanbo Xiangli, Dahua Lin, Bo Dai Progressive Autoregressive Video Diffusion Models
Desai Xie, Zhan Xu, Yicong Hong, Hao Tan, Difan Liu, Feng Liu, Arie E. Kaufman, Yang Zhou Prompt Categories Cluster for Weakly Supervised Semantic Segmentation
Wangyu Wu, Xianglin Qiu, Siqi Song, Zhenhong Chen, Xiaowei Huang, Fei Ma, Jimin Xiao PromptNorm: Image Geometry Guides Ambient Light Normalization
David Serrano-Lozano, Francisco A. Molina-Bakhos, Danna Xue, Yixiong Yang, Maria Pilligua, Ramon Baldrich, María Vanrell, Javier Vazquez-Corral ProtoPatchNet: An Interpretable Patch-Based Prototypical Network
Mohana Singh, Vivek B. S., Jayavardhana Gubbi, R. Venkatesh Babu Prototype-Guided Diffusion for Digital Pathology: Achieving Foundation Model Performance with Minimal Clinical Data
Ekaterina Redekop, Mara Pleasure, Vedrana Ivezic, Zichen Wang, Kimberly Flores, Anthony Sisk, William Speier, Corey W. Arnold Pureformer: Transformer-Based Image Denoising
Arnim Gautam, Aditi Pawar, Aishwarya Joshi, Satya Narayan Tazi, Sachin Chaudhary, Praful Hambarde, Akshay Dudhane, Santosh Kumar Vipparthi, Subrahmanyam Murala QID: Efficient Query-Informed ViTs in Data-Scarce Regimes for OCR-Free Document Understanding
Binh M. Le, Shaoyuan Xu, Jinmiao Fu, Zhishen Huang, Moyan Li, Yanhui Guo, Hongdong Li, Sameera Ramasinghe, Bryan Wang Quality Assessment for Talking Head Videos via Multi-Modal Feature Representation
Mengjing Su, Yi Wang, Tuo Chen, Chunxiao Li, Shuaiyu Zhao, Jiaxin Wen, Chuyi Lin, Sitong Liu, Ningxin Chu, Yu Zhou Quantized Image Super-Resolution on Mobile NPUs, Mobile AI 2025 Challenge: Report
Andrey Ignatov, Georgy Perevozchikov, Radu Timofte, Zhiyu Zhang, Tianxiao Gao, Yukun Yang, Shiai Zhu, Shihao Wang, Kihwan Yoon, Ganzorig Gankhuyag, Hyeon-Cheol Moon, Taehyun Jeong, Yumi Kim, Suhyeon Lee, Jaehun Baek, Jinwoo Jeong, Eunjun Park, Jun Lee, Heejun Lee, Sungjei Kim, Dafeng Zhang, Yong Yang, Heo Myeong Cheol, Yonghyun Park, Jooho Jeong, Wontae Kim, Kanghwan Lee, Diankai Zhang, Biao Wu, Chengjian Zheng, Shaoli Liu, Si Gao, Ning Wang, Mingshen Wang, Zhao Zhang, Suiyi Zhao, Jinhan Guan, Bo Wang, Yan Luo RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving
Yujin Wang, Quanfeng Liu, Zhengxin Jiang, Tianyi Wang, Junfeng Jiao, Hongqing Chu, Bingzhao Gao, Hong Chen Reading in the Dark with Foveated Event Vision
Carl Brander, Giovanni Cioffi, Nico Messikommer, Davide Scaramuzza Real-Time Ultra-Fine-Grained Surgical Instrument Classification
Md. Atabuzzaman, Gino DiMatteo, Hani Alomari, Chiawei Tang, Connor Hale, Adam E. Goode, David Ryan King, Chris Thomas REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding
Sakib Reza, Xiyun Song, Heather Yu, Zongfang Lin, Mohsen Moghaddam, Octavia I. Camps ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking
Tzoulio Chamiti, Leandro Di Bella, Adrian Munteanu, Nikos Deligiannis RepFC: Universal Structural Reparametrization Block for High Performance, Lightweight Deep Neural Networks
Shambhavi Balamuthu Sampath, Judeson Anthony Fernando, Moritz Thoma, Nael Fasfous, Lukas Frickenstein, Pierpaolo Morì, Manoj Rohit Vemparala, Alexander Frickenstein, Ulf Schlichtmann, Walter Stechele Repurposing SAM for User-Defined Semantics Aware Segmentation
Rohit Kundu, Sudipta Paul, Arindam Dutta, Amit Roy-Chowdhury Rethinking the Role of Spatial Mixing
George Cazenavette, Joel Julin, Simon Lucey Revisiting Multi-Modal LLM Evaluation
Jian Lu, Shikhar Srivastava, Junyu Chen, Robik Shrestha, Manoj Acharya, Kushal Kafle, Christopher Kanan Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models
Jierun Chen, Fangyun Wei, Jinjing Zhao, Sizhe Song, Bohuai Wu, Zhuoxuan Peng, S.-H. Gary Chan, Hongyang Zhang RGB Photo Enhancement on Mobile GPUs, Mobile AI 2025 Challenge: Report
Andrey Ignatov, Georgy Perevozchikov, Radu Timofte, Wu Pan, Song Wang, Dong Zhang, Zhao Ran, Xiaochen Li, Shichang Ju, Diankai Zhang, Biao Wu, Shaoli Liu, Si Gao, Chengjian Zheng, Ning Wang, Yi Feng, Cailu Wan, Xiangji Wu, Hailong Yan, Ao Li, Xiangtao Zhang, Zhe Liu, Ce Zhu, Le Zhang, Jinjie Zhou, Yang Lu, Feng Duo, Runhua Deng, Xuanyu Chen, Shuhui Xie, Guojie Xiao, Zhifeng Wang, Long Peng, Aiwen Jiang Robustness Evaluation for Video Models with Reinforcement Learning
Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Sahand Ghorbanpour, Avisek Naug, Antonio Guillen, Ricardo Luna, Soumyendu Sarkar S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications
Elías Masquil, Roger Marí, Thibaud Ehret, Enric Meinhardt-Llopis, Pablo Musé, Gabriele Facciolo S2p-Hd: GPU-Accelerated Binocular Stereo Pipeline for Large-Scale Same-Date Stereo
Tristan Amadei, Enric Meinhardt-Llopis, Carlo de Franchis, Jérémy Anger, Thibaud Ehret, Gabriele Facciolo SAM4EM: Efficient Memory-Based Two Stage Prompt-Free Segment Anything Model Adapter for Complex 3D Neuroscience Electron Microscopy Stacks
Uzair Shah, Marco Agus, Daniya Boges, Vanessa Chiappini, Mahmood Alzubaidi, Jens Schneider, Markus Hadwiger, Pierre J. Magistretti, Mowafa S. Househ, Corrado Calì SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos
Joshua Li, Fernando Jose Pena Cantu, Emily Yu, Alexander Wong, Yuchen Cui, Yuhao Chen SC-NeRF: NeRF-Based Point Cloud Reconstruction Using a Stationary Camera for Agricultural Applications
Kibon Ku, Talukder Z. Jubery, Elijah Rodriguez, Aditya Balu, Soumik Sarkar, Adarsh Krishnamurthy, Baskar Ganapathysubramanian Scale-Invariant Implicit Neural Representations for Object Counting
Siyuan Xu, Yucheng Wang, Xihaier Luo, Byung-Jun Yoon, Xiaoning Qian Scaling Laws in Zero-Shot Gender Classification Using CLIP
Lucas M. Ceschini, Gabriel de Oliveira Ramos, Cláudio R. Jung Scaling On-Device GPU Inference for Large Generative Models
Jiuqiang Tang, Raman Sorokin, Ekaterina Ignasheva, Grant Jensen, Lin Chen, Juhyun Lee, Andrei Kulik, Matthias Grundmann ScoreCAM++: Gated Score-Weighted Visual Explanations for CNNs
Soham Mitra, Atri Sukul, Swalpa Kumar Roy, Pravendra Singh, Vinay Kumar Verma Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions
Yifei Dong, Fengyi Wu, Sanjian Zhang, Guangyu Chen, Yuzhi Hu, Masumi Yano, Jingdong Sun, Siyu Huang, Feng Liu, Qi Dai, Zhi-Qi Cheng Seeing like a Cephalopod: Colour Vision with a Monochrome Event Camera
Sami Arja, Nimrod Kruger, Alexandre Marcireau, Nicholas Owen Ralph, Saeed Afshar, Gregory Cohen Segment Any Primitive: Zero-Shot 3D Primitive Segmentation from Point Cloud
Yushan Bai, Shaohu Wang, Rongtao Xu, Yuchuang Tong, Chaoran Xu, Zhengtao Zhang Segment AnyNeuron
Taha Razzaq, Ahmed Rashid Qazi, Asim Iqbal Self-Supervised Pretraining for Fine-Grained Plankton Recognition
Joona Kareinen, Tuomas Eerola, Kaisa Kraft, Lasse Lensu, Sanna Suikkanen, Heikki Kälviäinen Semantic Matters: Multimodal Features for Affective Analysis
Tobias Hallmen, Robin-Nico Kampa, Fabian Deuser, Norbert Oswald, Elisabeth André Semantic-Aware Local Image Editing with a Single Mask Operation
Dongchao Wen, Zijian Chen, Weihong Deng, Yujiang Tian, Hongzhi Shi, Yingjie Zhang, Xingchen Cui, Jian Zhao, Lingyan Liang, Mei Wang Shopformer: Transformer-Based Framework for Detecting Shoplifting via Human Pose
Narges Rashvand, Ghazal Alinezhad Noghre, Armin Danesh Pazho, Babak Rahimi Ardabili, Hamed Tabkhi Short-Term 3D Human Mesh Recovery with Virtual Markers Disentanglement
Xiyuan Kang, Yi Yuan, Xu Dong, Muhammad Awais, Lilian Tang, Josef Kittler, Zhenhua Feng SILK: Smooth InterpoLation frameworK for Motion In-Betweening
Elly Akhoundi, Hung Yu Ling, Anup Anand Deshmukh, Judith Bütepage SimCache: Similarity Caching for Efficient VLM-Based Scene Understanding
Surya Selvam, Ravi K. Rajendran, Murugan Sankaradas, Anand Raghunathan, Srimat T. Chakradhar SLRTP2025 Sign Language Production Challenge: Methodology, Results and Future Work
Harry Walsh, Edward Fish, Ozge Mercanoglu Sincan, Mohamed Ilyes Lakhal, Richard Bowden, Neil Fox, Bencie Woll, Kepeng Wu, Zecheng Li, Weichao Zhao, Haodong Wang, Wengang Zhou, Houqiang Li, Shengeng Tang, Jiayi He, Xu Wang, Ruobei Zhang, Yaxiong Wang, Lechao Cheng, Sümeyye Meryem Tasyürek, Tugçe Kiziltepe, Hacer Yalim Keles SoyStageNet: Balancing Accuracy and Efficiency for Real-Time Soybean Growth Stage Detection
Abdellah Lakhssassi, Toqi Tahamid Sarker, Khaled R. Ahmed, Naoufal Lakhssassi, Khalid Meksem Spatio-Temporal State Space Model for Efficient Event-Based Optical Flow
Muhammad Ahmed Humais, Xiaoqian Huang, Hussain M. Sajwani, Sajid Javed, Yahya H. Zweiri Splat-SLAM: Globally Optimized RGB-Only SLAM with 3D Gaussians
Erik Sandström, Ganlin Zhang, Keisuke Tateno, Michael Oechsle, Michael Niemeyer, Youmin Zhang, Manthan Patel, Luc Van Gool, Martin R. Oswald, Federico Tombari SplatMesh: Interactive 3D Segmentation and Editing Using Mesh-Based Gaussian Splatting
Kaichen Zhou, Lanqing Hong, Xinhai Chang, Yingji Zhong, Enze Xie, Hao Dong, Zhihao Li, Yongxin Yang, Zhenguo Li, Wei Zhang SplatTouch: Explicit 3D Representation Binding Vision and Touch
Antonio Luigi Stefani, Niccolò Bisagno, Nicola Conci, Francesco G. B. De Natale SSL4Eco: A Global Seasonal Dataset for Geospatial Foundation Models in Ecology
Elena Plekhanova, Damien Robert, Johannes Dollinger, Emilia Arens, Philipp Brun, Jan Dirk Wegner, Niklaus E. Zimmermann Stochastic-Based Patch Filtering for Few-Shot Learning
Javier Ródenas Cumplido, Eduardo Aguilar, Petia Radeva STRRNet: Semantics-Guided Two-Stage Raindrop Removal Network
Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinlong Li, Mengfei Han Syn3DTxt: Embedding 3D Cues for Scene Text Generation
Li-Syun Hsiung, Jun-Kai Tu, Kuan-Wu Chu, Yu-Hsuan Chiu, Yan-Tsung Peng, Sheng-Luen Chung, Gee-Sern Hsu T-SAM: Transductive Learning for Segment Anything Model
Rangel Daroya, Deepak Chandran, Subhransu Maji, Andrea Fanelli Task-Agnostic Attacks Against Vision Foundation Models
Brian Pulfer, Yury Belousov, Vitaliy Kinakh, Teddy Furon, Slava Voloshynovskiy TerraMesh: A Planetary Mosaic of Multimodal Earth Observation Data
Benedikt Blumenstiel, Paolo Fraccaro, Valerio Marsocci, Johannes Jakubik, Stefano Maurogiovanni, Mikolaj Czerkawski, Rocco Sedona, Gabriele Cavallaro, Thomas Brunschwiler, Juan Bernabé-Moreno, Nicolas Longépé TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark
Forouzan Fallah, Maitreya Patel, Agneet Chatterjee, Vlad I. Morariu, Chitta Baral, Yezhou Yang Texture2LoD3: Enabling LoD3 Building Reconstruction with Panoramic Images
Wenzhao Tang, Weihang Li, Xiucheng Liang, Olaf Wysocki, Filip Biljecki, Christoph Holst, Boris Jutzi The Fourth Monocular Depth Estimation Challenge
Anton Obukhov, Matteo Poggi, Fabio Tosi, Ripudaman Singh Arora, Jaime Spencer, Chris Russell, Simon Hadfield, Richard Bowden, Shuaihang Wang, Zhenxin Ma, Weijie Chen, Baobei Xu, Fengyu Sun, Di Xie, Jiang Zhu, Mykola Lavreniuk, Haining Guan, Qun Wu, Yupei Zeng, Chao Lu, Huanran Wang, GuangYuan Zhou, Haotian Zhang, Jianxiong Wang, Qiang Rao, Chunjie Wang, Xiao Liu, Zhiqiang Lou, Hualie Jiang, Yihao Chen, Rui Xu, Minglang Tan, Zihan Qin, Yifan Mao, Jiayang Liu, Jialei Xu, Yifan Yang, Wenbo Zhao, Junjun Jiang, Xianming Liu, Mingshuai Zhao, Anlong Ming, Wu Chen, Feng Xue, Mengying Yu, Shida Gao, Xiangfeng Wang, Gbenga Omotara, Ramy Farag, Jacket Demby's, Seyed Mohamad Ali Tousi, Guilherme N. DeSouza, Tuan-Anh Yang, Minh-Quang Nguyen, Thien-Phuc Tran, Albert Luginov, Muhammad Shahzad The Tenth NTIRE 2025 Image Denoising Challenge Report
Lei Sun, Hang Guo, Bin Ren, Luc Van Gool, Radu Timofte, Yawei Li Thermal Image Super-Resolution Challenge Results - PBVS 2025
Rafael E. Rivadeneira, Ángel D. Sappa, Riad I. Hammoud, Jiyong Rao, Hang Zhong, Yu Wang, Shengjie Zhao, Zhiwei Zhong, Yung-Hui Li, Shiqi Wang, Qiangqiang Shen, Hanzhang Wang, Xuanqi Zhang ToF-360 - A Panoramic Time-of-Flight RGB-D Dataset for Single Capture Indoor Semantic 3D Reconstruction
Hideaki Kanayama, Mahdi Chamseddine, Suresh Guttikonda, So Okumura, Soichiro Yokota, Didier Stricker, Jason R. Rambach Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking
Huu-Loc Tran, Tinh-Anh Nguyen-Nhu, Huu-Phong Phan-Nguyen, Tien-Huy Nguyen, Nhat-Minh Nguyen-Dich, Anh Dao, Huy-Duc Do, Quan Nguyen, Hoang M. Le, Quang-Vinh Dinh Towards Evaluating the Robustness of Visual State Space Models
Hashmat Shadab Malik, Fahad Shamshad, Muzammal Naseer, Karthik Nandakumar, Fahad Shahbaz Khan, Salman Khan Towards Faster and More Compact Foundation Models for Molecular Property Prediction
Yasir Ghunaim, Andrés Villa, Gergo Ignacz, Gyorgy Szekely, Motasem Alfarra, Bernard Ghanem Towards Low-Latency Event-Based Obstacle Avoidance on a FPGA-Drone
Pietro Bonazzi, Christian Vogt, Michael Jost, Lyes Khacef, Federico Paredes-Vallés, Michele Magno Towards Synthetic Concept Activation Vectors via Generative Models
Riccardo Campi, Santiago Borrego, Antonio De Santis, Matteo Bianchi, Andrea Tocchetti, Marco Brambilla Training Data Reconstruction: Privacy Due to Uncertainty?
Christina Runkel, Kanchana Vaishnavi Gandikota, Jonas Geiping, Carola-Bibiane Schönlieb, Michael Moeller Training Neural Networks on RAW and HDR Images for Restoration Tasks
Andrew Yanzhe Ke, Lei Luo, Xiaoyu Xiang, Yuchen Fan, Rakesh Ranjan, Alexandre Chapiro, Rafal Mantiuk TT3D: Table Tennis 3D Reconstruction
Thomas Gossard, Andreas Ziegler, Andreas Zell Turin3D: Evaluating Adaptation Strategies Under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques
Luca Barco, Giacomo Blanco, Gaetano Chiriaco, Alessia Intini, Luigi La Riccia, Vittorio Scolamiero, Piero Boccardo, Paolo Garza, Fabrizio Dominici Two Views Are Better than One: Monocular 3D Pose Estimation with Multiview Consistency
Christian Keilstrup Ingwersen, Rasmus Tirsgaard, Rasmus Nylander, Janus Nørtoft Jensen, Anders Bjorholm Dahl, Morten Rieger Hannemose U-Shape Mamba: State Space Model for Faster Diffusion
Alex Ergasti, Filippo Botti, Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati Uncertainty Aware Training to Improve Uncertainty Active Learning for Semantic Segmentation
Moritz Thoma, Tobias Preintner, Emad Aghajanzadeh, Shambhavi Balamuthu Sampath, Pierpaolo Morì, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Daniel Mueller-Gritschneder, Ulf Schlichtmann Understanding the Effect of Using Semantically Meaningful Tokens for Visual Representation Learning
Neha Mukund Kalibhat, Priyatham Kattakinda, Sumit Nawathe, Arman Zarei, Nikita Seleznev, Samuel Sharpe, Senthil Kumar, Soheil Feizi V3LMA: Visual 3D-Enhanced Language Model for Autonomous Driving
Jannik Lübberstedt, Esteban Rivera, Nico Uhlemann, Markus Lienkamp Virtual Pose Coach: A Motion-Retargeting Approach for Pose Training
Tzu-Chun Chiu, Ming-Han Lee, Kun-Ru Wu, Yu-Shuen Wang, Yu-Chee Tseng Visual Question Answering on Multiple Remote Sensing Image Modalities
Hichem Boussaid, Lucrezia Tosato, Flora Weissgerber, Camille Kurtz, Laurent Wendling, Sylvain Lobry VNL-STES: A Benchmark Dataset and Model for Spatiotemporal Event Spotting in Volleyball Analytics
Hoang Quoc Nguyen, Ankhzaya Jamsrandorj, Vanyi Chao, Yin May Oo, Muhammad Amrulloh Robbani, Kyung-Ryoul Mun, Jinwook Kim Vocabulary-Free Few-Shot Learning for Vision-Language Models
Maxime Zanella, Clément Fuchs, Ismail Ben Ayed, Christophe De Vleeschouwer VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos
Bao Tran Gia, Khiem Le, Tien Do, Tien-Dung Mai, Thanh Duc Ngo, Duy-Dinh Le, Shin'ichi Satoh What Is the Added Value of UDA in the VFM Era?
Brunó Bence Englert, Tommie Kerssies, Gijs Dubbelman What Makes for a Good Stereoscopic Image?
Netanel Tamir, Shir Amir, Ranel Itzhaky, Noam Atia, Shobhita Sundaram, Stephanie Fu, Ron Sokolovsky, Phillip Isola, Tali Dekel, Richard Zhang, Miriam Farber Wheat3DGS: In-Field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting
Daiwei Zhang, Joaquin Gajardo, Tomislav Medic, Isinsu Katircioglu, Mike Boss, Norbert Kirchgeßner, Achim Walter, Lukas Roth Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models
Yuxiang Lin, Jingdong Sun, Zhi-Qi Cheng, Jue Wang, Haomin Liang, Zebang Cheng, Yifei Dong, Jun-Yan He, Xiaojiang Peng, Xian-Sheng Hua Window Token Concatenation for Efficient Visual Large Language Models
Yifan Li, Wentao Bao, Botao Ye, Zhen Tan, Tianlong Chen, Huan Liu, Yu Kong