CVPRW 2024

800 papers

'Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning Joshua Feinglass, Jayaraman J. Thiagarajan, Rushil Anirudh, T. S. Jayram, Yezhou Yang
PDF
(Street) Lights Will Guide You: Georeferencing Nighttime Astronaut Photography of Earth Alex Stoken, Peter Ilhardt, Mark Lambert, Kenton Fisher
2T-UNET: A Two-Tower UNet with Depth Clues for Robust Stereo Depth Estimation Mansi Sharma, Rohit Choudhary, Rithvik Anil
PDF
3D Clothed Human Reconstruction from Sparse Multi-View Images Jin Gyu Hong, Seung Young Noh, Hee Kyung Lee, Won-Sik Cheong, Ju Yong Chang
3D Human Pose Estimation with Occlusions: Introducing BlendMimic3D Dataset and GCN Refinement Filipa Lino, Carlos Santiago, Manuel Marques
PDF
3D Human Scan with a Moving Event Camera Kai Kohyama, Shintaro Shiba, Yoshimitsu Aoki
PDF
3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin, Bofan Lyu, Judith Cueto Fernandez, Eline van der Kruk, Ajay Seth, Xucong Zhang
PDF
A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection Chih-Chung Hsu, Chia-Ming Lee, Yang Fan Chiang, Yi-Shiuan Chou, Chih-Yu Jiang, Shen-Chieh Tai, Chi-Han Tsai
PDF
A Coarse-to-Fine Two-Stage Helmet Detection Method for Motorcyclists Hongpu Zhang, Zhe Cui, Fei Su
A Comparative Analysis of Implicit Augmentation Techniques for Breast Cancer Diagnosis Using Multiple Views Yumnah Hasan, Talhat Khan, Darian Reyes Fernández de Bulnes, Juan F. H. Albarracín, Conor Ryan
A Comprehensive Analysis of Factors Impacting Membership Inference Daniel DeAlcala, Gonzalo Mancera, Aythami Morales, Julian Fiérrez, Ruben Tolosana, Javier Ortega-Garcia
A Cross-Dataset Study for Text-Based 3D Human Motion Retrieval Léore Bensabath, Mathis Petrovich, Gül Varol
PDF
A Deep Biclustering Framework for Brain Network Analysis Md Abdur Rahaman, Zening Fu, Armin Iraji, Vince D. Calhoun
A Dual-Mode Approach for Vision-Based Navigation in a Lunar Landing Scenario Luca Ostrogovich, Roberto Del Prete, Giuseppe Tomasicchio, Nicolas Longépé, Alfredo Renga
A General Framework for Jersey Number Recognition in Sports Video Maria Koshkina, James H. Elder
PDF
A Generative Exploration of Cuisine Transfer Philip Wootaek Shin, Ajay Narayanan Sridhar, Jack Sampson, Vijaykrishnan Narayanan
A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception Asude Aydin, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza
PDF
A Lightweight Spatiotemporal Network for Online Eye Tracking with Event Camera Yan Ru Pei, Sasskia Brüers, Sébastien M. Crouzet, Douglas McLelland, Olivier Coenen
PDF
A Method of Moments Embedding Constraint and Its Application to Semi-Supervised Learning Michael Majurski, Sumeet Menon, Parniyan Farvardin, David Chapman
PDF
A Multimodal Approach Integrating Convolutional and Recurrent Neural Networks for Alzheimer's Disease Temporal Progression Prediction Durga Supriya Hl, Swetha Mary Thomas, Sowmya Kamath S
A Perspective on Deep Vision Performance with Standard Image and Video Codecs Christoph Reich, Oliver Hahn, Daniel Cremers, Stefan Roth, Biplob Debnath
PDF
A Review and Efficient Implementation of Scene Graph Generation Metrics Julian Lorenz, Robin Schön, Katja Ludwig, Rainer Lienhart
PDF
A Robust Online Multi-Camera People Tracking System with Geometric Consistency and State-Aware Re-ID Correction Zhenyu Xie, Zelin Ni, Wenjie Yang, Yuang Zhang, Yihang Chen, Yang Zhang, Xiao Ma
A Stroke of Genius: Predicting the Next Move in Badminton Magnus Ibh, Stella Graßhof, Dan Witzner Hansen
PDF
A Survey of Video Datasets for Grounded Event Understanding Kate Sanders, Benjamin Van Durme
PDF
A Survey on 3D Egocentric Human Pose Estimation Md Mushfiqur Azam, Kevin Desai
PDF
A Universal Protocol to Benchmark Camera Calibration for Sports Floriane Magera, Thomas Hoyoux, Olivier Barnich, Marc Van Droogenbroeck
PDF
A Visualization Method for Data Domain Changes in CNN Networks and the Optimization Method for Selecting Thresholds in Classification Tasks Minzhe Huang, Changwei Nie, Weihong Zhong
PDF
AAPL: Adding Attributes to Prompt Learning for Vision-Language Models Gahyeon Kim, Sohee Kim, Seokju Lee
PDF
ABC-CapsNet: Attention Based Cascaded Capsule Network for Audio Deepfake Detection Taiba Majid Wani, Reeva Gulzar, Irene Amerini
PDF
Achieving Reliable and Fair Skin Lesion Diagnosis via Unsupervised Domain Adaptation Janet Wang, Yunbei Zhang, Zhengming Ding, Jihun Hamm
PDF
Active Data Collection and Management for Real-World Continual Learning via Pretrained Oracle Vivek Chavan, Paul Koch, Marian Schlüter, Clemens Briese, Jörg Krüger
Active Transferability Estimation Tarun Ram Menta, Surgan Jandial, Akash Patil, Saketh Bachu, K. B. Vimal, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Mausoom Sarkar, Chirag Agarwal
Adapting the Segment Anything Model During Usage in Novel Situations Robin Schön, Julian Lorenz, Katja Ludwig, Rainer Lienhart
PDF
Adaptive Memory Replay for Continual Learning James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogério Feris, Zsolt Kira, Leonid Karlinsky
PDF
Adaptive Render-Video Streaming for Virtual Environments Jia-Jie Lim, Matthias Sebastian Treder, Aaron Chadha, Yiannis Andreopoulos
Advanced Facial Analysis in Multi-Modal Data with Cascaded Cross-Attention Based Transformer Jun-Hwa Kim, Namho Kim, Minsoo Hong, Chee Sun Won
Advancing Brain Tumor Analysis: Curating a High-Quality MRI Dataset for Deep Learning-Based Molecular Marker Profiling Divya D. Reddy, Niloufar Saadat, James M. Holcomb, Benjamin C. Wagner, Nghi C. Truong, Jason Bowerman, Kimmo J. Hatanpaa, Toral R. Patel, Marco C. Pinho, Ananth J. Madhuranthakam, Chandan Ganesh Bangalore Yogananda, Joseph A. Maldjian
Advancing COVID-19 Detection in 3D CT Scans Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen
PDF
Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics Hyojin Kim, Jiyoon Lee, Yonghyun Jeong, Haneol Jang, Youngjoon Yoo
PDF
AdvDenoise: Fast Generation Framework of Universal and Robust Adversarial Patches Using Denoise Jing Li, Zigan Wang, Jinliang Li
Adversarial Identity Injection for Semantic Face Image Synthesis Giuseppe Tarollo, Tomaso Fontanini, Claudio Ferrari, Guido Borghi, Andrea Prati
Affine-Based Deformable Attention and Selective Fusion for Semi-Dense Matching Hongkai Chen, Zixin Luo, Yurun Tian, Xuyang Bai, Ziyu Wang, Lei Zhou, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan
PDF
AffordanceLLM: Grounding Affordance from Vision Language Models Shengyi Qian, Weifeng Chen, Min Bai, Xiong Zhou, Zhuowen Tu, Li Erran Li
PDF
AgileGAN3D: Few-Shot 3D Portrait Stylization by Augmented Transfer Learning Guoxian Song
PDF
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art Faizan Farooq Khan, Diana Kim, Divyansh Jha, Youssef Mohamed, Hanna H. Chang, Ahmed Elgammal, Luba Elliott, Mohamed Elhoseiny
PDF
AIGC Image Quality Assessment via Image-Prompt Correspondence Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen
AIGC-VQA: A Holistic Perception Metric for AIGC Video Quality Assessment Yiting Lu, Xin Li, Bingchen Li, Zihao Yu, Fengbin Guan, Xinrui Wang, Ruling Liao, Yan Ye, Zhibo Chen
AIGeN: An Adversarial Approach for Instruction Generation in VLN Niyati Rawal, Roberto Bigazzi, Lorenzo Baraldi, Rita Cucchiara
PDF
AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment Chunyi Li, Tengchuan Kou, Yixuan Gao, Yuqin Cao, Wei Sun, Zicheng Zhang, Yingjie Zhou, Zhichao Zhang, Weixia Zhang, Haoning Wu, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai
PDF
AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai, Wei Sun, Yuqin Cao, Yanwei Jiang, Jun Jia, Zhichao Zhang, Zijian Chen, Weixia Zhang, Xiongkuo Min, Steve Göring, Zihao Qi, Chen Feng
PDF
ALINA: Advanced Line Identification and Notation Algorithm Mohammed Abdul Hafeez Khan, Parth Ganeriwala, Siddhartha Bhattacharyya, Natasha A. Neogi, Raja Muthalagu
PDF
An Analysis of Best-Practice Strategies for Replay and Rehearsal in Continual Learning Alexander Krawczyk, Alexander Gepperth
An Effective Ensemble Learning Framework for Affective Behaviour Analysis Wei Zhang, Feng Qiu, Chen Liu, Lincheng Li, Heming Du, Tianchen Guo, Xin Yu
An Effective Method for Detecting Violation of Helmet Rule for Motorcyclists Yunliang Chen, Wei Zhou, Zicen Zhou, Bing Ma, Chen Wang, Yingda Shang, An Guo, Tianshu Chu
An Empty Room Is All We Want: Automatic Defurnishing of Indoor Panoramas Mira Slavcheva, Dave Gausebeck, Kevin Chen, David Buchhofer, Azwad Sabik, Chen Ma, Sachal Dhillon, Olaf Brandt, Alan Dolhasz
PDF
An End-to-End Approach for Handwriting Recognition: From Handwritten Text Lines to Complete Pages Dayvid Castro, Byron Leite Dantas Bezerra, Cleber Zanchettin
An End-to-End Vision Transformer Approach for Image Copy Detection Jiahe Steven Lee, Wynne Hsu, Mong-Li Lee
An Investigation into the Impact of AI-Powered Image Enhancement on Forensic Facial Recognition Justin Norman, Hany Farid
An Online Approach and Evaluation Method for Tracking People Across Cameras in Extremely Long Video Sequence Cheng-Yen Yang, Hsiang-Wei Huang, Pyong-Kun Kim, Zhongyu Jiang, Kwang-Ju Kim, Chung-I Huang, Haiqing Du, Jenq-Neng Hwang
Analyzing Participants' Engagement During Online Meetings Using Unsupervised Remote Photoplethysmography with Behavioral Features Alexander Vedernikov, Zhaodong Sun, Virpi-Liisa Kykyri, Mikko Pohjola, Miriam Nokia, Xiaobai Li
PDF
Analyzing the Internals of Neural Radiance Fields Lukas Radl, Andreas Kurz, Michael Steiner, Markus Steinberger
PDF
AnimalFormer: Multimodal Vision Framework for Behavior-Based Precision Livestock Farming Ahmed Rashid Qazi, Taha Razzaq, Asim Iqbal
PDF
AR-CP: Uncertainty-Aware Perception in Adverse Conditions with Conformal Prediction and Augmented Reality for Assisted Driving Achref Doula, Max Mühlhäuser, Alejandro Sánchez Guinea
Are Deep Learning Models Pre-Trained on RGB Data Good Enough for RGB-Thermal Image Retrieval? Amulya Pendota, Sumohana S. Channappayya
Are NeRFs Ready for Autonomous Driving? Towards Closing the Real-to-Simulation Gap Carl Lindström, Georg Hess, Adam Lilja, Maryam Fatemi, Lars Hammarstrand, Christoffer Petersson, Lennart Svensson
PDF
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong
Assessing the Performance of Efficient Face Anti-Spoofing Detection Against Physical and Digital Presentation Attacks Luis S. Luevano, Yoanna Martínez-Díaz, Heydi Méndez-Vázquez, Miguel González-Mendoza, Davide Frey
PDF
AsymFormer: Asymmetrical Cross-Modal Representation Learning for Mobile Platform Real-Time RGB-D Semantic Segmentation Siqi Du, Weixi Wang, Renzhong Guo, Ruisheng Wang, Shengjun Tang
PDF
ATOM: Attention Mixer for Efficient Dataset Distillation Samir Khaki, Ahmad Sajedi, Kai Wang, Lucy Z. Liu, Yuri A. Lawryshyn, Konstantinos N. Plataniotis
PDF
Attention Guidance Distillation Network for Efficient Image Super-Resolution Hongyuan Wang, Ziyan Wei, Qingting Tang, Shuli Cheng, Liejun Wang, Yongming Li
AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts Jun Yu, Zerui Zhang, Zhihong Wei, Gongpeng Zhao, Zhongpeng Cai, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qingsong Liu, Jiaen Liang
PDF
Audio Provenance Analysis in Heterogeneous Media Sets Milica Gerhardt, Luca Cuccovillo, Patrick Aichroth
PDF
Audio Transformer for Synthetic Speech Detection via Multi-Formant Analysis Luca Cuccovillo, Milica Gerhardt, Patrick Aichroth
PDF
Audio-Visual Generalized Zero-Shot Learning Using Pre-Trained Large Multi-Modal Models David Kurzendörfer, Otniel-Bogdan Mercea, A. Sophia Koepke, Zeynep Akata
PDF
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Seymanur Akti, Hazim Kemal Ekenel, Alexander Waibel
PDF
AugData Distillation for Monocular 3D Human Pose Estimation Jiman Kim
Augmented Self-Mask Attention Transformer for Naturalistic Driving Action Recognition Tiantian Zhang, Qingtian Wang, Xiaodong Dong, Wenqing Yu, Hao Sun, Xuyang Zhou, Aigong Zhen, Shun Cui, Dong Wu, Zhongjiang He
Augmenting Pass Prediction via Imitation Learning in Soccer Simulations Takeshi Kaneko, Rei Kawakami, Takeshi Naemura, Nakamasa Inoue
Automatic Recognition of Food Ingestion Environment from the AIM-2 Wearable Sensor Yuning Huang, M. A. Hassan, Jiangpeng He, Janine A. Higgins, Megan A. McCrory, Heather A. Eicher-Miller, J. Graham Thomas, Edward Sazonov, Fengqing Zhu
PDF
AutoSoccerPose: Automated 3D Posture Analysis of Soccer Shot Movements Calvin Yeung, Kenjiro Ide, Keisuke Fujii
BAA-NGP: Bundle-Adjusting Accelerated Neural Graphics Primitives Sainan Liu, Shan Lin, Jingpei Lu, Alexey Supikov, Michael C. Yip
PDF
Benchmarking Robustness in Neural Radiance Fields Chen Wang, Angtian Wang, Junbo Li, Alan L. Yuille, Cihang Xie
PDF
Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity Zhenlin Xu, Yi Zhu, Siqi Deng, Abhay Mittal, Yanbei Chen, Manchen Wang, Paolo Favaro, Joseph Tighe, Davide Modolo
PDF
Beyond Appearances: Material Segmentation with Embedded Spectral Information from RGB-D Imagery Fabian Perez, Hoover Rueda-Chacon
Beyond Deepfake Images: Detecting AI-Generated Videos Danial Samadi Vahdati, Tai D. Nguyen, Aref Azizpour, Matthew C. Stamm
PDF
Beyond Respiratory Models: A Physics-Enhanced Synthetic Data Generation Method for 2D-3D Deformable Registration François Lecomte, Pablo Alvarez, Stéphane Cotin, Jean-Louis Dillenseger
PDF
Beyond the Premier: Assessing Action Spotting Transfer Capability Across Diverse Domains Bruno Cabado, Anthony Cioppa, Silvio Giancola, Andrés Villa, Bertha Guijarro-Berdiñas, Emilio J. Padrón, Bernard Ghanem, Marc Van Droogenbroeck
PDF
Beyond the Screen: Evaluating Deepfake Detectors Under Moiré Pattern Effects Razaib Tariq, Minji Heo, Simon S. Woo, Shahroz Tariq
BGDNet: Background-Guided Indoor Panorama Depth Estimation Jiajing Chen, Zhiqiang Wan, Manjunath Narayana, Yuguang Li, Will Hutchcroft, Senem Velipasalar, Sing Bing Kang
BigEPIT: Scaling EPIT for Light Field Image Super-Resolution Wentao Chao, Yiming Kan, Xuechun Wang, Fuqing Duan, Guanghui Wang
BiMAE - A Bimodal Masked Autoencoder Architecture for Single-Label Hyperspectral Image Classification Maksim Kukushkin, Martin Bogdan, Thomas Schmid
Blind Localization and Clustering of Anomalies in Textures Andrei-Timotei Ardelean, Tim Weyrich
PDF
Block Selective Reprogramming for On-Device Training of Vision Transformers Sreetama Sarkar, Souvik Kundu, Kai Zheng, Peter A. Beerel
PDF
Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid Thanh-Huy Nguyen, Thi Kim Ngan Ngo, Mai Anh Vu, Ting-Yuan Tu
PDF
BMAD: Benchmarks for Medical Anomaly Detection Jinan Bao, Hanshi Sun, Hanqiu Deng, Yinsheng He, Zhaoxiang Zhang, Xingyu Li
PDF
BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects Tomas Hodan, Martin Sundermeyer, Yann Labbé, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas
PDF
Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition Genggeng Chen, Kexin Dai, Kangzhen Yang, Tao Hu, Xiangyu Chen, Yongqing Yang, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
PDF
Bridging Domains in Melanoma Diagnostics: Predicting BRAF Mutations and Sentinel Lymph Node Positivity with Attention-Based Models in Histological Images Carlos Hernández-Pérez, Lauren Jimenez-Martin, Verónica Vilaplana
PDF
Building Secure and Engaging Video Communication by Using Monitor Illumination Jun Myeong Choi, Johnathan Chi-Ho Leung, Noah Frahm, Max Christman, Gedas Bertasius, Roni Sengupta
Burst Image Super-Resolution with Base Frame Selection Sanghyun Kim, Min Jung Lee, Woohyeok Kim, Deunsol Jung, Jaesung Rim, Sunghyun Cho, Minsu Cho
PDF
CaBins: CLIP-Based Adaptive Bins for Monocular Depth Estimation Eunjin Son, Sang Jun Lee
Cache and Reuse: Rethinking the Efficiency of On-Device Transfer Learning Yuedong Yang, Hung-Yueh Chiang, Guihong Li, Diana Marculescu, Radu Marculescu
CAFF-DINO: Multi-Spectral Object Detection Transformers with Cross-Attention Features Fusion Kevin Helvig, Baptiste Abeloos, Pauline Trouvé-Peloux
PDF
CAGE: Circumplex Affect Guided Expression Inference Niklas Wagner, Felix Mätzler, Samed Rouven Vossberg, Helen Schneider, Svetlana Pavlitska, J. Marius Zöllner
PDF
Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-Trained Vision Transformers Dipam Goswami, Bartlomiej Twardowski, Joost van de Weijer
PDF
Calibration of Continual Learning Models Lanpei Li, Elia Piccoli, Andrea Cossu, Davide Bacciu, Vincenzo Lomonaco
PDF
Camera Motion Estimation from RGB-D-Inertial Scene Flow Samuel Cerezo, Javier Civera
PDF
Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics Shan Jia, Reilin Lyu, Kangran Zhao, Yize Chen, Zhiyuan Yan, Yan Ju, Chuanbo Hu, Xin Li, Baoyuan Wu, Siwei Lyu
PDF
Can Synthetic Plant Images from Generative Models Facilitate Rare Species Identification and Classification? Debajyoti Dasgupta, Arijit Mondal, Partha Pratim Chakraborty
Can the Accuracy Bias by Facial Hairstyle Be Reduced Through Balancing the Training Data? Kagan Öztürk, Haiyu Wu, Kevin W. Bowyer
PDF
CASR: Efficient Cascade Network Structure with Channel Aligned Method for 4K Real-Time Single Image Super-Resolution Kihwan Yoon, Ganzorig Gankhuyag, Jinman Park, Haengseon Son, Kyoungwon Min
CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery Sai Bhargav Rongali, Sarthak Mehrotra, Ankit Jha, N C Mohamad Hassan, Shirsha Bose, Tanisha Gupta, Mainak Singha, Biplab Banerjee
PDF
CenterPoint Transformer for BEV Object Detection with Automotive Radar Loveneet Saini, Yu Su, Hasan Tercan, Tobias Meisen
Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs Jonathan Roberts, Timo Lüddecke, Rehan Sheikh, Kai Han, Samuel Albanie
PDF
ChatVTG: Video Temporal Grounding via Chat with Video Dialogue Large Language Models Mengxue Qu, Xiaodong Chen, Wu Liu, Alicia Li, Yao Zhao
CityLLaVA: Efficient Fine-Tuning for VLMs in City Scenario Zhizhao Duan, Hao Cheng, Duo Xu, Xi Wu, Xiangxie Zhang, Xi Ye, Zhen Xie
PDF
Class Similarity Transition: Decoupling Class Similarities and Imbalance from Generalized Few-Shot Segmentation Shihong Wang, Ruixun Liu, Kaiyu Li, Jiawei Jiang, Xiangyong Cao
PDF
Class-Incremental Mixture of Gaussians for Deep Continual Learning Lukasz Korycki, Bartosz Krawczyk
Classification of 2D Ultrasound Breast Cancer Images with Deep Learning Jack Ellis, Kofi Appiah, Emmanuel Amankwaa-Frempong, Sze Chai Kwok
Classifier Guided Cluster Density Reduction for Dataset Selection Cheng Chang, Keyu Long, Zijian Li, Himanshu Rai
Click, Crop & Detect: One-Click Offline Annotation for Human-in-the-Loop 3D Object Detection on Point Clouds Nitin Kumar Saravana Kannan, Matthias Reuse, Martin Simon
Cluster Self-Refinement for Enhanced Online Multi-Camera People Tracking Jeongho Kim, Wooksu Shin, Hancheol Park, Donghyuk Choi
Cluster Triplet Loss for Unsupervised Domain Adaptation on Histology Images Ruby Wood, Enric Domingo, Viktor Hendrik Koelzer, Timothy S. Maughan, Jens Rittscher
CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels Chi-Hsuan Wu, Shih-Yang Liu, Xijie Huang, Xingbo Wang, Rong Zhang, Luca Minciullo, Wong Kai Yiu, Kenny Kwan, Kwang-Ting Cheng
PDF
Co-Designing a Sub-Millisecond Latency Event-Based Eye Tracking System with Submanifold Sparse CNN Baoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So
PDF
Coarse or Fine? Recognising Action End States Without Labels Davide Moltisanti, Hakan Bilen, Laura Sevilla-Lara, Frank Keller
PDF
Codebook VQ-VAE Approach for Prostate Cancer Diagnosis Using Multiparametric MRI Ekaterina Redekop, Mara Pleasure, Zichen Wang, Karthik V. Sarma, Adam Kinnaird, William Speier, Corey W. Arnold
CoDISP: Exploring Compressed Domain Camera ISP with RGB-Guided Encoder Molin Zhang, Soumendu Majee, Chengyu Wang, Seok-Jun Lee, Hamid R. Sheikh
CoLa-SDF: Controllable Latent StyleSDF for Disentangled 3D Face Generation Rahul Dey, Bernhard Egger, Vishnu Naresh Boddeti, Ye Wang, Tim K. Marks
Collaborative Blind Image Deblurring Thomas Eboli, Jean-Michel Morel, Gabriele Facciolo
PDF
Collaborative Visual Place Recognition Through Federated Learning Mattia Dutto, Gabriele Moreno Berton, Debora Caldarola, Eros Fanì, Gabriele Trivigno, Carlo Masone
PDF
Color-Cued Efficient Densification Method for 3D Gaussian Splatting Sieun Kim, Kyungjin Lee, Youngki Lee
Comparative Analysis of Generalization and Harmonization Methods for 3D Brain fMRI Images: A Case Study on OpenBHB Dataset Soroosh Safari Loaliyan, Greg Ver Steeg
Complex Style Image Transformations for Domain Generalization in Medical Images Nikolaos Spanos, Anastasios Arsenos, Paraskevi-Antonia Theofilou, Paraskevi K. Tzouveli, Athanasios Voulodimos, Stefanos D. Kollias
PDF
Computational Spectral Imaging with Unified Encoding Model and Beyond Xinyuan Liu, Lingen Li, Lin Zhu, Lizhi Wang
ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang
PDF
CONDA: Continual Unsupervised Domain Adaptation Learning in Visual Perception for Self-Driving Cars Thanh-Dat Truong, Pierce Helton, Ahmed Moustafa, Jackson David Cothren, Khoa Luu
PDF
Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis Zijian Chen, Mei Wang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Xingchen Cui, Jian Zhao
PDF
Conformal Semantic Image Segmentation: Post-Hoc Quantification of Predictive Uncertainty Luca Mossina, Joseba Dalmau, Léo Andéol
PDF
Connecting NeRFs, Images, and Text Francesco Ballerini, Pierluigi Zama Ramirez, Roberto Mirabella, Samuele Salti, Luigi Di Stefano
PDF
ConPro: Learning Severity Representation for Medical Images Using Contrastive Learning and Preference Optimization Hong Nguyen, Hoang Nguyen, Melinda Chang, Hieu Pham, Shrikanth Narayanan, Michael Pazzani
PDF
Content-Aware Input Scaling and Deep Learning Computation Offloading for Low-Latency Embedded Vision Omkar Prabhune, Tianen Chen, Younghyun Kim
Context-Aware Video Anomaly Detection in Long-Term Datasets Zhengye Yang, Richard J. Radke
PDF
Contextualising Implicit Representations for Semantic Tasks Theo W. Costain, Kejie Li, Victor Adrian Prisacariu
PDF
Continual Diffusion with STAMINA: STack-and-Mask INcremental Adapters James Seale Smith, Yen-Chang Hsu, Zsolt Kira, Yilin Shen, Hongxia Jin
PDF
Continual Learning with Weight Interpolation Jedrzej Kozal, Jan Wasilewski, Bartosz Krawczyk, Michal Wozniak
PDF
Continual-Zoo: Leveraging Zoo Models for Continual Classification of Medical Images Nourhan Bayasi, Ghassan Hamarneh, Rafeef Garbi
Contrastive Clothing and Pose Generation for Cloth-Changing Person Re-Identification Vuong D. Nguyen, Pranav Mantini, Shishir K. Shah
Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes Ivica Obadic, Alex Levering, Lars Pennig, Dário A. B. Oliveira, Diego Marcos, Xiaoxiang Zhu
PDF
ControlPolypNet: Towards Controlled Colon Polyp Synthesis for Improved Polyp Segmentation Vanshali Sharma, Abhishek Kumar, Debesh Jha, Manas Kamal Bhuyan, Pradip K. Das, Ulas Bagci
Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets Hao Chen, Ran Tao, Han Zhang, Yidong Wang, Xiang Li, Wei Ye, Jindong Wang, Guosheng Hu, Marios Savvides
PDF
COOD: Combined Out-of-Distribution Detection Using Multiple Measures for Anomaly & Novel Class Detection in Large-Scale Hierarchical Classification Laurens E. Hogeweg, Rajesh Gangireddy, Django Brunink, Vincent J. Kalkman, Ludo Cornelissen, Jacob W. Kamminga
PDF
Coreset Selection for Object Detection Hojun Lee, Suyoung Kim, Junhoo Lee, Jaeyoung Yoo, Nojun Kwak
PDF
COVER: A Comprehensive Video Quality Evaluator Chenlong He, Qi Zheng, Ruoxi Zhu, Xiaoyang Zeng, Yibo Fan, Zhengzhong Tu
Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein, Frédéric Giraud, Lilian Calvet, Alexander Schwarz, Nicola Alessandro Cavalcanti, Sergey Prokudin, Mazda Farshad, Siyu Tang, Marc Pollefeys, Fabio Carrillo, Philipp Fürnstahl
PDF
CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task Kangzhen Yang, Tao Hu, Kexin Dai, Genggeng Chen, Yu Cao, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
PDF
CroSpace6D: Leveraging Geometric and Motion Cues for High-Precision Cross-Domain 6DoF Pose Estimation for Non-Cooperative Spacecrafts Jianhong Zuo, Shengyang Zhang, Qianyu Zhang, Yutao Zhao, Baichuan Liu, Aodi Wu, Xue Wan, Leizheng Shu, Guohua Kang
Cross-Domain Synthetic-to-Real In-the-Wild Depth and Normal Estimation for 3D Scene Understanding Jay Bhanushali, Manivannan Muniyandi, Praneeth Chakravarthula
PDF
Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection Ayush Ghadiya, Purbayan Kar, Vishal M. Chudasama, Pankaj Wasnik
Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn Classification Without Labels Amaya Dharmasiri, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan
PDF
Cross-Sensor Super-Resolution of Irregularly Sampled Sentinel-2 Time Series Aimi Okabayashi, Nicolas Audebert, Simon Donike, Charlotte Pelletier
PDF
Cross-Temporal Spectrogram Autoencoder (CTSAE): Unsupervised Dimensionality Reduction for Clustering Gravitational Wave Glitches Yi Li, Yunan Wu, Aggelos K. Katsaggelos
PDF
Cross-View Aggregation Network for Stereo Image Super-Resolution Zhitao Chen, Tao Lu, Kanghui Zhao, Bolin Zhu, Zhen Li, Jiaming Wang, Yanduo Zhang
CSCO: Connectivity Search of Convolutional Operators Tunhou Zhang, Shiyu Li, Hsin-Pai Cheng, Feng Yan, Hai Li, Yiran Chen
PDF
CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention Damith Chamalke Senadeera, Xiaoyun Yang, Dimitrios Kollias, Gregory G. Slabaugh
PDF
CycleGANAS: Differentiable Neural Architecture Search for CycleGAN Taegun An, Changhee Joo
PDF
DaFF: Dual Attentive Feature Fusion for Multispectral Pedestrian Detection Afnan Althoupety, Li-Yun Wang, Wu-Chi Feng, Banafsheh Rekabdar
Damage Detection and Localization by Learning Deep Features of Elastic Waves in Piezoelectric Ceramic Using Point Contact Method Pragyan Banerjee, Pranjal Saxena, Nur M. M. Kalimullah, Amit Shelke, Anowarul Habib
Data-Efficient and Robust Task Selection for Meta-Learning Donglin Zhan, James Anderson
PDF
Data-Free Defense of Black Box Models Against Adversarial Attacks Gaurav Kumar Nayak, Inder Khatri, Ruchit Rawal, Anirban Chakraborty
PDF
Data-Free Model Fusion with Generator Assistants Luyao Shi, Prashanth Vijayaraghavan, Ehsan Degan
Dataset Condensation with Latent Quantile Matching Wei Wei, Tom De Schepper, Kevin Mets
PDF
DCDR-UNet: Deformable Convolution Based Detail Restoration via U-Shape Network for Single Image HDR Reconstruction Joonsoo Kim, Zhe Zhu, Tien Bau, Chenguang Liu
DCE-Diff: Diffusion Model for Synthesis of Early and Late Dynamic Contrast-Enhanced MR Images from Non-Contrast Multimodal Inputs Kishore Kumar M, Sriprabha Ramanarayanan, Sadhana S, Arunima Sarkar, Matcha Naga Gayathri, Keerthi Ram, Mohanasankar Sivaprakasam
DDOS: The Drone Depth and Obstacle Segmentation Dataset Benedikt Kolbeinsson, Krystian Mikolajczyk
PDF
De-Noised Vision-Language Fusion Guided by Visual Cues for E-Commerce Product Search Zhizhang Hu, Shasha Li, Ming Du, Arnab Dhua, Douglas Gray
DECNet: A Non-Contacting Dual-Modality Emotion Classification Network for Driver Health Monitoring Zhekang Dong, Chenhao Hu, Shiqi Zhou, Liyan Zhu, Junfan Wang, Yi Chen, Xudong Lv, Xiaoyue Ji
Dedicated Inference Engine and Binary-Weight Neural Networks for Lightweight Instance Segmentation Tse-Wei Chen, Wei Tao, Dongyue Zhao, Kazuhiro Mima, Tadayuki Ito, Kinya Osa, Masami Kato
DeDoDe V2: Analyzing and Improving the DeDoDe Keypoint Detector Johan Edstedt, Georg Bökman, Zhenjun Zhao
PDF
Deep Generative Data Assimilation in Multimodal Setting Yongquan Qu, Juan Nathaniel, Shuolin Li, Pierre Gentine
PDF
Deep Learning-Based Identification of Arctic Ocean Boundaries and Near-Surface Phenomena in Underwater Echograms Femina Senjaliya, Melissa Cote, Amanda Dash, Alexandra Branzan Albu, Andrea Niemi, Stéphane Gauthier, Julek Chawarski, Steve Pearce, Kaan Ersahin, Keath Borg
Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey Nicolas Chahine, Marcos V. Conde, Daniela Carfora, Gabriel Pacianotto, Benoit Pochon, Sira Ferradans, Radu Timofte, Zhichao Duan, Xinrui Xu, Yipo Huang, Quan Yuan, Xiangfei Sheng, Zhichao Yang, Leida Li, Haotian Fan, Fangyuan Kong, Yifang Xu, Wei Sun, Weixia Zhang, Yanwei Jiang, Haoning Wu, Zicheng Zhang, Jun Jia, Yingjie Zhou, Zhongpeng Ji, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Xiaoqi Wang, Junqi Liu, Zixi Guo, Yun Zhang, Zewen Chen, Wen Wang, Juan Wang, Bing Li
PDF
Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu, Liangyan Li, Ke Chen, Yunzhe Li, Yimo Ning, Guanhua Zhao, Jun Chen, Jinyang Yu, Kele Xu, Qisheng Xu, Yong Dou
PDF
Deep Video Codec Control for Vision Models Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar
PDF
DeepDistAL: Deepfake Dataset Distillation Using Active Learning Md. Shohel Rana, Mohammad Nur Nobi, Andrew H. Sung
Deepfake Catcher: Can a Simple Fusion Be Effective and Outperform Complex DNNs? Akshay Agarwal, Nalini K. Ratha
DeepLocalization: Using Change Point Detection for Temporal Action Localization Mohammed Shaiqur Rahman, Ibne Farabi Shihab, Lynna Chu, Anuj Sharma
PDF
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer Wei Dong, Han Zhou, Ruiyi Wang, Xiaohong Liu, Guangtao Zhai, Jun Chen
PDF
DELTA: Decoupling Long-Tailed Online Continual Learning Siddeshwar Raghavan, Jiangpeng He, Fengqing Zhu
PDF
Demographic Bias Effects on Face Image Synthesis Roberto Leyva, Victor Sanchez, Gregory Epiphaniou, Carsten Maple
PDF
DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha
PDF
Deploying Machine Learning Anomaly Detection Models to Flight Ready AI Boards James Murphy, Maria Buckley, Léonie Buckley, Adam Taylor, Jake O'Brien, Brian Mac Namee
Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images Jaeyoung Chung, Jeongtaek Oh, Kyoung Mu Lee
PDF
DepthVoting: A Few-Shot Point Cloud Classification Model Incorporating a Projection-Based Voting Mechanism Yunhui Zhu, Jiajing Chen, Senem Velipasalar
Detecting Out-of-Distribution Earth Observation Images with Diffusion Models Georges Le Bellier, Nicolas Audebert
PDF
Dformer: Learning Efficient Image Restoration with Perceptual Guidance Nodirkhuja Khudjaev, Roman Tsoy, S. M. A. Sharif, Azamat Myrzabekov, Seongwan Kim, Jaeho Lee
DGBD: Depth Guided Branched Diffusion for Comprehensive Controllability in Multi-View Generation Hovhannes Margaryan, Daniil Hayrapetyan, Wenyan Cong, Zhangyang Wang, Humphrey Shi
DIA: Diffusion Based Inverse Network Attack on Collaborative Inference Dake Chen, Shiduo Li, Yuke Zhang, Chenghao Li, Souvik Kundu, Peter A. Beerel
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal
PDF
DiCo-NeRF: Difference of Cosine Similarity for Neural Rendering of Fisheye Driving Scenes Jiho Choi, Gyutae Hwang, Sang Jun Lee
DiffLight: Integrating Content and Detail for Low-Light Image Enhancement Yixu Feng, Shuo Hou, Haotian Lin, Yu Zhu, Peng Wu, Wei Dong, Jinqiu Sun, Qingsen Yan, Yanning Zhang
DiffSeg: Towards Detecting Diffusion-Based Inpainting Attacks Using Multi-Feature Segmentation Raphael Antonius Frick, Martin Steinebach
DiffTED: One-Shot Audio-Driven TED Talk Video Generation with Diffusion-Based Co-Speech Gestures Steven Hogue, Chenxu Zhang, Hamza Daruger, Yapeng Tian, Xiaohu Guo
Diffusion-Based Adaptation for Classification of Unknown Degraded Images Dinesh Daultani, Masayuki Tanaka, Masatoshi Okutomi, Kazuki Endo
Discovering Interpretable Models of Scientific Image Data with Deep Learning Christopher J. Soelistyo, Alan R. Lowe
PDF
Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert Sontje Ihler, Felix Kuhnke, Timo Kuhlgatz, Thomas Seel
Divide and Conquer Boosting for Enhanced Traffic Safety Description and Analysis with Large Vision Language Model Khai Trinh Xuan, Khoi Nguyen Nguyen, Bach Hoang Ngo, Vu Dinh Xuan, Minh-Hung An, Quang-Vinh Dinh
Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled Ensemble Blaz Rolih, Dick Ameln, Ashwin Vaidya, Samet Akcay
DMR: Disentangling Marginal Representations for Out-of-Distribution Detection Dasol Choi, Dongbin Na
Do More with What You Have: Transferring Depth-Scale from Labeled to Unlabeled Domains Alexandra Dana, Nadav Carmel, Amit Shomer, Ofer Manela, Tomer Peleg
PDF
Domain Adaptation Using Pseudo Labels for COVID-19 Detection Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen
PDF
Domain Adaptation, Explainability & Fairness in AI for Medical Image Analysis: Diagnosis of COVID-19 Based on 3-D Chest CT-Scans Dimitrios Kollias, Anastasios Arsenos, Stefanos Kollias
Domain Generalization for Crop Segmentation with Standardized Ensemble Knowledge Distillation Simone Angarano, Mauro Martini, Alessandro Navone, Marcello Chiaberge
PDF
Domain Targeted Synthetic Plant Style Transfer Using Stable Diffusion, LoRA and ControlNet Zane K. J. Hartley, Rob J. Lind, Michael P. Pound, Andrew P. French
PDF
DQ-HorizonNet: Enhancing Door Detection Accuracy in Panoramic Images via Dynamic Quantization Cing-Jia Lin, Jheng-Wei Su, Kai-Wen Hsiao, Ting-Yu Yen, Chih-Yuan Yao, Hung-Kuo Chu
Dr-SAM: An End-to-End Framework for Vascular Segmentation, Diameter Estimation, and Anomaly Detection on Angiography Images Vazgen Zohranyan, Vagner Navasardyan, Hayk Navasardyan, Jan Borggrefe, Shant Navasardyan
PDF
DRCT: Saving Image Super-Resolution Away from Information Bottleneck Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou
PDF
Drone-HAT: Hybrid Attention Transformer for Complex Action Recognition in Drone Surveillance Videos Mustaqeem Khan, Jamil Ahmad, Abdulmotaleb El Saddik, Wail Gueaieb, Giulia De Masi, Fakhri Karray
DSTCFuse: A Method Based on Dual-Cycled Cross-Awareness of Structure Tensor for Semantic Segmentation via Infrared and Visible Image Fusion Xuan Li, Rongfu Chen, Jie Wang, Lei Ma, Li Cheng, Haiwen Yuan
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang
PDF
DuST: Dual Swin Transformer for Multi-Modal Video and Time-Series Modeling Liang Shi, Yixin Chen, Meimei Liu, Feng Guo
DVMSR: Distillated Vision Mamba for Efficient Super-Resolution Xiaoyan Lei, Wenlong Zhang, Weifeng Cao
PDF
Dynamic Addition of Noise in a Diffusion Model for Anomaly Detection Justin Tebbe, Jawad Tayyub
PDF
Dynamic Distinction Learning: Adaptive Pseudo Anomalies for Video Anomaly Detection Demetris Lappas, Vasileios Argyriou, Dimitrios Makris
PDF
Dynamic Knowledge Adapter with Probabilistic Calibration for Generalized Few-Shot Semantic Segmentation Jintao Tong, Haichen Zhou, Yicong Liu, Yiman Hu, Yixiong Zou
E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data Aref Azizpour, Tai D. Nguyen, Manil Shrestha, Kaidi Xu, Edward Kim, Matthew C. Stamm
PDF
EarthMatch: Iterative Coregistration for Fine-Grained Localization of Astronaut Photography Gabriele Moreno Berton, Gabriele Goletto, Gabriele Trivigno, Alex Stoken, Barbara Caputo, Carlo Masone
PDF
ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation Iaroslav Melekhov, Anand Umashankar, Hyeong-Jin Kim, Vladislav Serkov, Dusty Argyle
PDF
ED-DCFNet: An Unsupervised Encoder-Decoder Neural Model for Event-Driven Feature Extraction and Object Tracking Raz Ramon, Hadar Cohen Duwek, Elishai Ezra Tsur
EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting Min-Hui Lin, Mahesh Reddy, Guillaume Berger, Michel Sarkis, Fatih Porikli, Ning Bi
PDF
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models Adrien Le-Coz, Houssem Ouertatani, Stéphane Herbin, Faouzi Adjed
PDF
Efficient Feature Extraction and Late Fusion Strategy for Audiovisual Emotional Mimicry Intensity Estimation Jun Yu, Wangyuan Zhu, Jichao Zhu, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Guochen Xie, Zhihong Wei, Qingsong Liu, Jiaen Liang
PDF
Efficient Light Field Image Super-Resolution via Progressive Disentangling Gaosheng Liu, Huanjing Yue, Jingyu Yang
Efficient Local Correlation Volume for Unsupervised Optical Flow Estimation on Small Moving Objects in Large Satellite Images Sarra Khairi, Etienne Meunier, Renaud Fraisse, Patrick Bouthemy
PDF
Efficient Online Multi-Camera Tracking with Memory-Efficient Accumulated Appearance Features and Trajectory Validation Lap Quoc Tran, Huan Duc Vi
Efficient Skeleton-Based Action Recognition for Real-Time Embedded Systems Nadhira Noor, Fabianaugie Jametoni, Jinbeom Kim, Hyunsu Hong, In Kyu Park
Efficient Transformer Adaptation with Soft Token Merging Xin Yuan, Hongliang Fei, Jinoo Baek
Efficient Video Stabilization via Partial Block Phase Correlation on Edge GPUs Cevahir Çigla
EfficientNet-SAM: A Novel EfficientNet with Spatial Attention Mechanism for COVID-19 Detection in Pulmonary CT Scans Ramy Farag, Parth Upadhay, Jacket Demby's, Yixiang Gao, Katherin Garces Montoya, Seyed Mohamad Ali Tousi, Gbenga Omotara, Guilherme N. DeSouza
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss Zhuoyang Zhang, Han Cai, Song Han
Efflex: Efficient and Flexible Pipeline for Spatio-Temporal Trajectory Graph Modeling and Representation Learning Ming Cheng, Ziyi Zhou, Bowen Zhang, Ziyu Wang, Jiaqi Gan, Ziang Ren, Weiqi Feng, Yi Lyu, Hefan Zhang, Xingjian Diao
PDF
EgoSG: Learning 3D Scene Graphs from Egocentric RGB-D Sequences Chaoyi Zhang, Xitong Yang, Ji Hou, Kris Kitani, Weidong Cai, Fu-Jen Chu
EL2NM: Extremely Low-Light Noise Modeling Through Diffusion Iteration Jiahao Qin, Pinle Qin, Rui Chai, Jia Qin, Zanxia Jin
ELSA: Exploiting Layer-Wise N:M Sparsity for Vision Transformer Acceleration Ning-Chi Huang, Chi-Chih Chang, Wei-Cheng Lin, Endri Taka, Diana Marculescu, Kai-Chiang Wu
Emotic Masked Autoencoder on Dual-Views with Attention Fusion for Facial Expression Recognition Xuan-Bach Nguyen, Hoang-Thien Nguyen, Thanh-Huy Nguyen, Nhu-Tai Do, Quang Vinh Dinh
EmotiEffNet and Temporal Convolutional Networks in Video-Based Facial Expression Recognition and Action Unit Detection Andrey V. Savchenko, Anna P. Sidorova
Emotion Recognition Using Transformers with Random Masking Seongjae Min, Junseok Yang, Sejoon Lim
End-to-End Deep Learning Models for Gap Identification in Maize Fields Rana Waqar, Zeljana Grbovic, Maryam Khan, Nina Pajevic, Dimitrije Stefanovic, Vladan Filipovic, Marko Panic, Nemanja Djuric
PDF
End-to-End Neural Network Compression via L1/L2 Regularized Latency Surrogates Anshul Nasery, Hardik Shah, Arun Sai Suggala, Prateek Jain
End-to-End Solution for Tenebrio Molitor Rearing Monitoring with Uncertainty Estimation and Domain Shift Detection Pawel Majewski, Piotr Lampa, Robert Burduk, Jacek Reiner
Energy-Efficient Uncertainty-Aware Biomass Composition Prediction at the Edge Muhammad Zawish, Paul Albert, Flavio Esposito, Steven Davy, Lizy Abraham
PDF
Enforcing Conditional Independence for Fair Representation Learning and Causal Image Generation Jensen Hwa, Qingyu Zhao, Aditya Lahiri, Adnan Masood, Babak Salimi, Ehsan Adeli
PDF
Enhancing 2D Representation Learning with a 3D Prior Mehmet Aygün, Prithviraj Dhar, Zhicheng Yan, Oisin Mac Aodha, Rakesh Ranjan
PDF
Enhancing Emotion Recognition with Pre-Trained Masked Autoencoders and Sequential Learning Weiwei Zhou, Jiada Lu, Chenkun Ling, Weifeng Wang, Shaowei Liu
Enhancing Image Classification Robustness Through Adversarial Sampling with Delta Data Augmentation (DDA) Iván Reyes-Amezcua, Gilberto Ochoa-Ruiz, Andres Mendez-Vazquez
Enhancing Ki-67 Cell Segmentation with Dual U-Net Models: A Step Towards Uncertainty-Informed Active Learning David Anglada-Rotger, Julia Sala, Ferran Marqués, Philippe Salembier, Montse Pardàs
PDF
Enhancing Road Object Detection in Fisheye Cameras: An Effective Framework Integrating SAHI and Hybrid Inference Bao Tran Gia, Tuong Bui Cong Khanh, Hien Ho Trong, Thuyen Tran Doan, Tien Do, Duy-Dinh Le, Thanh Duc Ngo
Enhancing Targeted Attack Transferability via Diversified Weight Pruning Hung-Jui Wang, Yu-Yu Wu, Shang-Tse Chen
PDF
Enhancing the Transferability of Adversarial Attacks with Stealth Preservation Xinwei Zhang, Tianyuan Zhang, Yitong Zhang, Shuangcheng Liu
Enhancing Traffic Safety with Parallel Dense Video Captioning for End-to-End Event Analysis Maged Shoman, Dongdong Wang, Armstrong Aboah, Mohamed A. Abdel-Aty
PDF
Enhancing Visual Question Answering Through Question-Driven Image Captions as Prompts Övgü Özdemir, Erdem Akagündüz
PDF
Enrich, Distill and Fuse: Generalized Few-Shot Semantic Segmentation in Remote Sensing Leveraging Foundation Model's Assistance Tianyi Gao, Wei Ao, Xing-ao Wang, Yuanhao Zhao, Ping Ma, Mengjie Xie, Hang Fu, Jinchang Ren, Zhi Gao
PDF
Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement Wenyi Lian, Wenjing Lian, Ziwei Luo
PDF
Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations Jingguo Liu, Yijun Xu, Shigang Li, Jianfeng Li
PDF
Evaluating and Improving Compositional Text-to-Visual Generation Baiqi Li, Zhiqiu Lin, Deepak Pathak, Jiayao Li, Yixin Fei, Kewen Wu, Xide Xia, Pengchuan Zhang, Graham Neubig, Deva Ramanan
Evaluating Confidence Calibration in Endoscopic Diagnosis Models Nikoo Dehghani, Ayla Thijssen, Quirine E. W. van der Zander, Ramon-Michel Schreuder, Erik J. Schoon, Fons van der Sommen, Peter H. N. de With
PDF
Evaluating Multimodal Large Language Models Across Distribution Shifts and Augmentations Aayush Atul Verma, Amir Saeidi, Shamanthak Hegde, Ajay Therala, Fenil Denish Bardoliya, Nagaraju Machavarapu, Shri Ajay Kumar Ravindhiran, Srija Malyala, Agneet Chatterjee, Yezhou Yang, Chitta Baral
Evaluating the Effectiveness of Video Anomaly Detection in the Wild: Online Learning and Inference for Real-World Deployment Shanle Yao, Ghazal Alinezhad Noghre, Armin Danesh Pazho, Hamed Tabkhi
PDF
Evaluating the Integration of Morph Attack Detection in Automated Face Recognition Systems Andrea Panzino, Simone Maurizio La Cava, Giulia Orrù, Gian Luca Marcialis
PDF
Event Camera Demosaicing via Swin Transformer and Pixel-Focus Loss Yunfan Lu, Yijie Xu, Wenzong Ma, Weiyu Guo, Hui Xiong
PDF
Event-Based Ball Spin Estimation in Sports Takuya Nakabayashi, Kyota Higa, Masahiro Yamaguchi, Ryo Fujiwara, Hideo Saito
Event-Based Eye Tracking. AIS 2024 Challenge Survey Zuowen Wang, Chang Gao, Zongwei Wu, Marcos V. Conde, Radu Timofte, Shih-Chii Liu, Qinyu Chen, Zhengjun Zha, Wei Zhai, Han Han, Bohao Liao, Yuliang Wu, Zengyu Wan, Zhong Wang, Yang Cao, Ganchao Tan, Jinze Chen, Yan Ru Pei, Sasskia Brüers, Sébastien M. Crouzet, Douglas McLelland, Olivier Coenen, Baoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So, Philippe Bich, Chiara Boretti, Luciano Prono, Mircea Lica, David Dinucu-Jianu, Catalin Grîu, Xiaopeng Lin, Hongwei Ren, Bojun Cheng, Xinan Zhang, Valentin Vial, Anthony Yezzi, James Tsai
PDF
ExerAIde: AI-Assisted Multimodal Diagnosis for Enhanced Sports Performance and Personalised Rehabilitation Ahmed Rashid Qazi, Asim Iqbal
Exploiting CLIP Self-Consistency to Automate Image Augmentation for Safety Critical Scenarios Sujan Sai Gannamaneni, Frederic Klein, Michael Mock, Maram Akila
Exploration of Data Augmentation Techniques for Bush Detection in Blueberry Orchards Boris Culjak, Nina Pajevic, Vladan Filipovic, Dimitrije Stefanovic, Zeljana Grbovic, Nemanja Djuric, Marko Panic
PDF
Exploring AI-Based Satellite Pose Estimation: From Novel Synthetic Dataset to In-Depth Performance Evaluation Fabien Gallet, Christophe Marabotto, Thomas Chambon
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap Bowen Qu, Xiaoyu Liang, Shangkun Sun, Wei Gao
PDF
Exploring Facial Expression Recognition Through Semi-Supervised Pre-Training and Temporal Modeling Jun Yu, Zhihong Wei, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qingsong Liu, Jiaen Liang
PDF
Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models Samuel M. Bateman, Ning Xu, H. Charles Zhao, Yael Ben Shalom, Vince Gong, Greg Long, Will Maddern
PDF
Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery Xavier Bou, Gabriele Facciolo, Rafael Grompone von Gioi, Jean-Michel Morel, Thibaud Ehret
PDF
Exploring Text-to-Motion Generation with Human Preference Jenny Sheng, Matthieu Lin, Andrew Zhao, Kevin Pruvost, Yu-Hui Wen, Yangguang Li, Gao Huang, Yong-Jin Liu
PDF
Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation Brunó Bence Englert, Fabrizio J. Piva, Tommie Kerssies, Daan de Geus, Gijs Dubbelman
PDF
Exploring the Impact of Dataset Bias on Dataset Distillation Yao Lu, Jianyang Gu, Xuguang Chen, Saeed Vahidian, Qi Xuan
PDF
Exploring the Limits: Applying State-of-the-Art Stereo Matching Algorithms to Rectified Ultra-Wide Stereo Filip Slezak, Morten Stigaard Laursen, Thomas B. Moeslund
PDF
Exploring the Role of Audio in Video Captioning Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang
PDF
Exploring the Usage of Diffusion Models for Thermal Image Super-Resolution: A Generic, Uncertainty-Aware Approach for Guided and Non-Guided Schemes Carlos Cortés-Mendez, Jean-Bernard Hayet
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following Anshul Gupta, Pierre Vuillecard, Arya Farkhondeh, Jean-Marc Odobez
PDF
Extending Global-Local View Alignment for Self-Supervised Learning with Remote Sensing Imagery Xinye Wanyan, Sachith Seneviratne, Shuchang Shen, Michael Kirley
PDF
FairSSD: Understanding Bias in Synthetic Speech Detectors Amit Kumar Singh Yadav, Kratika Bhagtani, Davide Salvi, Paolo Bestagini, Edward J. Delp
PDF
Fake It to Make It: Using Synthetic Data to Remedy the Data Shortage in Joint Multi-Modal Speech-and-Gesture Synthesis Shivam Mehta, Anna Deichler, Jim O'Regan, Birger Moëll, Jonas Beskow, Gustav Eje Henter, Simon Alexanderson
PDF
FAPNet: An Effective Frequency Adaptive Point-Based Eye Tracker Xiaopeng Lin, Hongwei Ren, Bojun Cheng
PDF
Fast-NTK: Parameter-Efficient Unlearning for Large-Scale Models Guihong Li, Hsiang Hsu, Chun-Fu Richard Chen, Radu Marculescu
PDF
Faster than Lies: Real-Time Deepfake Detection Using Binary Neural Networks Romeo Lanzino, Federico Fontana, Anxhelo Diko, Marco Raoul Marini, Luigi Cinque
PDF
FE-Det: An Effective Traffic Object Detection Framework for Fish-Eye Cameras Xingshuang Luo, Zhe Cui, Fei Su
Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions Chuheng Wei, Guoyuan Wu, Matthew J. Barth
PDF
Federated Hyperparameter Optimization Through Reward-Based Strategies: Challenges and Insights Krishna Kanth Nakka, Ahmed Frikha, Ricardo Mendes, Xue Jiang, Xuebing Zhou
Federated Learning with a Single Shared Image Sunny Soni, Aaqib Saeed, Yuki M. Asano
PDF
FedProK: Trustworthy Federated Class-Incremental Learning via Prototypical Feature Knowledge Transfer Xin Gao, Xin Yang, Hao Yu, Yan Kang, Tianrui Li
PDF
Fetal ECG Extraction on Time-Frequency Domain Using Conditional GAN Vuong D. Nguyen
Finding AI-Generated Faces in the Wild Gonzalo J. Aniano Porcile, Jack Gindi, Shivansh Mundra, James R. Verbus, Hany Farid
PDF
FineRehab: A Multi-Modality and Multi-Task Dataset for Rehabilitation Analysis Jianwei Li, Jun Xue, Rui Cao, Xiaoxia Du, Siyu Mo, Kehao Ran, Zeyan Zhang
FIQA-FAS: Face Image Quality Assessment Based Face Anti-Spoofing Ya-Chi Liang, Min-Xuan Qiu, Shang-Hong Lai
FisheyeBEVSeg: Surround View Fisheye Cameras Based Bird's-Eye View Segmentation for Autonomous Driving Senthil Kumar Yogamani, David Unger, Venkatraman Narayanan, Varun Ravi Kumar
PDF
Flexible Window-Based Self-Attention Transformer in Thermal Image Super-Resolution Hongcheng Jiang, ZhiQiang Chen
FloCoDe: Unbiased Dynamic Scene Graph Generation with Temporal Consistency and Correlation Debiasing Anant Khandelwal
PDF
FlowIBR: Leveraging Pre-Training for Efficient Neural Image-Based Rendering of Dynamic Scenes Marcel Büsching, Josef Bengtson, David Nilsson, Mårten Björkman
PDF
Focusing on What Matters: Fine-Grained Medical Activity Recognition for Trauma Resuscitation via Actor Tracking Wenjin Zhang, Keyi Li, Sen Yang, Sifan Yuan, Ivan Marsic, Genevieve J. Sippel, Mary S. Kim, Randall S. Burd
PDF
Food Portion Estimation via 3D Object Scaling Gautham Vinod, Jiangpeng He, Zeman Shao, Fengqing Zhu
PDF
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models Gong Zhang, Kai Wang, Xingqian Xu, Zhangyang Wang, Humphrey Shi
PDF
Forward-Forward Algorithm for Hyperspectral Image Classification Abel A. Reyes Angulo, Sidike Paheding
Fourier Prior-Based Two-Stage Architecture for Image Restoration Hemkant Nehete, Amit Monga, Partha Kaushik, Brajesh Kumar Kaushik
FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography Julia Yang, Alina Jade Barnett, Jon Donnelly, Satvik Kishore, Jerry Fang, Fides Regina Schwartz, Chaofan Chen, Joseph Y. Lo, Cynthia Rudin
PDF
Fractals as Pre-Training Datasets for Anomaly Detection and Localization Cynthia Ifeyinwa Ugwu, Sofia Casarin, Oswald Lanz
PDF
From 2D Portraits to 3D Realities: Advancing GAN Inversion for Enhanced Image Synthesis Wonseok Oh, Youngjoo Jo
From Synthetic to Real: A Calibration-Free Pipeline for Few-Shot Raw Image Denoising Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun
Fully Test-Time Adaptation for Object Detection Xiaoqian Ruan, Wei Tang
Fusion Transformer with Object Mask Guidance for Image Forgery Analysis Dimitrios Karageorgiou, Giorgos Kordopatis-Zilos, Symeon Papadopoulos
PDF
Gain-First or Exposure-First: Benchmark for Better Low-Light Video Photography and Enhancement Haiyang Jiang, Zhihang Zhong, Yinqiang Zheng
Gasformer: A Transformer-Based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging Toqi Tahamid Sarker, Mohamed G. Embaby, Khaled R. Ahmed, Amer AbuGhazaleh
PDF
Gaussian Splatting Decoder for 3D-Aware Generative Adversarial Networks Florian Barthel, Arian Beckmann, Wieland Morgenstern, Anna Hilsmann, Peter Eisert
PDF
Gaze Scanpath Transformer: Predicting Visual Search Target by Spatiotemporal Semantic Modeling of Gaze Scanpath Takumi Nishiyasu, Yoichi Sato
Gene-Level Representation Learning via Interventional Style Transfer in Optical Pooled Screening Mahtab Bigverdi, Burkhard Höckendorf, Heming Yao, Phil Hanslovsky, Romain Lopez, David Richmond
PDF
Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework Zhuohong Li, Fangxiao Lu, Jiaqi Zou, Lei Hu, Hongyan Zhang
PDF
Generalized Foggy-Scene Semantic Segmentation by Frequency Decoupling Qi Bi, Shaodi You, Theo Gevers
Generalized Single-Image-Based Morphing Attack Detection Using Deep Representations from Vision Transformer Haoyu Zhang, Raghavendra Ramachandra, Kiran B. Raja, Christoph Busch
Generating Diverse Agricultural Data for Vision-Based Farming Applications Mikolaj Cieslak, Umabharathi Govindarajan, Alejandro Garcia, Anuradha Chandrashekar, Torsten Hädrich, Aleksander Mendoza-Drosik, Dominik L. Michels, Sören Pirk, Chia-Chun Fu, Wojciech Palubicki
PDF
Generating Material-Aware 3D Models from Sparse Views Shi Mao, Chenming Wu, Ran Yi, Zhelun Shen, Liangjun Zhang, Wolfgang Heidrich
Generation of Structurally Realistic Retinal Fundus Images with Diffusion Models Sojung Go, Younghoon Ji, Sang Jun Park, Soochahn Lee
PDF
Generative Dataset Distillation: Balancing Global Structure and Local Details Longzhen Li, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
PDF
GenVideo: One-Shot Target-Image and Shape Aware Video Editing Using T2I Diffusion Models Sai Sree Harsha, Ambareesh Revanur, Dhwanit Agarwal, Shradha Agrawal
PDF
GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions Salvatore Esposito, Qingshan Xu, Kacper Kania, Charlie Hewitt, Octave Mariotti, Lohit Petikam, Julien Valentin, Arno Onken, Oisin Mac Aodha
PDF
GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots Simranjit Singh, Michael Fore, Dimitrios Stamoulis
PDF
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis Srikumar Sastry, Subash Khanal, Aayush Dhakal, Nathan Jacobs
PDF
GESCAM: A Dataset and Method on Gaze Estimation for Classroom Attention Measurement Athul M. Mathew, Arshad Ali Khan, Thariq Khalid, Riad Souissi
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition Mallika Garg, Debashis Ghosh, Pyari Mohan Pradhan
PDF
GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields Arnab Dey, Di Yang, Rohith Agaram, Antitza Dantcheva, Andrew I. Comport, Srinath Sridhar, Jean Martinet
PDF
GM-DETR: Generalized Multispectral DEtection TRansformer with Efficient Fusion Encoder for Visible-Infrared Detection Yiming Xiao, Fanman Meng, Qingbo Wu, Linfeng Xu, Mingzhou He, Hongliang Li
Good at Captioning, Bad at Counting: Benchmarking GPT-4V on Earth Observation Data Chenhui Zhang, Sherrie Wang
PDF
GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan
PDF
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning Jiaxi Lv, Yi Huang, Mingfu Yan, Jiancheng Huang, Jianzhuang Liu, Yifan Liu, Yafei Wen, Xiaoxin Chen, Shifeng Chen
PDF
Grad-CAMO: Learning Interpretable Single-Cell Morphological Profiles from 3D Cell Painting Images Vivek Gopalakrishnan, Jingzhe Ma, Zhiyong Xie
PDF
GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes Jan Niklas Kolf, Naser Damer, Fadi Boutros
PDF
GRASP-GCN: Graph-Shape Prioritization for Neural Architecture Search Under Distribution Shifts Sofia Casarin, Oswald Lanz, Sergio Escalera
PDF
GRIB: Combining Global Reception and Inductive Bias for Human Segmentation and Matting Yezhi Shen, Weichen Xu, Qian Lin, Jan P. Allebach, Fengqing Zhu
Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images Yiran Luo, Joshua Feinglass, Tejas Gokhale, Kuan-Cheng Lee, Chitta Baral, Yezhou Yang
PDF
GSAM+Cutie: Text-Promptable Tool Mask Annotation for Endoscopic Video Roger D. Soberanis-Mukul, Jiahuan Cheng, Jan Emily Mangulabnan, S. Swaroop Vedula, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
H3Net: Irregular Posture Detection by Understanding Human Character and Core Structures Seungha Noh, Kangmin Bae, Yuseok Bae, Byong-Dai Lee
Hairy Ground Truth Enhancement for Semantic Segmentation Sophie Fischer, Irina Voiculescu
PDF
HaLViT: Half of the Weights Are Enough Onur Can Koyun, Behçet Ugur Töreyin
HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using Harvest Piles and Remote Sensing Jonathan Xu, Amna Elmustafa, Liya Weldegebriel, Emnet Negash, Richard Lee, Chenlin Meng, Stefano Ermon, David B. Lobell
PDF
Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality Assessment Lauren Okamoto, Paritosh Parmar
PDF
High Quality Reference Feature for Two Stage Bracketing Image Restoration and Enhancement Xiaoxia Xing, Hyunhee Park, Fan Wang, Ying Zhang, Sejun Song, Changho Kim, Xiangyu Kong
High-Resolution Detection of Earth Structural Heterogeneities from Seismic Amplitudes Using Convolutional Neural Networks with Attention Layers Luiz Schirmer, Guilherme G. Schardong, Vinícius da Silva, Rogério Santos, Hélio Lopes
PDF
Hinge-Wasserstein: Estimating Multimodal Aleatoric Uncertainty in Regression Tasks Ziliang Xiong, Arvi Jonnarth, Abdelrahman Eldesokey, Joakim Johnander, Bastian Wandt, Per-Erik Forssén
PDF
HirFormer: Dynamic High Resolution Transformer for Large-Scale Image Shadow Removal Xin Lu, Yurui Zhu, Xi Wang, Dong Li, Jie Xiao, Yunpeng Zhang, Xueyang Fu, Zheng-Jun Zha
Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks Andrey Ignatov, Josephine Yates, Valentina Boeva
PDF
HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution Shu-Chuan Chu, Zhi-Chao Dou, Jeng-Shyang Pan, Shaowei Weng, Junbao Li
PDF
HNN: Hierarchical Noise-Deinterlace Net Towards Image Denoising Amogh Joshi, Nikhil Akalwadi, Chinmayee Mandi, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
How Much You Ate? Food Portion Estimation on Spoons Aaryam Sharma, Chris Czarnecki, Yuhao Chen, Pengcheng Xi, Linlin Xu, Alexander Wong
PDF
How SAM Perceives Different mp-MRI Brain Tumor Domains? Cecilia Diana-Albelda, Roberto Alcover-Couso, Álvaro García-Martín, Jesús Bescós
How Suboptimal Is Training rPPG Models with Videos and Targets from Different Body Sites? Björn Braun, Daniel McDuff, Christian Holz
PDF
How to Benchmark Vision Foundation Models for Semantic Segmentation? Tommie Kerssies, Daan de Geus, Gijs Dubbelman
PDF
Human-in-the-Loop Segmentation of Multi-Species Coral Imagery Scarlett Raine, Ross Marchant, Brano Kusy, Frédéric Maire, Niko Sünderhauf, Tobias Fischer
PDF
HumanFormer: Human-Centric Prompting Multi-Modal Perception Transformer for Referring Crowd Detection Heqian Qiu, Lanxiao Wang, Taijin Zhao, Fanman Meng, Hongliang Li
Hybrid Cross-View Attention Network for Lightweight Stereo Image Super-Resolution Yuqiang Yang, Zhiming Zhang, Yao Du, Jingjing Yang, Long Bao, Heng Sun
HyperLeaf2024 - A Hyperspectral Imaging Dataset for Classification and Regression of Wheat Leaves William Michael Laprade, Pawel Tomasz Pieta, Svetlana Kutuzova, Jesper Cairo Westergaard, Mads Nielsen, Svend Christensen, Anders Bjorholm Dahl
PDF
I-MAE: Are Latent Representations in Masked Autoencoders Linearly Separable? Kevin Zhang, Zhiqiang Shen
PDF
ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models Avinash Madasu, Vasudev Lal
PDF
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Siying Cui, Jia Guo, Xiang An, Jiankang Deng, Yongle Zhao, Xinyu Wei, Ziyong Feng
PDF
IDENet: Implicit Degradation Estimation Network for Efficient Blind Super Resolution Asif Hussain Khan, Christian Micheloni, Niki Martinel
iEdit: Localised Text-Guided Image Editing with Weak Supervision Rumeysa Bodur, Erhan Gundogdu, Binod Bhattarai, Tae-Kyun Kim, Michael Donoser, Loris Bazzani
PDF
Image Restoration Refinement with Uformer GAN Xu Ouyang, Ying Chen, Kaiyue Zhu, Gady Agam
Image-Caption Difficulty for Efficient Weakly-Supervised Object Detection from In-the-Wild Data Giacomo Nebbia, Adriana Kovashka
Imaging Signal Recovery Using Neural Network Priors Under Uncertain Forward Model Parameters Xiwen Chen, Wenhui Zhu, Peijie Qiu, Abolfazl Razi
PDF
IMIL: Interactive Medical Image Learning Framework Adrit Rao, Andrea Fisher, Ken Chang, John Christopher Panagides, Katherine McNamara, Joon-Young Lee, Oliver O. Aalami
PDF
Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks Madhumitha Sakthi, Louis Kerofsky, Varun Ravi Kumar, Senthil Kumar Yogamani
PDF
Implicit Assimilation of Sparse in Situ Data for Dense & Global Storm Surge Forecasting Patrick Ebel, Brandon Victor, Peter Naylor, Gabriele Meoni, Federico Serva, Rochelle Schneider
ImplicitTerrain: A Continuous Surface Model for Terrain Data Analysis Haoan Feng, Xin Xu, Leila De Floriani
PDF
Improved Crop and Weed Detection with Diverse Data Ensemble Learning Muhammad Hamza Asad, Saeed Anwar, Abdul Bais
PDF
Improving Consistency in Cardiovascular Disease Risk Assessment: Cross-Camera Adaptation for Retinal Images Weiyi Zhang, Danli Shi, Mingguang He
Improving Object Detection to Fisheye Cameras with Open-Vocabulary Pseudo-Label Approach Long Hoang Pham, Quoc Pham-Nam Ho, Duong Nguyen-Ngoc Tran, Tai Huu-Phuong Tran, Huy-Hung Nguyen, Duong Khac Vu, Chi Dai Tran, Ngoc Doan-Minh Huynh, Hyung-Min Jeon, Hyung-Joon Jeon, Jae Wook Jeon
Improving the Efficiency-Accuracy Trade-Off of DETR-Style Models in Practice Yumin Suh, Dongwan Kim, Abhishek Aich, Samuel Schulter, Jong-Chyi Su, Bohyung Han, Manmohan Chandraker
Improving the Robustness of 3D Human Pose Estimation: A Benchmark Dataset and Learning from Noisy Input Trung-Hieu Hoang, Mona Zehni, Huy Phan, Duc Minh Vo, Minh N. Do
Improving Valence-Arousal Estimation with Spatiotemporal Relationship Learning and Multimodal Fusion Jun Yu, Gongpeng Zhao, Yongqi Wang, Zhihong Wei, Zerui Zhang, Zhongpeng Cai, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Shuoping Yang, Yang Zheng, Qingsong Liu, Jiaen Liang
in2IN: Leveraging Individual Information to Generate Human INteractions Pablo Ruiz-Ponce, Germán Barquero, Cristina Palmero, Sergio Escalera, José García Rodríguez
PDF
Interpreting COVID Lateral Flow Tests' Results with Foundation Models Stuti Pandey, Josh Myers-Dean, Jarek Reynolds, Danna Gurari
PDF
InVERGe: Intelligent Visual Encoder for Bridging Modalities in Report Generation Ankan Deria, Komal Kumar, Snehashis Chakraborty, Dwarikanath Mahapatra, Sudipta Roy
Investigating Calibration and Corruption Robustness of Post-Hoc Pruned Perception CNNs: An Image Classification Benchmark Study Pallavi Mitra, Gesina Schwalbe, Nadja Klein
PDF
Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models Saman Motamed, Wouter Van Gansbeke, Luc Van Gool
PDF
InViG: Benchmarking Open-Ended Interactive Visual Grounding with 500k Dialogues Hanbo Zhang, Jie Xu, Yuchen Mo, Tao Kong
IrrNet: Advancing Irrigation Mapping with Incremental Patch Size Training on Remote Sensing Imagery Oishee Bintey Hoque, Samarth Swarup, Abhijin Adiga, Sayjro Kossi Nouwakpo, Madhav V. Marathe
PDF
IrrNet: Spatio-Temporal Segmentation Guided Classification for Irrigation Mapping Oishee Bintey Hoque
Is Our Continual Learner Reliable? Investigating Its Decision Attribution Stability Through SHAP Value Consistency Yusong Cai, Shimou Ling, Liang Zhang, Lili Pan, Hongliang Li
Is Synthetic Data All We Need? Benchmarking the Robustness of Models Trained with Synthetic Images Krishnakant Singh, Thanush Navaratnam, Jannik Holmer, Simone Schaub-Meyer, Stefan Roth
PDF
ISSR-DIL: Image Specific Super-Resolution Using Deep Identity Learning Sree Rama Vamsidhar S., Jayadeep D, Rama Krishna Gorthi
Joint Motion Detection in Neural Videos Training Niloufar Pourian, Alexey Supikov
Joint Multimodal Transformer for Emotion Recognition in the Wild Paul Waligora, Muhammad Haseeb Aslam, Muhammad Osama Zeeshan, Soufiane Belharbi, Alessandro Lameiras Koerich, Marco Pedersoli, Simon Bacon, Eric Granger
PDF
Joint Physical-Digital Facial Attack Detection via Simulating Spoofing Clues Xianhua He, Dashuang Liang, Song Yang, Zhanlong Hao, Hui Ma, Binjie Mao, Xi Li, Yao Wang, Pengfei Yan, Ajian Liu
PDF
Key Patches Are All You Need: A Multiple Instance Learning Framework for Robust Medical Diagnosis D. J. Araújo, Maria Rita Verdelho, Alceu Bissoto, Jacinto C. Nascimento, Carlos Santiago, Catarina Barata
PDF
KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections Chuheng Wei, Guoyuan Wu, Matthew J. Barth, Amr Abdelraouf, Rohit Gupta, Kyungtae Han
PDF
Knowledge Distillation for Efficient Instance Semantic Segmentation with Transformers Maohui Li, Michael Halstead, Chris McCool
Label Efficient Lifelong Multi-View Broiler Detection Thorsten Cardoen, Sam Leroux, Pieter Simoens
PDF
Label-Free Anomaly Detection in Aerial Agricultural Images with Masked Image Modeling Sambal Shikhar, Anupam Sobti
PDF
Lacunarity Pooling Layers for Plant Image Classification Using Texture Analysis Akshatha Mohan, Joshua Peeples
PDF
LaDiffGAN: Training GANs with Diffusion Supervision in Latent Spaces Xuhui Liu, Bohan Zeng, Sicheng Gao, Shanglin Li, Yutang Feng, Hong Li, Boyu Liu, Jianzhuang Liu, Baochang Zhang
LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints Mengmeng Liu, Hao Cheng, Lin Chen, Hellward Broszio, Jiangtao Li, Runjiang Zhao, Monika Sester, Michael Ying Yang
PDF
Language-Guided Multi-Modal Emotional Mimicry Intensity Estimation Feng Qiu, Wei Zhang, Chen Liu, Lincheng Li, Heming Du, Tianchen Guo, Xin Yu
LaPA: Latent Prompt Assist Model for Medical Visual Question Answering Tiancheng Gu, Kaicheng Yang, Dongnan Liu, Weidong Cai
PDF
Large Kernel Frequency-Enhanced Network for Efficient Single Image Super-Resolution Jiadi Chen, Chunjiang Duanmu, Huanhuan Long
Large Language Models in Wargaming: Methodology, Application, and Robustness Yuwei Chen, Shiyong Chu
Large-Scale Bidirectional Training for Zero-Shot Image Captioning Taehoon Kim, Mark Marsden, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Alessandra Sala, Seung Hwan Kim
PDF
Large-Scale Dataset Pruning with Dynamic Uncertainty Muyang He, Shuo Yang, Tiejun Huang, Bo Zhao
PDF
Latent Flow Diffusion for Deepfake Video Generation Aashish Chandra K, A V Aashutosh, Srijan Das, Abhijit Das
Latent-Based Diffusion Model for Long-Tailed Recognition Pengxiao Han, Changkun Ye, Jieming Zhou, Jing Zhang, Jie Hong, Xuesong Li
PDF
LatentMan: Generating Consistent Animated Characters Using Image Diffusion Models Abdelrahman Eldesokey, Peter Wonka
PDF
LD-Pruner: Efficient Pruning of Latent Diffusion Models Using Task-Agnostic Insights Thibault Castells, Hyoung-Kyu Song, Bo-Kyeong Kim, Shinkook Choi
PDF
Learnable Global Spatio-Temporal Adaptive Aggregation for Bracketing Image Restoration and Enhancement Xinwei Dai, Yuanbo Zhou, Xintao Qiu, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong
Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain Steve Andreas Immanuel, Hagai Raja Sinulingga
PDF
Learning Optimized Low-Light Image Enhancement for Edge Vision Tasks Sma Sharif, Azamat Myrzabekov, Nodirkhuja Khujaev, Roman Tsoy, Seongwan Kim, Jaeho Lee
Learning Surface Terrain Classifications from Ground Penetrating Radar Anja Sheppard, Jason Brown, Nilton O. Renno, Katherine A. Skinner
PDF
Learning to Classify New Foods Incrementally via Compressed Exemplars Justin Yang, Zhihao Duan, Jiangpeng He, Fengqing Zhu
PDF
Learning to Schedule Resistant to Adversarial Attacks in Diffusion Probabilistic Models Under the Threat of Lipschitz Singularities SangHwa Hong
Learning Tracking Representations from Single Point Annotations Qiangqiang Wu, Antoni B. Chan
PDF
Learning Transferable Compound Expressions from Masked AutoEncoder Pretraining Feng Qiu, Heming Du, Wei Zhang, Chen Liu, Lincheng Li, Tianchen Guo, Xin Yu
Let Me Show You How It's Done - Cross-Modal Knowledge Distillation as Pretext Task for Semantic Segmentation Rudhishna Narayanan Nair, Ronny Hänsch
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning Zaber Ibn Abdul Hakim, Najibul Haque Sarker, Rahul Pratap Singh, Bishmoy Paul, Ali Dabouei, Min Xu
Leveraging Large Language Models for Multimodal Search Oriol Barbany, Michael Huang, Xinliang Zhu, Arnab Dhua
PDF
Leveraging Pre-Trained Multi-Task Deep Models for Trustworthy Facial Analysis in Affective Behaviour Analysis In-the-Wild Andrey V. Savchenko
LGAfford-Net: A Local Geometry Aware Affordance Detection Network for 3D Point Clouds Ramesh Ashok Tabib, Dikshit Hegde, Uma Mudenagudi
LGFN: Lightweight Light Field Image Super-Resolution Using Local Convolution Modulation and Global Attention Feature Extraction Zhongxin Yu, Liang Chen, Zhiyun Zeng, Kunping Yang, Shaofei Luo, Shaorui Chen, Cheng Zhong
Lift-Attend-Splat: Bird's-Eye-View Camera-LiDAR Fusion Using Transformers James Gunn, Zygmunt Lenyk, Anuj Sharma, Andrea Donati, Alexandru Buburuzan, John Redford, Romain Mueller
PDF
Lifting Multi-View Detection and Tracking to the Bird's Eye View Torben Teepe, Philipp Wolters, Johannes Gilg, Fabian Herzog, Gerhard Rigoll
PDF
Lightweight Maize Disease Detection Through Post-Training Quantization with Similarity Preservation Carlos Victorino Padeiro, Tse-Wei Chen, Takahiro Komamizu, Ichiro Ide
Listen Then See: Video Alignment with Speaker Attention Aviral Agrawal, Carlos Mateo Samudio Lezcano, Iqui Balam Heredia-Marin, Prabhdeep Singh Sethi
PDF
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning Junchi Wang, Lei Ke
PDF
Localised-NeRF: Specular Highlights and Colour Gradient Localising in NeRF Dharmendra Selvaratnam, Dena Bazazian
LOFI: LOng-Tailed FIne-Grained Network for Food Recognition Jesús M. Rodríguez-de-Vera, Imanol G. Estepa, Marc Bolaños, Bhalaji Nagarajan, Petia Radeva
LogicAL: Towards Logical Anomaly Synthesis for Unsupervised Anomaly Localization Ying Zhao
PDF
Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition Hasan Abed Al Kader Hammoud, Shuming Liu, Mohammed Alkhrashi, Fahad Albalawi, Bernard Ghanem
PDF
Lost in Translation: Lip-Sync Deepfake Detection from Audio-Video Mismatch Matyas Bohacek, Hany Farid
Low Latency Point Cloud Rendering with Learned Splatting Yueyu Hu, Ran Gong, Qi Sun, Yao Wang
Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets Dai Quoc Tran, Armstrong Aboah, Yuntae Jeon, Maged Shoman, Minsoo Park, Seunghee Park
PDF
Low-Rank Few-Shot Adaptation of Vision-Language Models Maxime Zanella, Ismail Ben Ayed
PDF
Low-Resolution-Only Microscopy Super-Resolution Models Generalizing to Non-Periodicities at Atomic Scale Björn Möller, Zhengyang Li, Markus Etzkorn, Tim Fingscheidt
LVS: A Learned Video Storage for Fast and Efficient Video Understanding Yunghee Lee, Jongse Park
MA-AVT: Modality Alignment for Parameter-Efficient Audio-Visual Transformers Tanvir Mahmud, Shentong Mo, Yapeng Tian, Diana Marculescu
PDF
Making Use of Unlabeled Data: Comparing Strategies for Marine Animal Detection in Long-Tailed Datasets Using Self-Supervised and Semi-Supervised Pre-Training Tarun Sharma, Danelle E. Cline, Duane Edgington
MambaPupil: Bidirectional Selective Recurrent Model for Event-Based Eye Tracking Zhong Wang, Zengyu Wan, Han Han, Bohao Liao, Yuliang Wu, Wei Zhai, Yang Cao, Zheng-Jun Zha
PDF
Manifold DivideMix: A Semi-Supervised Contrastive Learning Framework for Severe Label Noise Fahimeh Fooladgar, Minh Nguyen Nhat To, Parvin Mousavi, Purang Abolmaesumi
PDF
Masked Autoencoders Are Secretly Efficient Learners Zihao Wei, Chen Wei, Jieru Mei, Yutong Bai, Zeyu Wang, Xianhang Li, Hongru Zhu, Huiyu Wang, Alan L. Yuille, Yuyin Zhou, Cihang Xie
MaskSim: Detection of Synthetic Images by Masked Spectrum Similarity Analysis Yanhao Li, Quentin Bammey, Marina Gardella, Tina Nikoukhah, Jean-Michel Morel, Miguel Colom, Rafael Grompone von Gioi
PDF
Matting Anything Jiachen Li, Jitesh Jain, Humphrey Shi
Medical Image Segmentation with InTEnt: Integrated Entropy Weighting for Single Image Test-Time Adaptation Haoyu Dong, Nicholas Konz, Hanxue Gu, Maciej A. Mazurowski
PDF
Medium Scale Benchmark for Cricket Excited Actions Understanding Altaf Hussain, Noman Khan, Muhammad Munsif, Min Je Kim, Sung Wook Baik
MIMIC: Masked Image Modeling with Image Correspondences Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Patrick Howe, Sharan Ranjit S, Anand Bhattad, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna
PDF
MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng, Yongyong Chen, Jingyong Su, Xianyu Guan, Hongyuan Yu, Cheng Wan, Jiamin Lin, Binnan Han, Yajun Zou, Zhuoyuan Wu, Yuan Huang, Yongsheng Yu, Daoan Zhang, Jizhe Li, Xuanwu Yin, Kunlong Zuo, Yunfan Lu, Yijie Xu, Wenzong Ma, Weiyu Guo, Hui Xiong, Wei Yu, Bingchun Luo, Sabari Nathan, Priya Kansal
PDF
MIPI 2024 Challenge on Few-Shot RAW Image Denoising: Methods and Results Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, Wenhan Luo, Zikun Liu, Mingde Qiao, Junjun Jiang, Kui Jiang, Yao Xiao, Chuyang Sun, Jinhui Hu, Weijian Ruan, Yubo Dong, Kai Chen, Hyejeong Jo, Jiahao Qin, Bingjie Han, Pinle Qin, Rui Chai, Pengyuan Wang
PDF
MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu Jin, Guanqun Liu, Chen Change Loy
PDF
Mitigating Bias Using Model-Agnostic Data Attribution Sander De Coninck, Sam Leroux, Pieter Simoens
PDF
Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT Miguel Ortiz del Castillo, Jonathan Morgan, Jack McRobbie, Clint Therakam, Zaher Joukhadar, Robert Mearns, Simon Barraclough, Richard O. Sinnott, Andrew Woods, Chris Bayliss, Kris Ehinger, Benjamin I. P. Rubinstein, James Bailey, Airlie Chapman, Michele Trenti
MixStyle-Based Contrastive Test-Time Adaptation: Pathway to Domain Generalization Kota Yamashita, Kazuhiro Hotta
MixSyn: Compositional Image Synthesis with Fuzzy Masks and Style Fusion Ilke Demir, Umur Aybars Ciftci
MMA-DFER: MultiModal Adaptation of Unimodal Models for Dynamic Facial Expression Recognition In-the-Wild Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj
PDF
MMIST-ccRCC: A Real World Medical Dataset for the Development of Multi-Modal Systems Tiago Mota, Maria Rita Verdelho, Diogo J. Araújo, Alceu Bissoto, Carlos Santiago, Catarina Barata
PDF
Mobile Aware Denoiser Network (MADNet) for Quad Bayer Images Pavan C. Madhusudana, Jing Li, Zeeshan Nadir, Hamid R. Sheikh, Seok-Jun Lee
MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints Bedirhan Uguz, Ozhan Suat, Batuhan Karagöz, Emre Akbas
PDF
MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation Fei Pan, Xu Yin, Seokju Lee, Axi Niu, Sung-Eui Yoon, In So Kweon
PDF
Model-Guided Contrastive Fine-Tuning for Industrial Anomaly Detection Aitor Artola, Yannis Kolodziej, Jean-Michel Morel, Thibaud Ehret
Modeling Detailed Human Geometry with Adaptive Local Refinement Bang Du, Kunyao Chen, Haochen Zhang, Fei Yin, Baichuan Wu, Truong Nguyen
MoE-AGIQA: Mixture-of-Experts Boosted Visual Perception-Driven and Semantic-Aware Quality Assessment for AI-Generated Images Junfeng Yang, Jing Fu, Wei Zhang, Wenzhi Cao, Limei Liu, Han Peng
Monitoring Social Insect Activity with Minimal Human Supervision Tarun Sharma, Julian Morgan Wagner, Sara Beery, William B. Dickson, Michael H. Dickinson, Joseph Parker
Monocular 6-DoF Pose Estimation of Spacecrafts Utilizing Self-Iterative Optimization and Motion Consistency Yunfeng Zhang, Linjing You, Luyu Yang, Zhiwei Zhang, Xiangli Nie, Bo Zhang
MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views Runfa Li, Upal Mahbub, Vasudev Bhaskaran, Truong Q. Nguyen
PDF
Motion-Aware Needle Segmentation in Ultrasound Images Raghavv Goel, Cecilia G. Morales, Manpreet Singh, Artur Dubrawski, John Galeotti, Howie Choset
Motorcyclist Helmet Violation Detection Framework by Leveraging Robust Ensemble and Augmentation Methods Thien Van Luong, Huu Si Phuc Nguyen, Duy Khanh Dinh, Viet Hung Duong, Duy Hong Sam Vo, Huan Vu, Minh Tuan Hoang, Tien Cuong Nguyen
MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images Ke-Lei Wang, Pin-Hsuan Chou, Young-Ching Chou, Chia-Jen Liu, Cheng-Kuan Lin, Yu-Chee Tseng
PDF
MULi-Ev: Maintaining Unperturbed LiDAR-Event Calibration Mathieu Cocheteux, Julien Moreau, Franck Davoine
PDF
Multi Model Ensemble for Compound Expression Recognition Jun Yu, Jichao Zhu, Wangyuan Zhu, Zhongpeng Cai, Gongpeng Zhao, Zhihong Wei, Guochen Xie, Zerui Zhang, Qingsong Liu, Jiaen Liang
Multi-Angle Consistent Generative NeRF with Additive Angular Margin Momentum Contrastive Learning Hang Zou, Hui Zhang, Yuan Zhang, Hui Ma, Dexin Zhao, Qi Zhang, Qi Li
Multi-Bit, Black-Box Watermarking of Deep Neural Networks in Embedded Applications Sam Leroux, Stijn Vanassche, Pieter Simoens
PDF
Multi-Explainable TemporalNet: An Interpretable Multimodal Approach Using Temporal Convolutional Network for User-Level Depression Detection Anas Zafar, Danyal Aftab, Rizwan Qureshi, Yaofeng Wang, Hong Yan
Multi-Level Feature Fusion Network for Lightweight Stereo Image Super-Resolution Yunxiang Li, Wenbin Zou, Qiaomu Wei, Feng Huang, Jing Wu
PDF
Multi-Modal Aerial View Image Challenge: SAR Classification Spencer Low, Oliver Nina, Dylan Bowald, Angel Domingo Sappa, Nathan Inkawhich, Peter Bruns
Multi-Modal Aerial View Image Challenge: Sensor Domain Translation Spencer Low, Oliver Nina, Dylan Bowald, Angel Domingo Sappa, Nathan Inkawhich, Peter Bruns
Multi-Modal Arousal and Valence Estimation Under Noisy Conditions Denis Dresvyanskiy, Maxim Markitantov, Jiawei Yu, Heysem Kaya, Alexey Karpov
Multi-Modal Fusion of Event and RGB for Monocular Depth Estimation Using a Unified Transformer-Based Architecture Anusha Devulapally, Md Fahim Faysal Khan, Siddharth Advani, Vijaykrishnan Narayanan
Multi-Modal Hit Detection and Positional Analysis in Padel Competitions Robbe Decorte, Martin Paré, Jelle Vanhaeverbeke, Joachim Taelman, Maarten Slembrouck, Steven Verstockt
PDF
Multi-Objective Hardware Aware Neural Architecture Search Using Hardware Cost Diversity Nilotpal Sinha, Peyman Rostami, Abd El Rahman Shabayek, Anis Kacem, Djamila Aouada
PDF
Multi-Perspective Traffic Video Description Model with Fine-Grained Refinement Approach Tuan-An To, Minh-Nam Tran, Trong-Bao Ho, Thien-Loc Ha, Quang-Tan Nguyen, Hoang-Chau Luong, Thanh-Duy Cao, Minh-Triet Tran
Multi-Resolution Rescored ByteTrack for Video Object Detection on Ultra-Low-Power Embedded Systems Luca Bompani, Manuele Rusci, Daniele Palossi, Francesco Conti, Luca Benini
PDF
Multi-Scale Attention Network for Single Image Super-Resolution Yan Wang, Yusen Li, Gang Wang, Xiaoguang Liu
PDF
Multi-Scale Attention-Based Inclination Angles Estimation for Panoramic Camera Yuhao Shan, Heyu Chen, Jiaying Zhang, Shigang Li, Jianfeng Li
Multi-Scale Feature Fusion Using Channel Transformers for Guided Thermal Image Super Resolution Raghunath Sai Puttagunta, Birendra Kathariya, Zhu Li, George York
Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments Benoît Gérin, Anaïs Halin, Anthony Cioppa, Maxim Henry, Bernard Ghanem, Benoît Macq, Christophe De Vleeschouwer, Marc Van Droogenbroeck
PDF
Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition Marah Halawa, Florian Blume, Pia Bideau, Martin Maier, Rasha Abdel Rahman, Olaf Hellwich
PDF
Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation Mathis Petrovich, Or Litany, Umar Iqbal, Michael J. Black, Gül Varol, Xue Bin Peng, Davis Rempe
PDF
Multi-View Action Recognition for Distracted Driver Behavior Localization Yuehuan Xu, Shuai Jiang, Zhe Cui, Fei Su
Multi-View Spatial-Temporal Learning for Understanding Unusual Behaviors in Untrimmed Naturalistic Driving Videos Huy-Hung Nguyen, Chi Dai Tran, Long Hoang Pham, Duong Nguyen-Ngoc Tran, Tai Huu-Phuong Tran, Duong Khac Vu, Quoc Pham-Nam Ho, Ngoc Doan-Minh Huynh, Hyung-Min Jeon, Hyung-Joon Jeon, Jae Wook Jeon
Multiattention-Net: A Novel Approach to Face Anti-Spoofing with Modified Squeezed Residual Blocks Sabari Nathan, M. Parisa Beham, A Nagaraj, S. Mohamed Mansoor Roomi
Multimodal Attack Detection for Action Recognition Models Furkan Mumcu, Yasin Yilmaz
PDF
Multimodal Understanding of Memes with Fair Explanations Yang Zhong, Bhiman Kumar Baghel
MultIOD: Rehearsal-Free Multihead Incremental Object Detector Eden Belouadah, Arnaud Dapogny, Kevin Bailly
PDF
MultiPanoWise: Holistic Deep Architecture for Multi-Task Dense Prediction from a Single Panoramic Image Uzair Shah, Muhammad Tukur, Mahmood Alzubaidi, Giovanni Pintore, Enrico Gobbetti, Mowafa S. Househ, Jens Schneider, Marco Agus
Must Unsupervised Continual Learning Relies on Previous Information? Haoyang Cheng, Haitao Wen, Heqian Qiu, Lanxiao Wang, Minjian Zhang, Hongliang Li
MV-Soccer: Motion-Vector Augmented Instance Segmentation for Soccer Player Tracking Fahad Majeed, Nauman Ullah Gilal, Khaled A. Al-Thelaya, Yin Yang, Marco Agus, Jens Schneider
MvAV-pix2pixHD: Multi-View Aerial View Image Translation Jun Yu, Keda Lu, Shenshen Du, Lin Xu, Peng Chang, Houde Liu, Bin Lan, Tianyu Liu
MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View Emmanuelle Bourigault, Pauline Bourigault
PDF
Narrowing the Synthetic-to-Real Gap for Thermal Infrared Semantic Image Segmentation Using Diffusion-Based Conditional Image Synthesis Christian Mayr, Christian Kübler, Norbert Haala, Michael Teutsch
NeRF as Pretraining at Scale: Generalizable 3D-Aware Semantic Representation Learning from View Prediction Wenyan Cong, Hanxue Liang, Zhiwen Fan, Peihao Wang, Yifan Jiang, Dejia Xu, A. Cengiz Öztireli, Zhangyang Wang
Neural Fields for Co-Reconstructing 3D Objects from Incidental 2D Data Dylan Campbell, Eldar Insafutdinov, João F. Henriques, Andrea Vedaldi
Neuromorphic Lip-Reading with Signed Spiking Gated Recurrent Units Manon Dampfhoffer, Thomas Mesquida
PDF
NICE: CVPR 2023 Challenge on Zero-Shot Image Captioning Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae-Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, Jonghwan Mun, Solgil Oh, Kenan Emir Ak, Gwang-Gook Lee, Yan Xu, Mingwei Shen, Kyomin Hwang, Wonsik Shin, Kamin Lee, Wonhark Park, Dongkwan Lee, Nojun Kwak, Yujin Wang, Yimu Wang, Tiancheng Gu, Xingchang Lv, Mingmao Sun
PDF
nnMobileNet: Rethinking CNN for Retinopathy Research Wenhui Zhu, Peijie Qiu, Xiwen Chen, Xin Li, Natasha Leporé, Oana M. Dumitrascu, Yalin Wang
PDF
No Bells, Just Whistles: Sports Field Registration by Leveraging Geometric Properties Marc Gutiérrez-Pérez, Antonio Agudo
PDF
NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer Sai Kumar Reddy Manne, Brendan Martin, Tyler Roy, Ryan Neilson, Rebecca Peters, Meghana Chillara, Christine W. Lary, Katherine J. Motyl, Michael Wan
PDF
NTIRE 2024 Challenge on Blind Enhancement of Compressed Image: Methods and Results Ren Yang, Radu Timofte, Bingchen Li, Xin Li, Mengxi Guo, Shijie Zhao, Li Zhang, Zhibo Chen, Dongyang Zhang, Yash Arora, Aditya Arora, Yuanbin Chen, Hui Tang, Tao Wang, Longxuan Zhao, Bin Chen, Tong Tong, Qiao Mo, Jingwei Bao, Jinhua Hao, Yukang Ding, Hantang Li, Ming Sun, Chao Zhou, Shuyuan Zhu, Zhi Jin, Wei Wang, Dandan Zhan, Jiawei Wu, Jiahao Wu, Luwei Tu, Hongyu An, Xinfeng Zhang, Woon-Ha Yeo, Wang-Taek Oh, Young-Il Kim, Han-Cheol Ryu, Long Sun, Mingjun Zhen, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Yapeng Du, Ao Li, Ziyang He, Lei Luo, Ce Zhu, Xin Yao, Sunder Ali Khowaja, Ikhyun Lee, Jaeho Lee, Seongwan Kim, Sma Sharif, Nodirkhuja Khujaev, Roman Tsoy
NTIRE 2024 Challenge on Bracketing Image Restoration and Enhancement: Datasets, Methods and Results Zhilu Zhang, Shuohao Zhang, Renlong Wu, Wangmeng Zuo, Radu Timofte, Xiaoxia Xing, Hyunhee Park, Sejun Song, Changho Kim, Xiangyu Kong, Jinlong Wu, Jianxing Zhang, Jingfan Tan, Zikun Liu, Wenhan Luo, Wenjie Lin, Chengzhi Jiang, Mingyan Han, Zhen Liu, Ting Jiang, Jinting Luo, Shen Cheng, Linze Li, Xinhan Niu, Shuaicheng Liu, Kexin Dai, Kangzhen Yang, Tao Hu, Xiangyu Chen, Yu Cao, Qingsen Yan, Yanning Zhang, Genggeng Chen, Yongqing Yang, Wei Dong, Xinwei Dai, Yuanbo Zhou, Xintao Qiu, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong, Peng Zhang, Yifei Chen, Wenbo Xiong, Zhijun Song, Pu Cheng, Taolue Feng, Yunqing He, Daiguo Zhou, Ying Huang, Xiaowen Ma, Peng Wu
NTIRE 2024 Challenge on HR Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez, Fabio Tosi, Luigi Di Stefano, Radu Timofte, Alex Costanzino, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Yangyang Zhang, Cailin Wu, Zhuangda He, Shuangshuang Yin, Jiaxu Dong, Yangchenxu Liu, Hao Jiang, Jun Shi, Yong A, Yixiang Jin, Dingzhe Li, Bingxin Ke, Anton Obukhov, Tianfu Wang, Nando Metzger, Shengyu Huang, Konrad Schindler, Yachuan Huang, Jiaqi Li, Junrui Zhang, Yiran Wang, Zihao Huang, Tianqi Liu, Zhiguo Cao, Pengzhi Li, Jui-Lin Wang, Wenjie Zhu, Hui Geng, Yuxin Zhang, Long Lan, Kele Xu, Tao Sun, Qisheng Xu, Sourav Saini, Aashray Gupta, Sahaj K. Mistry, Aryan Shukla, Vinit Jakhetiya, Sunil Prasad Jaiswal, Yuejin Sun, Zhuofan Zheng, Yi Ning, Jen-Hao Cheng, Hou-I Liu, Hsiang-Wei Huang, Cheng-Yen Yang, Zhongyu Jiang, Yi-Hao Peng, Aishi Huang, Jenq-Neng Hwang
PDF
NTIRE 2024 Challenge on Image Super-Resolution (×4): Methods and Results Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou, Hongyu An, Xinfeng Zhang, Zhiyuan Song, Ziyue Dong, Qing Zhao, Xiaogang Xu, Pengxu Wei, Zhi-Chao Dou, Gui-Ling Wang, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Cansu Korkmaz, A. Murat Tekalp, Yubin Wei, Xiaole Yan, Binren Li, Haonan Chen, Siqi Zhang, Sihan Chen, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi, Anjali Sarvaiya, Pooja Choksy, Jagrit Joshi, Shubh Kawa, Kishor P. Upla, Sushrut Patwardhan, Raghavendra Ramachandra, Sadat Hossain, Geongi Park, S. M. Nadim Uddin, Hao Xu, Yanhui Guo, Aman Urumbekov, Xingzhuo Yan, Wei Hao, Minghan Fu, Isaac Orais, Samuel Smith, Ying Liu, Wangwang Jia, Qisheng Xu, Kele Xu, Weijun Yuan, Zhan Li, Wenqing Kuang, Ruijin Guan, Ruting Deng, Zhao Zhang, Bo Wang, Suiyi Zhao, Yan Luo, Yanyan Wei, Asif Hussain Khan, Christian Micheloni, Niki Martinel
NTIRE 2024 Challenge on Light Field Image Super-Resolution: Methods and Results Yingqian Wang, Zhengyu Liang, Qianyu Chen, Longguang Wang, Jungang Yang, Radu Timofte, Yulan Guo, Wentao Chao, Yiming Kan, Xuechun Wang, Fuqing Duan, Guanghui Wang, Wang Xia, Ziqi Wang, Yue Yan, Peiqi Xia, Shunzhou Wang, Yao Lu, Angulia Yang, Kai Jin, Zeqiang Wei, Sha Guo, Mingzhi Gao, Xiuzhuang Zhou, Zhongxin Yu, Shaofei Luo, Cheng Zhong, Shaorui Chen, Long Peng, Yuhong He, Gaosheng Liu, Huanjing Yue, Jingyu Yang, Zhengjian Yao, Jiakui Hu, Lujia Jin, Zhi-Song Liu, Chenhang He, Jun Xiao, Xiuyuan Wang, Zonglin Tian, Yifan Mao, Deyang Liu, Shizheng Li, Ping An
NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan L. Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin, Yu Zhu, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang, Qingsen Yan, Wenbin Zou, Weipeng Yang, Yunxiang Li, Qiaomu Wei, Tian Ye, Sixiang Chen, Zhao Zhang, Suiyi Zhao, Bo Wang, Yan Luo, Zhichao Zuo, Mingshen Wang, Junhu Wang, Yanyan Wei, Xiaopeng Sun, Yu Gao, Jiancheng Huang, Hongming Chen, Xiang Chen, Hui Tang, Yuanbin Chen, Yuanbo Zhou, Xinwei Dai, Xintao Qiu, Wei Deng, Qinquan Gao, Tong Tong, Mingjia Li, Jin Hu, Xinyu He, Xiaojie Guo, Sabarinathan, K. Uma, A. Sasithradevi, B. Sathya Bama, S. Mohamed Mansoor Roomi, V. Srivatsav, Jinjuan Wang, Long Sun, Qiuying Chen, Jiahong Shao, Yizhi Zhang, Marcos V. Conde, Daniel Feijoo, Juan C. Benito, Álvaro García, Jaeho Lee, Seongwan Kim, Sma Sharif, Nodirkhuja Khujaev, Roman Tsoy, Ali Murtaza, Uswah Khairuddin, Ahmad 'Athif Mohd Faudzi, Sampada Malagi, Amogh Joshi, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Wenyi Lian, Wenjing Lian, Jagadeesh Kalyanshetti, Vijayalaxmi Ashok Aralikatti, Palani Yashaswini, Nitish Upasi, Dikshit Hegde, Ujwala Patil, Sujata C, Xingzhuo Yan, Wei Hao, Minghan Fu, Pooja Choksy, Anjali Sarvaiya, Kishor P. Upla, Kiran B. Raja, Hailong Yan, Yunkai Zhang, Baiang Li, Jingyi Zhang, Huan Zheng
PDF
NTIRE 2024 Challenge on Night Photography Rendering Egor I. Ershov, Artyom Panshin, Oleg Karasev, Sergey Korchagin, Shepelev Lev, Alexandr Startsev, Daniil Vladimirov, Ekaterina Zaychenkova, Nikola Banic, Dmitrii Iarchuk, Maria Efimova, Radu Timofte, Arseniy P. Terekhin
PDF
NTIRE 2024 Challenge on Short-Form UGC Video Quality Assessment: Methods and Results Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie Zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, Wangmeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, Huimin Zheng, Junhao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu
PDF
NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results Longguang Wang, Yulan Guo, Juncheng Li, Hongda Liu, Yang Zhao, Yingqian Wang, Zhi Jin, Shuhang Gu, Radu Timofte
PDF
NTIRE 2024 Dense and Non-Homogeneous Dehazing Challenge Report Codruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, Yidi Liu, Xingbo Wang, Yurui Zhu, Gege Shi, Xin Lu, Xueyang Fu, Zheng-Jun Zha, Wei Dong, Han Zhou, Ruiyi Wang, Xiaohong Liu, Guangtao Zhai, Jun Chen, Wei Song, Yichang Gao, Jiahao Xiong, Hualiang Lin, Xianger Li, Dong Li, Mohab Kishawy, Ruibin Li, Seyed Amirreza Mousavi, Rana Rauf, Yangyi Liu, Huan Liu, Mingsheng Tu, Kele Xu, JiaWen Chen, Qisheng Xu, Tao Sun, Jin Guo, Ben Shao, Tianli Liu, Mohao Wu, Xingzhuo Yan, Minghan Fu, Lehan Yang, Xin Lin, Lu Qi, Jincen Song, Xiaoqian Hu, Linwei Tao, Hongming Chen, Xiang Chen, Chuanlong Xie, Zhao Zhang, Junhu Wang, Yanyan Wei, Suiyi Zhao, Shengeng Tang, Sampada Malagi, Amogh Joshi, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Wenjing Jiang, Jagadeesh Kalyanshetti, Vijayalaxmi Ashok Aralikatti, Palani Yashaswini, Nitish Upasi, Dikshit Hegde, Ujwala Patil, Sujata C
NTIRE 2024 Image Shadow Removal Challenge Report Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Zongwei Wu, Cailian Chen, Radu Timofte, Wei Dong, Han Zhou, Yuqiong Tian, Jun Chen, Xueyang Fu, Xin Lu, Yurui Zhu, Xi Wang, Dong Li, Jie Xiao, Yunpeng Zhang, Zheng-Jun Zha, Zhao Zhang, Suiyi Zhao, Bo Wang, Yan Luo, Yanyan Wei, Zhihao Zhao, Long Sun, Tingting Yang, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Bilel Benjdira, Mohammed Nassif, Anis Koubaa, Ahmed Elhayek, Anas M. Ali, Kyotaro Tokoro, Kento Kawai, Kaname Yokoyama, Takuya Seno, Yuki Kondo, Norimichi Ukita, Chenghua Li, Bo Yang, Zhiqi Wu, Gao Chen, Yihan Yu, Sixiang Chen, Kai Zhang, Tian Ye, Wenbin Zou, Yunlong Lin, Zhaohu Xing, Jinbin Bai, Wenhao Chai, Lei Zhu, Ritik Maheshwari, Rakshank Verma, Rahul Tekchandani, Praful Hambarde, Satya Narayan Tazi, Santosh Kumar Vipparthi, Subrahmanyam Murala, Jaeho Lee, Seongwan Kim, Sma Sharif, Nodirkhuja Khujaev, Roman Tsoy, Fan Gao, Weidan Yan, Wenze Shao, Dengyin Zhang, Bin Chen, Siqi Zhang, Yanxin Qian, Yuanbin Chen, Yuanbo Zhou, Tong Tong, Rongfeng Wei, Ruiqi Sun, Yue Liu, Nikhil Akalwadi, Amogh Joshi, Sampada Malagi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Ali Murtaza, Uswah Khairuddin, Ahmad 'Athif Mohd Faudzi, Adinath Dukre, Vivek Deshmukh, Shruti S. Phutke, Ashutosh Kulkarni, Anil Gonde, Arun karthik K, Manasa N, Shri Hari Priya, Wei Hao, Xingzhuo Yan, Minghan Fu
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei Zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie Zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao
PDF
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang, Yibin Huang, Shuai Liu, Yongqiang Li, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yuxiang Chen, Xiangyu Chen, Qiubo Chen, Fengyu Sun, Mengying Cui, Jiaxu Chen, Zhenyu Hu, Jingyun Liu, Wenzhuo Ma, Ce Wang, Hanyou Zheng, Wanjie Sun, Zhenzhong Chen, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Xiong Dun, Pengzhou Ji, Yujie Xing, Xuquan Wang, Zhanshan Wang, Xinbin Cheng, Jun Xiao, Chenhang He, Xiuyuan Wang, Zhi-Song Liu, Zimeng Miao, Zhicun Yin, Ming Liu, Wangmeng Zuo, Shuai Li
PDF
NurtureNet: A Multi-Task Video-Based Approach for Newborn Anthropometry Yash Khandelwal, Mayur Arvind, Sriram Kumar, Ashish Gupta, Sachin Kumar Danisetty, Piyush Bagad, Anish Madan, Mayank Lunayach, Aditya Annavajjala, Abhishek Maiti, Sansiddh Jain, Aman Dalmia, Namrata Deka, Jerome White, Jigar Doshi, Angjoo Kanazawa, Rahul Panicker, Alpan Raval, Srinivas Rana, Makarand Tapaswi
PDF
OccFeat: Self-Supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks Sophia Sirko-Galouchenko, Alexandre Boulch, Spyros Gidaris, Andrei Bursuc, Antonín Vobecký, Patrick Pérez, Renaud Marlet
PDF
OCMCTrack: Online Multi-Target Multi-Camera Tracking with Corrective Matching Cascade Andreas Specker
OGRMPI: An Efficient Multiview Integrated Multiplane Image Based on Occlusion Guided Residuals Dae Yeol Lee, Guan-Ming Su, Peng Yin
Omni-Crack30k: A Benchmark for Crack Segmentation and the Reasonable Effectiveness of Transfer Learning Christian Benz, Volker Rodehorst
OmniControlNet: Dual-Stage Integration for Conditional Image Generation Yilin Wang, Haiyang Xu, Xiang Zhang, Zeyuan Chen, Zhizhou Sha, Zirui Wang, Zhuowen Tu
On Accuracy and Speed of Geodesic Regression: Do Geometric Priors Improve Learning on Small Datasets? Adele Myers, Nina Miolane
On the Efficiency of Privacy Attacks in Federated Learning Nawrin Tabassum, Ka-Ho Chow, Xuyu Wang, Wenbin Zhang, Yanzhao Wu
PDF
One Class Classification-Based Quality Assurance of Organs-at-Risk Delineation in Radiotherapy Yihao Zhao, Cuiyun Yuan, Ying Liang, Yang Li, Chunxia Li, Man Zhao, Jun Hu, Ningze Zhong, Chenbin Liu
One Embedding to Predict Them All: Visible and Thermal Universal Face Representations for Soft Biometric Estimation via Vision Transformers Nélida Mirabet-Herranz, Chiara Galdi, Jean-Luc Dugelay
PDF
One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham, Yao Wang
PDF
Online Multi-Camera People Tracking with Spatial-Temporal Mechanism and Anchor-Feature Hierarchical Clustering Riu Cherdchusakulchai, Sasin Phimsiri, Visarut Trairattanapa, Suchat Tungjitnob, Wasu Kudisthalert, Pornprom Kiawjak, Ek Thamwiwatthana, Phawat Borisuitsawat, Teepakorn Tosawadi, Pakcheera Choppradit, Kasisdis Mahakijdechachai, Supawit Vatathanavaro, Worawit Saetan, Vasin Suttichaya
Open-World Instance Segmentation: Top-Down Learning with Bottom-up Supervision Tarun Kalluri, Weiyao Wang, Heng Wang, Manmohan Chandraker, Lorenzo Torresani, Du Tran
PDF
OpenStory: A Large-Scale Open-Domain Dataset for Subject-Driven Visual Storytelling Zilyu Ye, Jinxiu Liu, Jinjin Cao, Zhiyang Chen, Ziwei Xuan, Mingyuan Zhou, Qi Liu, Guo-Jun Qi
OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities Lasse H. Hansen, Simon Buus Jensen, Mark P. Philipsen, Andreas Møgelmose, Lars Bodum, Thomas B. Moeslund
PDF
Optimized Martian Dust Displacement Detection Using Explainable Machine Learning Ana Lomashvili, Kristin Rammelkamp, Olivier Gasnault, Protim Bhattacharjee, Elise Clavé, Christoph H. Egerland, Susanne Schröder, Begüm Demir, Nina L. Lanza
PDF
Optimizing Object Detection via Metric-Driven Training Data Selection Changyuan Zhou, Yumin Guo, Qinxue Lv, Ji Yuan
Orientation-Conditioned Facial Texture Mapping for Video-Based Facial Remote Photoplethysmography Estimation Sam Cantrill, David Ahmedt-Aristizabal, Lars Petersson, Hanna Suominen, Mohammad Ali Armin
PDF
Our Deep CNN Face Matchers Have Developed Achromatopsia Aman Bhatta, Domingo Mery, Haiyu Wu, Joyce Annan, Michael C. King, Kevin W. Bowyer
Outsmarting Biometric Imposters: Enhancing Iris-Recognition System Security Through Physical Adversarial Example Generation and PAD Fine-Tuning Yuka Ogino, Kazuya Kakizaki, Takahiro Toizumi, Atsushi Ito
Overlap Suppression Clustering for Offline Multi-Camera People Tracking Ryuto Yoshida, Junichi Okubo, Junichiro Fujii, Masazumi Amakata, Takayoshi Yamashita
Paediatric Pulse Rate Measurements: A Comparison of Methods Using Remote Photoplethysmography Simon Wegerif, Ivan Veleslavov, Lieke Dorine van Putten, Kate Emily Bamford, Gauri Misra, Niall Mullen
Parameter Efficient Fine-Tuning of Self-Supervised ViTs Without Catastrophic Forgetting Reza Akbarian Bafghi, Nidhin Harilal, Claire Monteleoni, Maziar Raissi
PDF
PARASOL: Parametric Style Control for Diffusion Image Synthesis Gemma Canet Tarres, Dan Ruta, Tu Bui, John P. Collomosse
PDF
PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition Xi Fang, Weigang Wang, Xiaoxin Lv, Jun Yan
PDF
Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery Yona Falinie A. Gaus, Neelanjan Bhowmik, Brian K. S. Isaac-Medina, Toby P. Breckon
PDF
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön
PDF
Photorealistic Arm Robot Simulation for 3D Plant Reconstruction and Automatic Annotation Using Unreal Engine 5 Xingjian Li, Jeremy Park, Chris Reberg-Horton, Steven B. Mirsky, Edgar J. Lobaton, Lirong Xiang
Physics Based Camera Privacy: Lens and Network Co-Design to the Rescue Marius Dufraisse, Marcela Carvalho, Pauline Trouvé-Peloux, Frédéric Champagnat
PitcherNet: Powering the Moneyball Evolution in Baseball Video Analytics Jerrin Bright, Bavesh Balaji, Yuhao Chen, David A. Clausi, John S. Zelek
PDF
PMAFusion: Projection-Based Multi-Modal Alignment for 3D Semantic Occupancy Prediction Shiyao Li, Wenming Yang, Qingmin Liao
Point-Supervised Semantic Segmentation of Natural Scenes via Hyperspectral Imaging Tianqi Ren, Qiu Shen, Ying Fu, Shaodi You
PointOfView: A Multi-Modal Network for Few-Shot 3D Point Cloud Classification Fusing Point and Multi-View Image Features Huantao Ren, Jiyang Wang, Minmin Yang, Senem Velipasalar
PointPrompt: A Multi-Modal Prompting Dataset for Segment Anything Model Jorge Quesada, Mohammad Alotaibi, Mohit Prabhushankar, Ghassan AlRegib
POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference Zhiwen Fan, Panwang Pan, Peihao Wang, Yifan Jiang, Dejia Xu, Zhangyang Wang
PDF
Potential Risk Localization via Weak Labeling Out of Blind Spot Kota Shimomura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
PP-SAM: Perturbed Prompts for Robust Adaption of Segment Anything Model for Polyp Segmentation Md Mostafijur Rahman, Mustafa Munir, Debesh Jha, Ulas Bagci, Radu Marculescu
PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization Lun Huang, Qiang Qiu, Guillermo Sapiro
Practical Region-Level Attack Against Segment Anything Models Yifan Shen, Zhengyuan Li, Gang Wang
PDF
Pre-Trained Bidirectional Dynamic Memory Network for Long Video Question Answering Jinmeng Wu, Pengcheng Shu, Hanyu Hong, Lei Ma, Ying Zhu, Lei Wang
Privacy-Preserving Collaboration for Multi-Organ Segmentation via Federated Learning from Sites with Partial Labels Adway U. Kanhere, Pranav Kulkarni, Paul H. Yi, Vishwa S. Parekh
Probing Conceptual Understanding of Large Visual-Language Models Madeline Schiappa, Raiyaan Abdullah, Shehreen Azad, Jared Claypoole, Michael Cogswell, Ajay Divakaran, Yogesh S. Rawat
PDF
Prompt Learning with One-Shot Setting Based Feature Space Analysis in Vision-and-Language Models Yuki Hirohashi, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
PromptCIR: Blind Compressed Image Restoration with Prompt Learning Bingchen Li, Xin Li, Yiting Lu, Ruoyu Feng, Mengxi Guo, Shijie Zhao, Li Zhang, Zhibo Chen
PDF
Prompting Foundational Models for Omni-Supervised Instance Segmentation Arnav M. Das, Ritwick Chaudhry, Kaustav Kundu, Davide Modolo
PromptSync: Bridging Domain Gaps in Vision-Language Models Through Class-Aware Prototype Alignment and Discrimination Anant Khandelwal
PDF
Prototype-Based Interpretable Model for Glaucoma Detection Mohana Singh, B. S. Vivek, Jayavardhana Gubbi, Arpan Pal
Prune Efficiently by Soft Pruning Parakh Agarwal, Manu Mathew, Kunal Ranjan Patel, Varun Tripathi, Pramod Swami
Pruning as a Binarization Technique Lukas Frickenstein, Pierpaolo Morì, Shambhavi Balamuthu Sampath, Moritz Thoma, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Christian Unger, Claudio Passerone, Walter Stechele
Pseudo-Label Based Unsupervised Fine-Tuning of a Monocular 3D Pose Estimation Model for Sports Motions Tomohiro Suzuki, Ryota Tanaka, Kazuya Takeda, Keisuke Fujii
PUDD: Towards Robust Multi-Modal Prototype-Based Deepfake Detection Alvaro Lopez Pellcier, Yi Li, Plamen Angelov
PDF
Purposeful Regularization with Reinforcement Learning for Facial Expression Recognition In-the-Wild SangHwa Hong
PV-Cap: 3D Dynamic Scene Understanding Through Open Physics-Based Vocabulary Hidetomo Sakaino, Thao Nguyen Phuong, Vinh Nguyen Duy
QAttn: Efficient GPU Kernels for Mixed-Precision Vision Transformers Piotr Kluska, Adrián Castelló, Florian Scheidegger, A. Cristiano I. Malossi, Enrique S. Quintana-Ortí
PDF
Quality-Based Artifact Modeling for Facial Deepfake Detection in Videos Sara Concas, Simone Maurizio La Cava, Roberto Casula, Giulia Orrù, Giovanni Puglisi, Gian Luca Marcialis
PDF
QuantNAS: Quantization-Aware Neural Architecture Search for Efficient Deployment on Mobile Device Tianxiao Gao, Li Guo, Shanwei Zhao, Peihan Xu, Yukun Yang, Xionghao Liu, Shihao Wang, Shiai Zhu, Dajiang Zhou
Radar Fields: An Extension of Radiance Fields to SAR Thibaud Ehret, Roger Marí, Dawa Derksen, Nicolas Gasnier, Gabriele Facciolo
PDF
Raising the Bar of AI-Generated Image Detection with CLIP Davide Cozzolino, Giovanni Poggi, Riccardo Corvi, Matthias Nießner, Luisa Verdoliva
PDF
RAVN: Reinforcement Aided Adaptive Vector Quantization of Deep Neural Networks Anamika Jha, Aratrik Chattopadhyay, Mrinal Banerji, Disha Jain
RBSFormer: Enhanced Transformer Network for Raw Image Super-Resolution Siyuan Jiang, Senyan Xu, Xingfu Wang
RDPN6D: Residual-Based Dense Point-Wise Network for 6DoF Object Pose Estimation Based on RGB-D Images Zong-Wei Hong, Yen-Yang Hung, Chu-Song Chen
PDF
Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression Dilyara Bareeva, Maximilian Dreyer, Frederik Pahde, Wojciech Samek, Sebastian Lapuschkin
PDF
Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey Marcos V. Conde, Zhijun Lei, Wen Li, Ioannis Katsavounidis, Radu Timofte, Min Yan, Xin Liu, Qian Wang, Xiaoqian Ye, Zhan Du, Tiansen Zhang, Zhiyuan Li, Hao Wei, Chenyang Ge, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Menghan Zhou, Yiqiang Yan, Kihwan Yoon, Ganzorig Gankhuyag, Jae-Hyeon Lee, Ui-Jin Choi, Hyeon-Cheol Moon, Tae Hyun Jeong, Yoonmo Yang, Jae-Gon Kim, Jinwoo Jeong, Sunjei Kim, Xintao Qiu, Yuanbo Zhou, Kongxian Wu, Xinwei Dai, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong, Long Peng, Jiaming Guo, Xin Di, Bohao Liao, Zhibo Du, Peize Xia, Renjing Pei, Yang Wang, Yang Cao, Zhengjun Zha, Bingnan Han, Hongyuan Yu, Zhuoyuan Wu, Cheng Wan, Yuqing Liu, Haodong Yu, Jizhe Li, Zhijuan Huang, Yuan Huang, Yajun Zou, Xianyu Guan, Qi Jia, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin
PDF
Reciprocal Attention Mixing Transformer for Lightweight Image Restoration Haram Choi, Cheolwoong Na, Jihyeon Oh, Seungjae Lee, Jinseop Kim, Subeen Choe, Jeongmin Lee, Taehoon Kim, Jihoon Yang
PDF
Recognize Anything: A Strong Image Tagging Model Youcai Zhang, Xinyu Huang, Jinyu Ma, Zhaoyang Li, Zhaochuan Luo, Yanchun Xie, Yuzhuo Qin, Tong Luo, Yaqian Li, Shilong Liu, Yandong Guo, Lei Zhang
PDF
Recon3D: High Quality 3D Reconstruction from a Single Image Using Generated Back-View Explicit Priors Ruiyang Chen, Mohan Yin, Jiawei Shen, Wei Ma
Recursive Joint Cross-Modal Attention for Multimodal Fusion in Dimensional Emotion Recognition R. Gnana Praveen, Jahangir Alam
PDF
Red-Teaming Segment Anything Model Krzysztof Jankowski, Bartlomiej Sobieski, Mateusz Kwiatkowski, Jakub Szulc, Michal Janik, Hubert Baniecki, Przemyslaw Biecek
PDF
REFA: Real-Time Egocentric Facial Animations for Virtual Reality Qiang Zhang, Tong Xiao, Haroun Habeeb, Larissa Laich, Sofien Bouaziz, Patrick Snape, Wenjing Zhang, Matthew Cioffi, Peizhao Zhang, Pavel Pidlypenskyi, Winnie Lin, Luming Ma, Mengjiao Wang, Kunpeng Li, Chengjiang Long, Steven Song, Martin Prazák, Alexander Sjoholm, Ajinkya Deogade, Jaebong Lee, Julio Delgado Mangas, Amaury Aubel
PDF
Reference-Based GAN Evaluation by Adaptive Inversion Jianbo Wang, Heliang Zheng, Toshihiko Yamasaki
Refining Biologically Inconsistent Segmentation Masks with Masked Autoencoders Alexander Sauer, Yuan Tian, Joerg Bewersdorf, Jens Rittscher
PDF
Refining Remote Photoplethysmography Architectures Using CKA and Empirical Methods Nathan Vance, Patrick J. Flynn
PDF
Reliable Trajectory Prediction and Uncertainty Quantification with Conditioned Diffusion Models Marion Neumeier, Sebastian Dorn, Michael Botsch, Wolfgang Utschick
PDF
ReMOVE: A Reference-Free Metric for Object Erasure Aditya Chandrasekar, Goirik Chakrabarty, Jai Bardhan, Ramya Hebbalaguppe, Prathosh Ap
Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling Abril Corona-Figueroa, Hubert P. H. Shum, Chris G. Willcocks
PDF
RePoseDM: Recurrent Pose Alignment and Gradient Guidance for Pose Guided Image Synthesis Anant Khandelwal
PDF
Repurposing the Image Generative Potential: Exploiting GANs to Grade Diabetic Retinopathy Isabella Poles, Eleonora D'Arnese, Luca G. Cellamare, Marco D. Santambrogio, Darvin Yi
Residual-Based Language Models Are Free Boosters for Biomedical Imaging Tasks Zhixin Lai, Jing Wu, Suiyao Chen, Yucheng Zhou, Naira Hovakimyan
Rethinking the Domain Gap in Near-Infrared Face Recognition Michail Tarasiou, Jiankang Deng, Stefanos Zafeiriou
Retina: Low-Power Eye Tracking with Event Camera and Spiking Hardware Pietro Bonazzi, Sizhen Bian, Giovanni Lippolis, Yawei Li, Sadique Sheik, Michele Magno
PDF
RetinaLiteNet: A Lightweight Transformer Based CNN for Retinal Feature Segmentation Mehwish Mehmood, Majed Alsharari, Shahzaib Iqbal, Ivor T. A. Spence, Muhammad Fahim
Revisiting Pre-Trained Remote Sensing Model Benchmarks: Resizing and Normalization Matters Isaac Corley, Caleb Robinson, Rahul Dodhia, Juan M. Lavista Ferres, Peyman Najafirad
PDF
Revisiting the Domain Gap Issue in Non-Cooperative Spacecraft Pose Tracking Kun Liu, Yongjun Yu
ReweightOOD: Loss Reweighting for Distance-Based OOD Detection Sudarshan Regmi, Bibek Panthi, Yifei Ming, Prashnna K. Gyawali, Danail Stoyanov, Binod Bhattarai
RGB-D Cube R-CNN: 3D Object Detection with Selective Modality Dropout Jens Piekenbrinck, Alexander Hermans, Narunas Vaskevicius, Timm Linder, Bastian Leibe
RLNet: Robust Linearized Networks for Efficient Private Inference Sreetama Sarkar, Souvik Kundu, Peter A. Beerel
Road Object Detection Robust to Distorted Objects at the Edge Regions of Images Wooksu Shin, Donghyuk Choi, Hancheol Park, Jeongho Kim
Robust and Explainable Fine-Grained Visual Classification with Transfer Learning: A Dual-Carriageway Framework Zheming Zuo, Joseph Smith, Jonathan Stonehouse, Boguslaw Obara
PDF
Robust Data Augmentation and Ensemble Method for Object Detection in Fisheye Camera Images Viet Hung Duong, Duc Quyen Nguyen, Thien Van Luong, Huan Vu, Tien Cuong Nguyen
Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data Tarun Kalluri, Jihyeon Lee, Kihyuk Sohn, Sahil Singla, Manmohan Chandraker, Joseph Xu, Jeremiah Z. Liu
PDF
Robust Motorcycle Helmet Detection in Real-World Scenarios: Using Co-DETR and Minority Class Enhancement Hao Vo, Sieu Tran, Duc Minh Nguyen, Thua Nguyen, Tien Do, Duy-Dinh Le, Thanh Duc Ngo
Robust Perspective-N-Crater for Crater-Based Camera Pose Estimation Sofia McLeod, Chee Kheng Chng, Tatsuharu Ono, Yuta Shimizu, Ryodo Hemmi, Lachlan Holden, Matthew Rodda, Feras Dayoub, Hirdy Miyamoto, Yukihiro Takahashi, Yasuko Kasai, Tat-Jun Chin
Robustness Analysis on Foundational Segmentation Models Madeline Chantry Schiappa, Shehreen Azad, Sachidanand Vs, Yunhao Ge, Ondrej Miksik, Yogesh S. Rawat, Vibhav Vineet
PDF
Rugby Scene Classification Enhanced by Vision Language Model Naoki Nonaka, Ryo Fujihira, Toshiki Koshiba, Akira Maeda, Jun Seita
Run-Time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns Hakan Yekta Yatbaz, Mehrdad Dianati, Konstantinos Koufos, Roger Woodman
PDF
S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal Nikolina Kubiak, Armin Mustafa, Graeme Phillipson, Stephen Jolly, Simon Hadfield
PDF
SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jérôme Revaud, Yohann Cabon, Romain Brégier, JongMin Lee, Philippe Weinzaepfel
PDF
SAD-GS: Shape-Aligned Depth-Supervised Gaussian Splatting Pou-Chun Kung, Seth Isaacson, Ram Vasudevan, Katherine A. Skinner
Salient Object-Aware Background Generation Using Text-Guided Diffusion Models Amir Erfan Eshratifar, João V. B. Soares, Kapil Thadani, Shaunak Mishra, Mikhail Kuznetsov, Yueh-Ning Ku, Paloma de Juan
PDF
SAM-CLIP: Merging Vision Foundation Models Towards Semantic and Spatial Understanding Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari
PDF
SAM-PM: Enhancing Video Camouflaged Object Detection Using Spatio-Temporal Attention Muhammad Nawfal Meeran, Gokul Adethya T, Bhanu Pratyush Mantha
PDF
Sat2Cap: Mapping Fine-Grained Textual Descriptions from Satellite Images Aayush Dhakal, Adeel Ahmad, Subash Khanal, Srikumar Sastry, Hannah Kerner, Nathan Jacobs
PDF
Scaling Graph Convolutions for Mobile Vision William Avery, Mustafa Munir, Radu Marculescu
PDF
Scattering Prompt Tuning: A Fine-Tuned Foundation Model for SAR Object Recognition Weilong Guo, Shengyang Li, Jian Yang
SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations Jamie Menjay Lin, Jisoo Jeong, Hong Cai, Risheek Garrepalli, Kai Wang, Fatih Porikli
PDF
SDCNet: Spatially-Adaptive Deformable Convolution Networks for HR NonHomogeneous Dehazing Yidi Liu, Xingbo Wang, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha
SDFConnect: Neural Implicit Surface Reconstruction of a Sparse Point Cloud with Topological Constraints Anushrut Jignasu, Aditya Balu, Soumik Sarkar, Chinmay Hegde, Baskar Ganapathysubramanian, Adarsh Krishnamurthy
Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data Ivan DeAndres-Tame, Ruben Tolosana, Pietro Melzi, Rubén Vera-Rodríguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Aythami Morales, Julian Fiérrez, Javier Ortega-Garcia, Zhizhou Zhong, Yuge Huang, Yuxi Mi, Shouhong Ding, Shuigeng Zhou, Shuai He, Lingzhi Fu, Heng Cong, Rongyu Zhang, Zhihong Xiao, Evgeny Smirnov, Anton Pimenov, Aleksei Grigorev, Denis Timoshenko, Kaleb Mesfin Asfaw, Cheng-Yaw Low, Hao Liu, Chuyi Wang, Qing Zuo, Zhixiang He, Hatef Otroshi-Shahreza, Anjith George, Alexander Unnervik, Parsa Rahimi, Sébastien Marcel, Pedro C. Neto, Marco Huber, Jan Niklas Kolf, Naser Damer, Fadi Boutros, Jaime S. Cardoso, Ana Filipa Sequeira, Andrea Atzori, Gianni Fenu, Mirko Marras, Vitomir Struc, Jiang Yu, Zhangjie Li, Jichun Li, Weisong Zhao, Zhen Lei, Xiangyu Zhu, Xiaoyu Zhang, Bernardo Biesseck, Pedro Vidal, Luiz Coelho, Roger Granada, David Menotti
PDF
Seeing the Vibration from Fiber-Optic Cables: Rain Intensity Monitoring Using Deep Frequency Filtering Zhuocheng Jiang, Yangmin Ding, Junhui Zhao, Yue Tian, Shaobo Han, Sarper Ozharar, Ting Wang, James M. Moore
SegFormer3D: An Efficient Transformer for 3D Medical Image Segmentation Shehan Perera, Pouyan Navard, Alper Yilmaz
PDF
Segment Anything in Food Images Saeed S. Alahmari, Michael Gardner, Tawfiq Salem
Segment Anything Model for Road Network Graph Extraction Congrui Hetang, Haoru Xue, Cindy X. Le, Tianwei Yue, Wenping Wang, Yihui He
PDF
Segmentation-Free Guidance for Text-to-Image Diffusion Models Kambiz Azarian, Debasmit Das, Qiqi Hou, Fatih Porikli
PDF
Selective Multi-View Deep Model for 3D Object Classification Mona Saleh Alzahrani, Muhammad Usman, Saeed Anwar, Tarek Helmy
Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-Based Embedded 3D Object Detection Seongmin Park, Minjae Lee, Junwon Choi, Jungwook Choi
PDF
Self-Supervised Learning with Generative Adversarial Networks for Electron Microscopy Bashir Kazimi, Karina Ruzaeva, Stefan Sandfeld
PDF
Semantic Pre-Supplement for Exposure Correction Zhen Zou, Wei Yu, Jie Huang, Feng Zhao
Semi-Stereo: A Universal Stereo Matching Framework for Imperfect Data via Semi-Supervised Learning Xin Yue, Zongqing Lu, Xiangru Lin, Wenjia Ren, Zhijing Shao, Haonan Hu, Yu Zhang, Qingmin Liao
SemiGPC: Distribution-Aware Label Refinement for Imbalanced Semi-Supervised Learning Using Gaussian Processes Abdelhak Lemkhenter, Manchen Wang, Luca Zancato, Gurumurthy Swaminathan, Paolo Favaro, Davide Modolo
PDF
Sensor Equivariance: A Framework for Semantic Segmentation with Diverse Camera Models Hannes Reichert, Manuel Hetzel, Andreas Hubert, Konrad Doll, Bernhard Sick
Separating Lungs in CT Scans for Improved COVID19 Detection Robert Turnbull, Simon J. Mutch
SF-IQA: Quality and Similarity Integration for AI Generated Image Quality Assessment Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen
Shadow Removal Based on Diffusion, Segmentation and Super-Resolution Models Chenghua Li, Bo Yang, Zhiqi Wu, Gao Chen, Yihan Yu, Shengxiao Zhou
Shadow Removal via Global Residual Free UNet and Shadow Generation Dong Li, Xin Lu, Yurui Zhu, Xi Wang, Jie Xiao, Yunpeng Zhang, Xueyang Fu, Zheng-Jun Zha
ShadowRefiner: Towards Mask-Free Shadow Removal via Fast Fourier Transformer Wei Dong, Han Zhou, Yuqiong Tian, Jingke Sun, Xiaohong Liu, Guangtao Zhai, Jun Chen
PDF
Shape-Preserving Generation of Food Images for Automatic Dietary Assessment Guangzong Chen, Zhi-Hong Mao, Mingui Sun, Kangni Liu, Wenyan Jia
PDF
Sharpness-Aware Optimization for Real-World Adversarial Attacks for Diverse Compute Platforms with Enhanced Transferability Muchao Ye, Xiang Xu, Qin Zhang, Jonathan Wu
ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo, Zihao Li, Yilin Lang, Qinyuan Ren
PDF
Short-Form UGC Video Quality Assessment Based on Multi-Level Video Fusion with Rank-Aware Haoran Xu, Mengduo Yang, Jie Zhou, Jiaze Li
Show, Think, and Tell: Thought-Augmented Fine-Tuning of Large Language Models for Video Captioning Byoungjip Kim, Dasol Hwang, Sungjun Cho, Youngsoo Jang, Honglak Lee, Moontae Lee
Simple In-Place Data Augmentation for Surveillance Object Detection Munkh-Erdene Otgonbold, Ganzorig Batnasan, Munkhjargal Gochoo
PDF
SimpliCity: Reconstructing Buildings with Simple Regularized 3D Models Jean-Philippe Bauchet, Raphael Sulzer, Florent Lafarge, Yuliya Tarabalka
PDF
Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection Using Budding Ensemble Architecture for Object Detection Syed Sha Qutub, Michael Paulitsch, Kay-Ulrich Scholl, Neslihan Köse Cihangir, Korbinian Hagn, Fabian Oboril, Gereon Hinz, Alois Knoll
PDF
Sketch-Guided Image Inpainting with Partial Discrete Diffusion Process Nakul Sharma, Aditay Tripathi, Anirban Chakraborty, Anand Mishra
PDF
SkipPLUS: Skip the First Few Layers to Better Explain Vision Transformers Faridoun Mehri, Mohsen Fayyaz, Mahdieh Soleymani Baghshah, Mohammad Taher Pilehvar
PDF
SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping Vincent Cartillier, Grant Schindler, Irfan Essa
PDF
Snapshot Spectral Imaging for Face Anti-Spoofing: Addressing Data Challenges with Advanced Processing and Training Hui Li, Yaowen Xu, Zhaofan Zou, Zhixiang He
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap Vladimir Somers, Victor Joos, Anthony Cioppa, Silvio Giancola, Seyed Abolfazl Ghasemzadeh, Floriane Magera, Baptiste Standaert, Amir M. Mansourian, Xin Zhou, Shohreh Kasaei, Bernard Ghanem, Alexandre Alahi, Marc Van Droogenbroeck, Christophe De Vleeschouwer
PDF
SoccerNet-Depth: A Scalable Dataset for Monocular Depth Estimation in Sports Videos Arnaud Leduc, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck
PDF
Source-Free Domain Adaptation for Video Object Detection Under Adverse Image Conditions Xingguang Zhang, Chih-Hsien Chou
PDF
Source-Free Domain Adaptation of Weakly-Supervised Object Localization Models for Histology Alexis Guichemerre, Soufiane Belharbi, Tsiry Mayet, Shakeeb Murtaza, Pourya Shamsolmoali, Luke McCaffrey, Eric Granger
PDF
Sparse Multi-View Hand-Object Reconstruction for Unseen Environments Yik Lung Pang, Changjae Oh, Andrea Cavallaro
PDF
Spatio-Temporal Attention and Gaussian Processes for Personalized Video Gaze Estimation Swati Jindal, Mohit Yadav, Roberto Manduchi
PDF
Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha
PDF
SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection Mathis Kruse, Marco Rudolph, Dominik Woiwode, Bodo Rosenhahn
ST-Gait++: Leveraging Spatio-Temporal Convolutions for Gait-Based Emotion Recognition on Videos Maria Luísa Lima, Willams de Lima Costa, Estefania Talavera Martínez, Veronica Teichrieb
PDF
ST2ST: Self-Supervised Test-Time Adaptation for Video Action Recognition Masud An Nur Islam Fahim, Mohammed Innat, Jani Boutellier
StampOne: Addressing Frequency Balance in Printer-Proof Steganography Farhad Shadmand, Iurii Medvedev, Luiz Schirmer, João Marcos, Nuno Gonçalves
StegaNeRV: Video Steganography Using Implicit Neural Representation Monsij Biswal, Tong Shao, Kenneth Rose, Peng Yin, Sean McCarthy
StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models Lezhong Wang, Jeppe Revall Frisvad, Mark Bo Jensen, Siavash Arjomand Bigdeli
PDF
Strategies to Improve Real-World Applicability of Laparoscopic Anatomy Segmentation Models Fiona R. Kolbinger, Jiangpeng He, Jinge Ma, Fengqing Zhu
PDF
Strategies to Leverage Foundational Model Knowledge in Object Affordance Grounding Arushi Rai, Kyle Buettner, Adriana Kovashka
Structured Sparse Back-Propagation for Lightweight On-Device Continual Learning on Microcontroller Units Francesco Paissan, Davide Nadalini, Manuele Rusci, Alberto Ancilotto, Francesco Conti, Luca Benini, Elisabetta Farella
PDF
Style Transfer for 2D Talking Head Generation Trong-Thang Pham, Tuong Do, Nhat Le, Ngan Le, Hung Nguyen, Erman Tjiputra, Quang Tran, Anh Nguyen
SUNDIAL: 3D Satellite Understanding Through Direct, Ambient, and Complex Lighting Decomposition Nikhil Behari, Akshat Dave, Kushagra Tiwary, William Yang, Ramesh Raskar
PDF
Super-Resolution of Biomedical Volumes with 2D Supervision Cheng Jiang, Alexander Gedeon, Yiwei Lyu, Eric Landgraf, Yufeng Zhang, Xinhai Hou, Akhil Kondepudi, Asadur Chowdury, Honglak Lee, Todd C. Hollon
PDF
SuperLoRA: Parameter-Efficient Unified Adaptation for Large Vision Models Xiangyu Chen, Jing Liu, Ye Wang, Pu Perry Wang, Matthew Brand, Guanghui Wang, Toshiaki Koike-Akino
Supervised Contrastive Learning for Snapshot Spectral Imaging Face Anti-Spoofing Chuanbiao Song, Yan Hong, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang
PDF
Swift Parameter-Free Attention Network for Efficient Super-Resolution Cheng Wan, Hongyuan Yu, Zhiqi Li, Yihang Chen, Yajun Zou, Yuqing Liu, Xuanwu Yin, Kunlong Zuo
PDF
SwinFuSR: An Image Fusion-Inspired Model for RGB-Guided Thermal Image Super-Resolution Cyprien Arnold, Philippe Jouvet, Lama Seoud
PDF
SyntStereo2Real: Edge-Aware GAN for Remote Sensing Image-to-Image Translation While Maintaining Stereo Constraint Vasudha Venkatesan, Daniel Panangian, Mario Fuentes Reyes, Ksenia Bittner
PDF
T-DEED: Temporal-Discriminability Enhancer Encoder-Decoder for Precise Event Spotting in Sports Videos Artur Xarles, Sergio Escalera, Thomas B. Moeslund, Albert Clapés
PDF
T2FNorm: Train-Time Feature Normalization for OOD Detection in Image Classification Sudarshan Regmi, Bibek Panthi, Sakar Dotel, Prashnna K. Gyawali, Danail Stoyanov, Binod Bhattarai
T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences Taeryung Lee, Fabien Baradel, Thomas Lucas, Kyoung Mu Lee, Grégory Rogez
PDF
TAB: Text-Align Anomaly Backbone Model for Industrial Inspection Tasks Ho-Weng Lee, Shang-Hong Lai
PDF
Table Tennis Ball Spin Estimation with an Event Camera Thomas Gossard, Julian Krismer, Andreas Ziegler, Jonas Tebbe, Andreas Zell
PDF
Tackling Domain Shifts in Person Re-Identification: A Survey and Analysis Vuong D. Nguyen, Samiha Mirza, Abdollah Zakeri, Ayush Gupta, Khadija Khaldi, Rahma Aloui, Pranav Mantini, Shishir K. Shah, Fatima A. Merchant
Tackling the Satellite Downlink Bottleneck with Federated Onboard Learning of Image Compression Pablo Gómez, Gabriele Meoni
TAME: Task Agnostic Continual Learning Using Multiple Experts Haoran Zhu, Maryam Majzoubi, Arihant Jain, Anna Choromanska
PDF
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models Feipeng Ma, Yizhou Zhou, Yueyi Zhang, Siying Wu, Zheyu Zhang, Zilong He, Fengyun Rao, Xiaoyan Sun
TattTRN: Template Reconstruction Network for Tattoo Retrieval Lázaro Janier González-Soler, Maciej Salwowski, Christian Rathgeb, Daniel Fischer
PDF
TCCT-Net: Two-Stream Network Architecture for Fast and Efficient Engagement Estimation via Behavioral Feature Signals Alexander Vedernikov, Puneet Kumar, Haoyu Chen, Tapio Seppänen, Xiaobai Li
PDF
TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-Pitch Videos Atom Scott, Ikuma Uchida, Ning Ding, Rikuhei Umemoto, Rory P. Bunker, Ren Kobayashi, Takeshi Koyama, Masaki Onishi, Yoshinari Kameda, Keisuke Fujii
PDF
Technical Report of NICE Challenge at CVPR 2024: Caption Re-Ranking Evaluation Using Ensembled CLIP and Consensus Scores Kiyoon Jeong, Woojun Lee, Woongchan Nam, Minjeong Ma, Pilsung Kang
PDF
Temporal Surface Frame Anomalies for Deepfake Video Detection Andrea Ciamarra, Roberto Caldelli, Alberto Del Bimbo
Test Time Training for Industrial Anomaly Segmentation Alex Costanzino, Pierluigi Zama Ramirez, Mirko Del Moro, Agostino Aiezzo, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano
PDF
Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero-Shot Medical Image Segmentation Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor, Suzanne Little
PDF
Test-Time Assessment of a Model's Performance on Unseen Domains via Optimal Transport Akshay Mehra, Yunbei Zhang, Jihun Hamm
PDF
Test-Time Specialization of Dynamic Neural Networks Sam Leroux, Dewant Katare, Aaron Yi Ding, Pieter Simoens
PDF
TFNet: Exploiting Temporal Cues for Fast and Accurate LiDAR Semantic Segmentation Rong Li, Shijie Li, Xieyuanli Chen, Teli Ma, Juergen Gall, Junwei Liang
PDF
The 6th Affective Behavior Analysis In-the-Wild (ABAW) Competition Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Stefanos Zafeiriou, Irene Kotsia, Alice Baird, Chris Gagne, Chunchang Shao, Guanyu Hu
PDF
The 8th AI City Challenge Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Pranamesh Chakraborty, Sanjita Prajapati, Quan Kong, Norimasa Kobori, Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Fady Alnajjar, Ganzorig Batnasan, Ping-Yang Chen, Jun-Wei Hsieh, Xunlei Wu, Sameer Satish Pusegaonkar, Yizhou Wang, Sujit Biswas, Rama Chellappa
The Devil Is in Discretization Discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol Konstanty Subbotko, Wojciech Jablonski, Piotr Bilinski
PDF
The Expanding Scope of the Stability Gap: Unveiling Its Presence in Joint Incremental Learning of Homogeneous Tasks Sandesh Kamath, Albin Soutif-Cormerais, Joost van de Weijer, Bogdan Raducanu
PDF
The Myth of the Pyramid Ramon Izquierdo-Cordova, Walterio W. Mayol-Cuevas
The New Agronomists: Language Models Are Experts in Crop Management Jing Wu, Zhixin Lai, Suiyao Chen, Ran Tao, Pan Zhao, Naira Hovakimyan
PDF
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, Zhengjun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Qian Wang, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Emad Barsoum, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo Wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, Yuhan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
PDF
The Penalized Inverse Probability Measure for Conformal Classification Paul Melki, Lionel Bombrun, Boubacar Diallo, Jérôme Dias, Jean-Pierre Da Costa
PDF
The Revenge of BiSeNet: Efficient Multi-Task Image Segmentation Gabriele Rosi, Claudia Cuttano, Niccolò Cavagnero, Giuseppe Averta, Fabio Cermelli
PDF
The Third Monocular Depth Estimation Challenge Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, YiPing Bao, Xiao Liu, Dohyeong Kim, Jinseong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, Jinqiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora, Pihai Sun, Kui Jiang, Gang Wu, Jian Liu, Xianming Liu, Junjun Jiang, Xidan Zhang, Jianing Wei, Fangjun Wang, Zhiming Tan, Jiabao Wang, Albert Luginov, Muhammad Shahzad, Seyed Hosseini, Aleksander Trajcevski, James H. Elder
PDF
Thermal Image Super-Resolution Challenge Results - PBVS 2024 Rafael E. Rivadeneira, Angel Domingo Sappa, Chenyang Wang, Junjun Jiang, Zhiwei Zhong, Peilin Chen, Shiqi Wang
Toward Motion Robustness: A Masked Attention Regularization Framework in Remote Photoplethysmography Pengfei Zhao, Qigong Sun, Xiaolin Tian, Yige Yang, Shuo Tao, Jie Cheng, Jiantong Chen
PDF
Towards Efficient Audio-Visual Learners via Empowering Pre-Trained Vision Transformers with Cross-Modal Adaptation Kai Wang, Yapeng Tian, Dimitrios Hatzinakos
Towards Efficient Machine Unlearning with Data Augmentation: Guided Loss-Increasing (GLI) to Prevent the Catastrophic Model Utility Drop Dasol Choi, Soora Choi, Eunsun Lee, Jinwoo Seo, Dongbin Na
Towards Engineered Safe AI with Modular Concept Models Lena Heidemann, Iwo Kurzidem, Maureen Monnet, Karsten Roscher, Stephan Günnemann
Towards Explainable Visual Vessel Recognition Using Fine-Grained Classification and Image Retrieval Heiko Karus, Friedhelm Schwenker, Michael Munz, Michael Teutsch
Towards Learning Image Similarity from General Triplet Labels Radu Dondera
Towards Online Real-Time Memory-Based Video Inpainting Transformers Guillaume Thiry, Hao Tang, Radu Timofte, Luc Van Gool
PDF
Towards Quantitative Evaluation Metrics for Image Editing Approaches Dana Cohen Hochberg, Oron Anschel, Alon Shoshan, Igor Kviatkovsky, Manoj Aggarwal, Gérard Guy Medioni
Towards Real-World Video Face Restoration: A New Benchmark Ziyan Chen, Jingwen He, Xinqi Lin, Yu Qiao, Chao Dong
PDF
Towards Weakly-Supervised Domain Adaptation for Lane Detection Jingxing Zhou, Chongzhe Zhang, Jürgen Beyerer
Tracking and Counting Apples in Orchards Under Intermittent Occlusions and Low Frame Rates Gonçalo P. Matos, Carlos Santiago, João Paulo Costeira, Ricardo L. Saldanha, Ernesto M. Morgado
Tracklet-Based Explainable Video Anomaly Localization Ashish Singh, Michael J. Jones, Erik G. Learned-Miller
TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning Quang Minh Dinh, Minh Khoi Ho, Anh Quan Dang, Hung Phong Tran
PDF
Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution Cansu Korkmaz, A. Murat Tekalp
PDF
TrajFine: Predicted Trajectory Refinement for Pedestrian Trajectory Forecasting Kuan-Lin Wang, Li-Wu Tsao, Jhih-Ciang Wu, Hong-Han Shuai, Wen-Huang Cheng
Transformers for Orbit Determination Anomaly Detection and Classification Nathan Parrish Ré, Matthew Popplewell, Michael Caudill, Timothy Sullivan, Tyler Hanf, Benjamin Tatman, Kanak Parmar, Tyler Presser, Sai Chikine, Michael Grant, Richard Poulson
Tri-VAE: Triplet Variational Autoencoder for Unsupervised Anomaly Detection in Brain Tumor MRI Hansen Wijanarko, Evelyne Calista, Li-Fen Chen, Yong-Sheng Chen
Triage of 3D Pathology Data via 2.5D Multiple-Instance Learning to Guide Pathologist Assessments Gan Gao, Andrew H. Song, Fiona Wang, David Brenes, Rui Wang, Sarah S. L. Chow, Kevin W. Bishop, Lawrence D. True, Faisal Mahmood, Jonathan T. C. Liu
PDF
Two Stage Dehazing Framework for Dense and Non-Homogeneous Dehazing Wei Song, Yichang Gao, Jiahao Xiong, Hualiang Lin, Dong Li, Yun Zhang
Two-Person Interaction Augmentation with Skeleton Priors Baiyi Li, Edmond S. L. Ho, Hubert P. H. Shum, He Wang
PDF
UAV-Rain1k: A Benchmark for Raindrop Removal from UAV Aerial Imagery Wenhui Chang, Hongming Chen, Xin He, Xiang Chen, Liangduo Shen
PDF
UDAC: Under-Display Array Cameras Chengyu Wang, Jing Li, Pavan C. Madhusudanarao, Jinhan Hu, Jitesh K. Singh, WooJhon Choi, Seok-Jun Lee, Hamid R. Sheikh
UltraAugment: Fan-Shape and Artifact-Based Data Augmentation for 2D Ultrasound Images Florian Ramakers, Tom Vercauteren, Jan Deprest, Helena Williams
PDF
Uncertainty Estimation for Tumor Prediction with Unlabeled Data Juyoung Yun, Shahira Abousamra, Chen Li, Rajarsi Gupta, Tahsin M. Kurç, Dimitris Samaras, Alison L. Van Dyke, Joel H. Saltz, Chao Chen
PDF
Uncertainty-Based Forgetting Mitigation for Generalized Few-Shot Object Detection Karim Guirguis, George Eskandar, Mingyang Wang, Matthias Kayser, Eduardo Monari, Bin Yang, Jürgen Beyerer
Uncovering Hidden Emotions with Adaptive Multi-Attention Graph Networks Ankith Jain Rakesh Kumar, Bir Bhanu
Uncovering the Hidden Cost of Model Compression Diganta Misra, Muawiz Chaudhary, Agam Goyal, Bharat Runwal, Pin-Yu Chen
PDF
Understanding ReLU Network Robustness Through Test Set Certification Performance Nicola Franco, Jeanette Miriam Lorenz, Karsten Roscher, Stephan Günnemann
Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-Based Explanations Maximilian Dreyer, Reduan Achtibat, Wojciech Samek, Sebastian Lapuschkin
PDF
Unified Face Attack Detection with Micro Disturbance and a Two-Stage Training Strategy Jiaruo Yu, Dagong Lu, Xingyue Shi, Chenfan Qu, Fengjun Guo
Unified Physical-Digital Attack Detection Challenge Haocheng Yuan, Ajian Liu, Junze Zheng, Jun Wan, Jiankang Deng, Sergio Escalera, Hugo Jair Escalante, Isabelle Guyon, Zhen Lei
PDF
Unimodal Multi-Task Fusion for Emotional Mimicry Intensity Prediction Tobias Hallmen, Fabian Deuser, Norbert Oswald, Elisabeth André
PDF
Unknown Sample Discovery for Source Free Open Set Domain Adaptation Chowdhury Sadman Jahan, Andreas E. Savakis
PDF
Unravelling Robustness of Deep Face Recognition Networks Against Illicit Drug Abuse Images Hruturaj Dhake, Akshay Agarwal
Unsupervised Domain Adaptation Architecture Search with Self-Training for Land Cover Mapping Clifford Broni-Bediako, Junshi Xia, Naoto Yokoya
PDF
Unsupervised Domain Adaptation for Multi-Stain Cell Detection in Breast Cancer with Transformers Oscar Pina, Verónica Vilaplana
PDF
Unsupervised Domain Adaptation for Weed Segmentation Using Greedy Pseudo-Labelling Yingchao Huang, Abdul Bais
Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement Igor Morawski, Kai He, Shusil Dangi, Winston H. Hsu
PDF
Unsupervised Microscopy Video Denoising Mary Damilola Aiyetigbo, Alexander Korte, Ethan Anderson, Reda Chalhoub, Peter Kalivas, Feng Luo, Nianyi Li
PDF
Unsupervised Multi-Person 3D Human Pose Estimation from 2D Poses Alone Peter Hardy, Hansung Kim
PDF
Unveiling the Ambiguity in Neural Inverse Rendering: A Parameter Compensation Analysis Georgios Kouros, Minye Wu, Sushruth Nagesh, Xianling Zhang, Tinne Tuytelaars
PDF
Unveiling the Anomalies in an Ever-Changing World: A Benchmark for Pixel-Level Anomaly Detection in Continual Learning Nikola Bugarin, Jovana Bugaric, Manuel Barusco, Davide Dalle Pezze, Gian Antonio Susto
PDF
UP-NAS: Unified Proxy for Neural Architecture Search Yi-Cheng Huang, Wei-Hua Li, Chih-Han Tsou, Jun-Cheng Chen, Chu-Song Chen
UrbanSARFloods: Sentinel-1 SLC-Based Benchmark Dataset for Urban and Open-Area Flood Mapping Jie Zhao, Zhitong Xiong, Xiao Xiang Zhu
PDF
Using Counterfactual Information for Breast Classification Diagnosis Miguel Cardoso, Carlos Santiago, Jacinto C. Nascimento
Using Language-Aligned Gesture Embeddings for Understanding Gestures Accompanying Math Terms Tristan Maidment, Purav J. Patel, Erin Walker, Adriana Kovashka
uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories Giacomo D'Amicantonio, Egor Bondarau, Peter H. N. de With
PDF
UVIS: Unsupervised Video Instance Segmentation Shuaiyi Huang, Saksham Suri, Kamal Gupta, Sai Saketh Rambhatla, Ser-Nam Lim, Abhinav Shrivastava
PDF
V-VIPE: Variational View Invariant Pose Embedding Mara Levy, Abhinav Shrivastava
PDF
Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation: A Unified Approach Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor
PDF
Video Based Computational Coding of Movement Anomalies in ASD Children Priya Singh, Abhishek Pathak, Umer Jon Ganai, Braj Bhushan, Venkatesh K. Subramanian
Video Interaction Recognition Using an Attention Augmented Relational Network and Skeleton Data Farzaneh Askari, Cyril Yared, Rohit Ramaprasad, Devin Garg, Anjun Hu, James J. Clark
Video Representation Learning for Conversational Facial Expression Recognition Guided by Multiple View Reconstruction Valeriya Strizhkova, Laura M. Ferrari, Hadi Kachmar, Antitza Dantcheva, François Brémond
PDF
VideoSAGE: Video Summarization with Graph Representation Learning Jose M. Rojas Chaves, Subarna Tripathi
PDF
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images Ali Nasiri-Sarvi, Vincent Quoc-Huy Trinh, Hassan Rivaz, Mahdi S. Hosseini
PDF
Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation: Do We Need Artificial Augmentation? Dmitry Ignatov, Andrey Ignatov, Radu Timofte
PDF
Vision-Language Models for Decoding Provider Attention During Neonatal Resuscitation Felipe Parodi, Jordan K. Matelsky, Alejandra Regla-Vargas, Elizabeth E. Foglia, Charis Lim, Danielle Weinberg, Konrad P. Kording, Heidi M. Herrick, Michael L. Platt
PDF
Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning Xin Xing, Zhexiao Xiong, Abby Stylianou, Srikumar Sastry, Liyu Gong, Nathan Jacobs
PDF
VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture Heesup Yun, Sassoum Lo, Christine H. Diepenbrock, Brian N. Bailey, J. Mason Earles
PDF
ViTA: An Efficient Video-to-Text Algorithm Using VLM for RAG-Based Video Analysis System Md. Adnan Arefeen, Biplob Debnath, Md. Yusuf Sarwar Uddin, Srimat Chakradhar
ViTKD: Feature-Based Knowledge Distillation for Vision Transformers Zhendong Yang, Zhe Li, Ailing Zeng, Zexian Li, Chun Yuan, Yu Li
VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model Junsu Kim, Yunhoe Ku, Jihyeon Kim, Junuk Cha, Seungryul Baek
PDF
VMCML: Video and Music Matching via Cross-Modality Lifting Yi-Shan Lee, Wei-Cheng Tseng, Fu-En Wang, Min Sun
PDF
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting Yujin Tang, Peijie Dong, Zhenheng Tang, Xiaowen Chu, Junwei Liang
PDF
VolRAFT: Volumetric Optical Flow Network for Digital Volume Correlation of Synchrotron Radiation-Based Micro-CT Images of Bone-Implant Interfaces Tak Ming Wong, Julian Moosmann, Berit Zeller-Plumhoff
VT-Former: An Exploratory Study on Vehicle Trajectory Prediction for Highway Surveillance Through Graph Isomorphism and Transformer Armin Danesh Pazho, Ghazal Alinezhad Noghre, Vinit Katariya, Hamed Tabkhi
PDF
Wake-Sleep Energy Based Models for Continual Learning Vaibhav Singh, Anna Choromanska, Shuang Li, Yilun Du
Weakly Supervised End2End Deep Visual Odometry Amin Abouee, Ashwanth Ravi, Lars Hinneburg, Mateusz Dziwulski, Florian Ölsner, Jürgen Hess, Stefan Milz, Patrick Mäder
Weakly Supervised Set-Consistency Learning Improves Morphological Profiling of Single-Cell Images Heming Yao, Phil Hanslovsky, Jan-Christian Huetter, Burkhard Hoeckendorf, David Richmond
PDF
Weakly-Supervised Temporal Action Localization with Multi-Modal Plateau Transformers Xin Hu, Kai Li, Deep Patel, Erik Kruus, Martin Renqiang Min, Zhengming Ding
What Does CLIP Know About Peeling a Banana? Claudia Cuttano, Gabriele Rosi, Gabriele Trivigno, Giuseppe Averta
PDF
What Is Point Supervision Worth in Video Instance Segmentation? Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, José M. Álvarez, Abhinav Shrivastava, Anima Anandkumar
PDF
What Makes Multimodal In-Context Learning Work? Folco Bertini Baldassini, Mustafa Shukor, Matthieu Cord, Laure Soulier, Benjamin Piwowarski
PDF
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs Davide Caffagni, Federico Cocchi, Nicholas Moratelli, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
PDF
X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Models Jan Held, Hani Itani, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck
PDF
XoFTR: Cross-Modal Feature Matching Transformer Önder Tuzcuoglu, Aybora Köksal, Bugra Sofu, Sinan Kalkan, A. Aydin Alatan
PDF
Zero-Shot Audio-Visual Compound Expression Recognition Method Based on Emotion Probability Fusion Elena Ryumina, Maxim Markitantov, Dmitry Ryumin, Heysem Kaya, Alexey Karpov
Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation Tri Ton, Ji Woo Hong, SooHwan Eom, Jun Yeop Shim, Junyeong Kim, Chang D. Yoo
PDF
Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep Learning with Geometric Motion Model Fusion Yuxiang Huang, Yuhao Chen, John S. Zelek
PDF
ZInD-Tell: Towards Translating Indoor Panoramas into Descriptions Tonmoay Deb, Lichen Wang, Zachary Bessinger, Naji Khosravan, Eric Penner, Sing Bing Kang