CVPRW 2024 800 papers
3D Clothed Human Reconstruction from Sparse Multi-View Images
Jin Gyu Hong, Seung Young Noh, Hee Kyung Lee, Won-Sik Cheong, Ju Yong Chang
3D Human Scan with a Moving Event Camera
Kai Kohyama, Shintaro Shiba, Yoshimitsu Aoki
3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data
Zhi-Yi Lin, Bofan Lyu, Judith Cueto Fernandez, Eline van der Kruk, Ajay Seth, Xucong Zhang
A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection
Chih-Chung Hsu, Chia-Ming Lee, Yang Fan Chiang, Yi-Shiuan Chou, Chih-Yu Jiang, Shen-Chieh Tai, Chi-Han Tsai
A Comprehensive Analysis of Factors Impacting Membership Inference
Daniel DeAlcala, Gonzalo Mancera, Aythami Morales, Julian Fiérrez, Ruben Tolosana, Javier Ortega-Garcia
A Deep Biclustering Framework for Brain Network Analysis
Md Abdur Rahaman, Zening Fu, Armin Iraji, Vince D. Calhoun
A Dual-Mode Approach for Vision-Based Navigation in a Lunar Landing Scenario
Luca Ostrogovich, Roberto Del Prete, Giuseppe Tomasicchio, Nicolas Longépé, Alfredo Renga
A Generative Exploration of Cuisine Transfer
Philip Wootaek Shin, Ajay Narayanan Sridhar, Jack Sampson, Vijaykrishnan Narayanan
A Lightweight Spatiotemporal Network for Online Eye Tracking with Event Camera
Yan Ru Pei, Sasskia Brüers, Sébastien M. Crouzet, Douglas McLelland, Olivier Coenen
A Universal Protocol to Benchmark Camera Calibration for Sports
Floriane Magera, Thomas Hoyoux, Olivier Barnich, Marc Van Droogenbroeck
Active Transferability Estimation
Tarun Ram Menta, Surgan Jandial, Akash Patil, Saketh Bachu, K. B. Vimal, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Mausoom Sarkar, Chirag Agarwal
Adaptive Memory Replay for Continual Learning
James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogério Feris, Zsolt Kira, Leonid Karlinsky
Adaptive Render-Video Streaming for Virtual Environments
Jia-Jie Lim, Matthias Sebastian Treder, Aaron Chadha, Yiannis Andreopoulos
Advancing Brain Tumor Analysis: Curating a High-Quality MRI Dataset for Deep Learning-Based Molecular Marker Profiling
Divya D. Reddy, Niloufar Saadat, James M. Holcomb, Benjamin C. Wagner, Nghi C. Truong, Jason Bowerman, Kimmo J. Hatanpaa, Toral R. Patel, Marco C. Pinho, Ananth J. Madhuranthakam, Chandan Ganesh Bangalore Yogananda, Joseph A. Maldjian
Advancing COVID-19 Detection in 3D CT Scans
Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen
Adversarial Identity Injection for Semantic Face Image Synthesis
Giuseppe Tarollo, Tomaso Fontanini, Claudio Ferrari, Guido Borghi, Andrea Prati
Affine-Based Deformable Attention and Selective Fusion for Semi-Dense Matching
Hongkai Chen, Zixin Luo, Yurun Tian, Xuyang Bai, Ziyu Wang, Lei Zhou, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan
AffordanceLLM: Grounding Affordance from Vision Language Models
Shengyi Qian, Weifeng Chen, Min Bai, Xiong Zhou, Zhuowen Tu, Li Erran Li
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art
Faizan Farooq Khan, Diana Kim, Divyansh Jha, Youssef Mohamed, Hanna H. Chang, Ahmed Elgammal, Luba Elliott, Mohamed Elhoseiny
AIGC Image Quality Assessment via Image-Prompt Correspondence
Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen
AIGC-VQA: A Holistic Perception Metric for AIGC Video Quality Assessment
Yiting Lu, Xin Li, Bingchen Li, Zihao Yu, Fengbin Guan, Xinrui Wang, Ruling Liao, Yan Ye, Zhibo Chen
AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment
Chunyi Li, Tengchuan Kou, Yixuan Gao, Yuqin Cao, Wei Sun, Zicheng Zhang, Yingjie Zhou, Zhichao Zhang, Weixia Zhang, Haoning Wu, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai
AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results
Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai, Wei Sun, Yuqin Cao, Yanwei Jiang, Jun Jia, Zhichao Zhang, Zijian Chen, Weixia Zhang, Xiongkuo Min, Steve Göring, Zihao Qi, Chen Feng
ALINA: Advanced Line Identification and Notation Algorithm
Mohammed Abdul Hafeez Khan, Parth Ganeriwala, Siddhartha Bhattacharyya, Natasha A. Neogi, Raja Muthalagu
An Effective Ensemble Learning Framework for Affective Behaviour Analysis
Wei Zhang, Feng Qiu, Chen Liu, Lincheng Li, Heming Du, Tianchen Guo, Xin Yu
An Effective Method for Detecting Violation of Helmet Rule for Motorcyclists
Yunliang Chen, Wei Zhou, Zicen Zhou, Bing Ma, Chen Wang, Yingda Shang, An Guo, Tianshu Chu
An Empty Room Is All We Want: Automatic Defurnishing of Indoor Panoramas
Mira Slavcheva, Dave Gausebeck, Kevin Chen, David Buchhofer, Azwad Sabik, Chen Ma, Sachal Dhillon, Olaf Brandt, Alan Dolhasz
An Online Approach and Evaluation Method for Tracking People Across Cameras in Extremely Long Video Sequence
Cheng-Yen Yang, Hsiang-Wei Huang, Pyong-Kun Kim, Zhongyu Jiang, Kwang-Ju Kim, Chung-I Huang, Haiqing Du, Jenq-Neng Hwang
Analyzing the Internals of Neural Radiance Fields
Lukas Radl, Andreas Kurz, Michael Steiner, Markus Steinberger
Are NeRFs Ready for Autonomous Driving? Towards Closing the Real-to-Simulation Gap
Carl Lindström, Georg Hess, Adam Lilja, Maryam Fatemi, Lars Hammarstrand, Christoffer Petersson, Lennart Svensson
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong
ATOM: Attention Mixer for Efficient Dataset Distillation
Samir Khaki, Ahmad Sajedi, Kai Wang, Lucy Z. Liu, Yuri A. Lawryshyn, Konstantinos N. Plataniotis
Attention Guidance Distillation Network for Efficient Image Super-Resolution
Hongyuan Wang, Ziyan Wei, Qingting Tang, Shuli Cheng, Liejun Wang, Yongming Li
AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts
Jun Yu, Zerui Zhang, Zhihong Wei, Gongpeng Zhao, Zhongpeng Cai, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qingsong Liu, Jiaen Liang
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation
Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Seymanur Akti, Hazim Kemal Ekenel, Alexander Waibel
Augmented Self-Mask Attention Transformer for Naturalistic Driving Action Recognition
Tiantian Zhang, Qingtian Wang, Xiaodong Dong, Wenqing Yu, Hao Sun, Xuyang Zhou, Aigong Zhen, Shun Cui, Dong Wu, Zhongjiang He
Automatic Recognition of Food Ingestion Environment from the AIM-2 Wearable Sensor
Yuning Huang, M. A Hassan, Jiangpeng He, Janine A. Higgins, Megan A. McCrory, Heather A. Eicher-Miller, J. Graham Thomas, Edward Sazonov, Fengqing Zhu
BAA-NGP: Bundle-Adjusting Accelerated Neural Graphics Primitives
Sainan Liu, Shan Lin, Jingpei Lu, Alexey Supikov, Michael C. Yip
Benchmarking Robustness in Neural Radiance Fields
Chen Wang, Angtian Wang, Junbo Li, Alan L. Yuille, Cihang Xie
Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity
Zhenlin Xu, Yi Zhu, Siqi Deng, Abhay Mittal, Yanbei Chen, Manchen Wang, Paolo Favaro, Joseph Tighe, Davide Modolo
Beyond Deepfake Images: Detecting AI-Generated Videos
Danial Samadi Vahdati, Tai D. Nguyen, Aref Azizpour, Matthew C. Stamm
Beyond the Premier: Assessing Action Spotting Transfer Capability Across Diverse Domains
Bruno Cabado, Anthony Cioppa, Silvio Giancola, Andrés Villa, Bertha Guijarro-Berdiñas, Emilio J. Padrón, Bernard Ghanem, Marc Van Droogenbroeck
BGDNet: Background-Guided Indoor Panorama Depth Estimation
Jiajing Chen, Zhiqiang Wan, Manjunath Narayana, Yuguang Li, Will Hutchcroft, Senem Velipasalar, Sing Bing Kang
BigEPIT: Scaling EPIT for Light Field Image Super-Resolution
Wentao Chao, Yiming Kan, Xuechun Wang, Fuqing Duan, Guanghui Wang
BMAD: Benchmarks for Medical Anomaly Detection
Jinan Bao, Hanshi Sun, Hanqiu Deng, Yinsheng He, Zhaoxiang Zhang, Xingyu Li
BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects
Tomas Hodan, Martin Sundermeyer, Yann Labbé, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas
Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition
Genggeng Chen, Kexin Dai, Kangzhen Yang, Tao Hu, Xiangyu Chen, Yongqing Yang, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
Building Secure and Engaging Video Communication by Using Monitor Illumination
Jun Myeong Choi, Johnathan Chi-Ho Leung, Noah Frahm, Max Christman, Gedas Bertasius, Roni Sengupta
Burst Image Super-Resolution with Base Frame Selection
Sanghyun Kim, Min Jung Lee, Woohyeok Kim, Deunsol Jung, Jaesung Rim, Sunghyun Cho, Minsu Cho
Cache and Reuse: Rethinking the Efficiency of On-Device Transfer Learning
Yuedong Yang, Hung-Yueh Chiang, Guihong Li, Diana Marculescu, Radu Marculescu
CAGE: Circumplex Affect Guided Expression Inference
Niklas Wagner, Felix Mätzler, Samed Rouven Vossberg, Helen Schneider, Svetlana Pavlitska, J. Marius Zöllner
Calibration of Continual Learning Models
Lanpei Li, Elia Piccoli, Andrea Cossu, Davide Bacciu, Vincenzo Lomonaco
Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics
Shan Jia, Reilin Lyu, Kangran Zhao, Yize Chen, Zhiyuan Yan, Yan Ju, Chuanbo Hu, Xin Li, Baoyuan Wu, Siwei Lyu
CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery
Sai Bhargav Rongali, Sarthak Mehrotra, Ankit Jha, N C Mohamad Hassan, Shirsha Bose, Tanisha Gupta, Mainak Singha, Biplab Banerjee
CityLLaVA: Efficient Fine-Tuning for VLMs in City Scenario
Zhizhao Duan, Hao Cheng, Duo Xu, Xi Wu, Xiangxie Zhang, Xi Ye, Zhen Xie
Classification of 2D Ultrasound Breast Cancer Images with Deep Learning
Jack Ellis, Kofi Appiah, Emmanuel Amankwaa-Frempong, Sze Chai Kwok
Cluster Triplet Loss for Unsupervised Domain Adaptation on Histology Images
Ruby Wood, Enric Domingo, Viktor Hendrik Koelzer, Timothy S. Maughan, Jens Rittscher
CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels
Chi-Hsuan Wu, Shih-Yang Liu, Xijie Huang, Xingbo Wang, Rong Zhang, Luca Minciullo, Wong Kai Yiu, Kenny Kwan, Kwang-Ting Cheng
Coarse or Fine? Recognising Action End States Without Labels
Davide Moltisanti, Hakan Bilen, Laura Sevilla-Lara, Frank Keller
Codebook VQ-VAE Approach for Prostate Cancer Diagnosis Using Multiparametric MRI
Ekaterina Redekop, Mara Pleasure, Zichen Wang, Karthik V. Sarma, Adam Kinnaird, William Speier, Corey W. Arnold
CoDISP: Exploring Compressed Domain Camera ISP with RGB-Guided Encoder
Molin Zhang, Soumendu Majee, Chengyu Wang, Seok-Jun Lee, Hamid R. Sheikh
CoLa-SDF: Controllable Latent StyleSDF for Disentangled 3D Face Generation
Rahul Dey, Bernhard Egger, Vishnu Naresh Boddeti, Ye Wang, Tim K. Marks
Collaborative Blind Image Deblurring
Thomas Eboli, Jean-Michel Morel, Gabriele Facciolo
Collaborative Visual Place Recognition Through Federated Learning
Mattia Dutto, Gabriele Moreno Berton, Debora Caldarola, Eros Fanì, Gabriele Trivigno, Carlo Masone
Complex Style Image Transformations for Domain Generalization in Medical Images
Nikolaos Spanos, Anastasios Arsenos, Paraskevi-Antonia Theofilou, Paraskevi K. Tzouveli, Athanasios Voulodimos, Stefanos D. Kollias
Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis
Zijian Chen, Mei Wang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Xingchen Cui, Jian Zhao
Connecting NeRFs, Images, and Text
Francesco Ballerini, Pierluigi Zama Ramirez, Roberto Mirabella, Samuele Salti, Luigi Di Stefano
Continual Diffusion with STAMINA: STack-and-Mask INcremental Adapters
James Seale Smith, Yen-Chang Hsu, Zsolt Kira, Yilin Shen, Hongxia Jin
Continual Learning with Weight Interpolation
Jedrzej Kozal, Jan Wasilewski, Bartosz Krawczyk, Michal Wozniak
Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes
Ivica Obadic, Alex Levering, Lars Pennig, Dário A. B. Oliveira, Diego Marcos, Xiaoxiang Zhu
ControlPolypNet: Towards Controlled Colon Polyp Synthesis for Improved Polyp Segmentation
Vanshali Sharma, Abhishek Kumar, Debesh Jha, Manas Kamal Bhuyan, Pradip K. Das, Ulas Bagci
Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets
Hao Chen, Ran Tao, Han Zhang, Yidong Wang, Xiang Li, Wei Ye, Jindong Wang, Guosheng Hu, Marios Savvides
Coreset Selection for Object Detection
Hojun Lee, Suyoung Kim, Junhoo Lee, Jaeyoung Yoo, Nojun Kwak
COVER: A Comprehensive Video Quality Evaluator
Chenlong He, Qi Zheng, Ruoxi Zhu, Xiaoyang Zeng, Yibo Fan, Zhengzhong Tu
Creating a Digital Twin of Spinal Surgery: A Proof of Concept
Jonas Hein, Frédéric Giraud, Lilian Calvet, Alexander Schwarz, Nicola Alessandro Cavalcanti, Sergey Prokudin, Mazda Farshad, Siyu Tang, Marc Pollefeys, Fabio Carrillo, Philipp Fürnstahl
CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task
Kangzhen Yang, Tao Hu, Kexin Dai, Genggeng Chen, Yu Cao, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
Cross-View Aggregation Network for Stereo Image Super-Resolution
Zhitao Chen, Tao Lu, Kanghui Zhao, Bolin Zhu, Zhen Li, Jiaming Wang, Yanduo Zhang
CSCO: Connectivity Search of Convolutional Operators
Tunhou Zhang, Shiyu Li, Hsin-Pai Cheng, Feng Yan, Hai Li, Yiran Chen
Data-Free Defense of Black Box Models Against Adversarial Attacks
Gaurav Kumar Nayak, Inder Khatri, Ruchit Rawal, Anirban Chakraborty
Data-Free Model Fusion with Generator Assistants
Luyao Shi, Prashanth Vijayaraghavan, Ehsan Degan
DCE-Diff: Diffusion Model for Synthesis of Early and Late Dynamic Contrast-Enhanced MR Images from Non-Contrast Multimodal Inputs
Kishore Kumar M, Sriprabha Ramanarayanan, Sadhana S, Arunima Sarkar, Matcha Naga Gayathri, Keerthi Ram, Mohanasankar Sivaprakasam
Deep Generative Data Assimilation in Multimodal Setting
Yongquan Qu, Juan Nathaniel, Shuolin Li, Pierre Gentine
Deep Learning-Based Identification of Arctic Ocean Boundaries and Near-Surface Phenomena in Underwater Echograms
Femina Senjaliya, Melissa Cote, Amanda Dash, Alexandra Branzan Albu, Andrea Niemi, Stéphane Gauthier, Julek Chawarski, Steve Pearce, Kaan Ersahin, Keath Borg
Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey
Nicolas Chahine, Marcos V. Conde, Daniela Carfora, Gabriel Pacianotto, Benoit Pochon, Sira Ferradans, Radu Timofte, Zhichao Duan, Xinrui Xu, Yipo Huang, Quan Yuan, Xiangfei Sheng, Zhichao Yang, Leida Li, Haotian Fan, Fangyuan Kong, Yifang Xu, Wei Sun, Weixia Zhang, Yanwei Jiang, Haoning Wu, Zicheng Zhang, Jun Jia, Yingjie Zhou, Zhongpeng Ji, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Xiaoqi Wang, Junqi Liu, Zixi Guo, Yun Zhang, Zewen Chen, Wen Wang, Juan Wang, Bing Li
Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey
Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu, Liangyan Li, Ke Chen, Yunzhe Li, Yimo Ning, Guanhua Zhao, Jun Chen, Jinyang Yu, Kele Xu, Qisheng Xu, Yong Dou
Deep Video Codec Control for Vision Models
Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar
Demographic Bias Effects on Face Image Synthesis
Roberto Leyva, Victor Sanchez, Gregory Epiphaniou, Carsten Maple
DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera
Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha
Deploying Machine Learning Anomaly Detection Models to Flight Ready AI Boards
James Murphy, Maria Buckley, Léonie Buckley, Adam Taylor, Jake O'Brien, Brian Mac Namee
Dformer: Learning Efficient Image Restoration with Perceptual Guidance
Nodirkhuja Khudjaev, Roman Tsoy, Sma Sharif, Azamat Myrzabekov, Seongwan Kim, Jaeho Lee
DIA: Diffusion Based Inverse Network Attack on Collaborative Inference
Dake Chen, Shiduo Li, Yuke Zhang, Chenghao Li, Souvik Kundu, Peter A. Beerel
DiffLight: Integrating Content and Detail for Low-Light Image Enhancement
Yixu Feng, Shuo Hou, Haotian Lin, Yu Zhu, Peng Wu, Wei Dong, Jinqiu Sun, Qingsen Yan, Yanning Zhang
Domain Adaptation Using Pseudo Labels for COVID-19 Detection
Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen
Drone-HAT: Hybrid Attention Transformer for Complex Action Recognition in Drone Surveillance Videos
Mustaqeem Khan, Jamil Ahmad, Abdulmotaleb El Saddik, Wail Gueaieb, Giulia De Masi, Fakhri Karray
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang
EarthMatch: Iterative Coregistration for Fine-Grained Localization of Astronaut Photography
Gabriele Moreno Berton, Gabriele Goletto, Gabriele Trivigno, Alex Stoken, Barbara Caputo, Carlo Masone
ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation
Iaroslav Melekhov, Anand Umashankar, Hyeong-Jin Kim, Vladislav Serkov, Dusty Argyle
Efficient Feature Extraction and Late Fusion Strategy for Audiovisual Emotional Mimicry Intensity Estimation
Jun Yu, Wangyuan Zhu, Jichao Zhu, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Guochen Xie, Zhihong Wei, Qingsong Liu, Jiaen Liang
Efficient Skeleton-Based Action Recognition for Real-Time Embedded Systems
Nadhira Noor, Fabianaugie Jametoni, Jinbeom Kim, Hyunsu Hong, In Kyu Park
EfficientNet-SAM: A Novel EffecientNet with Spatial Attention Mechanism for COVID-19 Detection in Pulmonary CT Scans
Ramy Farag, Parth Upadhay, Jacket Demby's, Yixiang Gao, Katherin Garces Montoya, Seyed Mohamad Ali Tousi, Gbenga Omotara, Guilherme N. DeSouza
Efflex: Efficient and Flexible Pipeline for Spatio-Temporal Trajectory Graph Modeling and Representation Learning
Ming Cheng, Ziyi Zhou, Bowen Zhang, Ziyu Wang, Jiaqi Gan, Ziang Ren, Weiqi Feng, Yi Lyu, Hefan Zhang, Xingjian Diao
EgoSG: Learning 3D Scene Graphs from Egocentric RGB-D Sequences
Chaoyi Zhang, Xitong Yang, Ji Hou, Kris Kitani, Weidong Cai, Fu-Jen Chu
ELSA: Exploiting Layer-Wise N:M Sparsity for Vision Transformer Acceleration
Ning-Chi Huang, Chi-Chih Chang, Wei-Cheng Lin, Endri Taka, Diana Marculescu, Kai-Chiang Wu
End-to-End Deep Learning Models for Gap Identification in Maize Fields
Rana Waqar, Zeljana Grbovic, Maryam Khan, Nina Pajevic, Dimitrije Stefanovic, Vladan Filipovic, Marko Panic, Nemanja Djuric
Enhancing 2D Representation Learning with a 3D Prior
Mehmet Aygün, Prithviraj Dhar, Zhicheng Yan, Oisin Mac Aodha, Rakesh Ranjan
Evaluating and Improving Compositional Text-to-Visual Generation
Baiqi Li, Zhiqiu Lin, Deepak Pathak, Jiayao Li, Yixin Fei, Kewen Wu, Xide Xia, Pengchuan Zhang, Graham Neubig, Deva Ramanan
Evaluating Confidence Calibration in Endoscopic Diagnosis Models
Nikoo Dehghani, Ayla Thijssen, Quirine E. W. van der Zander, Ramon-Michel Schreuder, Erik J. Schoon, Fons van der Sommen, Peter H. N. de With
Evaluating Multimodal Large Language Models Across Distribution Shifts and Augmentations
Aayush Atul Verma, Amir Saeidi, Shamanthak Hegde, Ajay Therala, Fenil Denish Bardoliya, Nagaraju Machavarapu, Shri Ajay Kumar Ravindhiran, Srija Malyala, Agneet Chatterjee, Yezhou Yang, Chitta Baral
Event-Based Ball Spin Estimation in Sports
Takuya Nakabayashi, Kyota Higa, Masahiro Yamaguchi, Ryo Fujiwara, Hideo Saito
Event-Based Eye Tracking. AIS 2024 Challenge Survey
Zuowen Wang, Chang Gao, Zongwei Wu, Marcos V. Conde, Radu Timofte, Shih-Chii Liu, Qinyu Chen, Zhengjun Zha, Wei Zhai, Han Han, Bohao Liao, Yuliang Wu, Zengyu Wan, Zhong Wang, Yang Cao, Ganchao Tan, Jinze Chen, Yan Ru Pei, Sasskia Brüers, Sébastien M. Crouzet, Douglas McLelland, Olivier Coenen, Baoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So, Philippe Bich, Chiara Boretti, Luciano Prono, Mircea Lica, David Dinucu-Jianu, Catalin Grîu, Xiaopeng Lin, Hongwei Ren, Bojun Cheng, Xinan Zhang, Valentin Vial, Anthony Yezzi, James Tsai
Exploration of Data Augmentation Techniques for Bush Detection in Blueberry Orchards
Boris Culjak, Nina Pajevic, Vladan Filipovic, Dimitrije Stefanovic, Zeljana Grbovic, Nemanja Djuric, Marko Panic
Exploring Facial Expression Recognition Through Semi-Supervised Pre-Training and Temporal Modeling
Jun Yu, Zhihong Wei, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qingsong Liu, Jiaen Liang
Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models
Samuel M. Bateman, Ning Xu, H. Charles Zhao, Yael Ben Shalom, Vince Gong, Greg Long, Will Maddern
Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery
Xavier Bou, Gabriele Facciolo, Rafael Grompone von Gioi, Jean-Michel Morel, Thibaud Ehret
Exploring Text-to-Motion Generation with Human Preference
Jenny Sheng, Matthieu Lin, Andrew Zhao, Kevin Pruvost, Yu-Hui Wen, Yangguang Li, Gao Huang, Yong-Jin Liu
Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation
Brunó Bence Englert, Fabrizio J. Piva, Tommie Kerssies, Daan de Geus, Gijs Dubbelman
Exploring the Role of Audio in Video Captioning
Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang
FairSSD: Understanding Bias in Synthetic Speech Detectors
Amit Kumar Singh Yadav, Kratika Bhagtani, Davide Salvi, Paolo Bestagini, Edward J. Delp
Faster than Lies: Real-Time Deepfake Detection Using Binary Neural Networks
Romeo Lanzino, Federico Fontana, Anxhelo Diko, Marco Raoul Marini, Luigi Cinque
Finding AI-Generated Faces in the Wild
Gonzalo J. Aniano Porcile, Jack Gindi, Shivansh Mundra, James R. Verbus, Hany Farid
FineRehab: A Multi-Modality and Multi-Task Dataset for Rehabilitation Analysis
Jianwei Li, Jun Xue, Rui Cao, Xiaoxia Du, Siyu Mo, Kehao Ran, Zeyan Zhang
Food Portion Estimation via 3D Object Scaling
Gautham Vinod, Jiangpeng He, Zeman Shao, Fengqing Zhu
Fourier Prior-Based Two-Stage Architecture for Image Restoration
Hemkant Nehete, Amit Monga, Partha Kaushik, Brajesh Kumar Kaushik
FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography
Julia Yang, Alina Jade Barnett, Jon Donnelly, Satvik Kishore, Jerry Fang, Fides Regina Schwartz, Chaofan Chen, Joseph Y. Lo, Cynthia Rudin
Gaussian Splatting Decoder for 3D-Aware Generative Adversarial Networks
Florian Barthel, Arian Beckmann, Wieland Morgenstern, Anna Hilsmann, Peter Eisert
Gene-Level Representation Learning via Interventional Style Transfer in Optical Pooled Screening
Mahtab Bigverdi, Burkhard Höckendorf, Heming Yao, Phil Hanslovsky, Romain Lopez, David Richmond
Generating Diverse Agricultural Data for Vision-Based Farming Applications
Mikolaj Cieslak, Umabharathi Govindarajan, Alejandro Garcia, Anuradha Chandrashekar, Torsten Hädrich, Aleksander Mendoza-Drosik, Dominik L. Michels, Sören Pirk, Chia-Chun Fu, Wojciech Palubicki
Generating Material-Aware 3D Models from Sparse Views
Shi Mao, Chenming Wu, Ran Yi, Zhelun Shen, Liangjun Zhang, Wolfgang Heidrich
Generative Dataset Distillation: Balancing Global Structure and Local Details
Longzhen Li, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions
Salvatore Esposito, Qingshan Xu, Kacper Kania, Charlie Hewitt, Octave Mariotti, Lohit Petikam, Julien Valentin, Arno Onken, Oisin Mac Aodha
GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields
Arnab Dey, Di Yang, Rohith Agaram, Antitza Dantcheva, Andrew I. Comport, Srinath Sridhar, Jean Martinet
GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing
Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv, Yi Huang, Mingfu Yan, Jiancheng Huang, Jianzhuang Liu, Yifan Liu, Yafei Wen, Xiaoxin Chen, Shifeng Chen
GSAM+Cutie: Text-Promptable Tool Mask Annotation for Endoscopic Video
Roger D. Soberanis-Mukul, Jiahuan Cheng, Jan Emily Mangulabnan, S. Swaroop Vedula, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
HaLViT: Half of the Weights Are Enough
Onur Can Koyun, Behçet Ugur Töreyin
HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using Harvest Piles and Remote Sensing
Jonathan Xu, Amna Elmustafa, Liya Weldegebriel, Emnet Negash, Richard Lee, Chenlin Meng, Stefano Ermon, David B. Lobell
High Quality Reference Feature for Two Stage Bracketing Image Restoration and Enhancement
Xiaoxia Xing, Hyunhee Park, Fan Wang, Ying Zhang, Sejun Song, Changho Kim, Xiangyu Kong
Hinge-Wasserstein: Estimating Multimodal Aleatoric Uncertainty in Regression Tasks
Ziliang Xiong, Arvi Jonnarth, Abdelrahman Eldesokey, Joakim Johnander, Bastian Wandt, Per-Erik Forssén
HirFormer: Dynamic High Resolution Transformer for Large-Scale Image Shadow Removal
Xin Lu, Yurui Zhu, Xi Wang, Dong Li, Jie Xiao, Yunpeng Zhang, Xueyang Fu, Zheng-Jun Zha
HNN: Hierarchical Noise-Deinterlace Net Towards Image Denoising
Amogh Joshi, Nikhil Akalwadi, Chinmayee Mandi, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
How Much You Ate? Food Portion Estimation on Spoons
Aaryam Sharma, Chris Czarnecki, Yuhao Chen, Pengcheng Xi, Linlin Xu, Alexander Wong
How SAM Perceives Different mp-MRI Brain Tumor Domains?
Cecilia Diana-Albelda, Roberto Alcover-Couso, Álvaro García-Martín, Jesús Bescós
Human-in-the-Loop Segmentation of Multi-Species Coral Imagery
Scarlett Raine, Ross Marchant, Brano Kusy, Frédéric Maire, Niko Sünderhauf, Tobias Fischer
HyperLeaf2024 - A Hyperspectral Imaging Dataset for Classification and Regression of Wheat Leaves
William Michael Laprade, Pawel Tomasz Pieta, Svetlana Kutuzova, Jesper Cairo Westergaard, Mads Nielsen, Svend Christensen, Anders Bjorholm Dahl
iEdit: Localised Text-Guided Image Editing with Weak Supervision
Rumeysa Bodur, Erhan Gundogdu, Binod Bhattarai, Tae-Kyun Kim, Michael Donoser, Loris Bazzani
Image Restoration Refinement with Uformer GAN
Xu Ouyang, Ying Chen, Kaiyue Zhu, Gady Agam
IMIL: Interactive Medical Image Learning Framework
Adrit Rao, Andrea Fisher, Ken Chang, John Christopher Panagides, Katherine McNamara, Joon-Young Lee, Oliver O. Aalami
Implicit Assimilation of Sparse in Situ Data for Dense & Global Storm Surge Forecasting
Patrick Ebel, Brandon Victor, Peter Naylor, Gabriele Meoni, Federico Serva, Rochelle Schneider
Improving Object Detection to Fisheye Cameras with Open-Vocabulary Pseudo-Label Approach
Long Hoang Pham, Quoc Pham-Nam Ho, Duong Nguyen-Ngoc Tran, Tai Huu-Phuong Tran, Huy-Hung Nguyen, Duong Khac Vu, Chi Dai Tran, Ngoc Doan-Minh Huynh, Hyung-Min Jeon, Hyung-Joon Jeon, Jae Wook Jeon
Improving the Efficiency-Accuracy Trade-Off of DETR-Style Models in Practice
Yumin Suh, Dongwan Kim, Abhishek Aich, Samuel Schulter, Jong-Chyi Su, Bohyung Han, Manmohan Chandraker
Improving Valence-Arousal Estimation with Spatiotemporal Relationship Learning and Multimodal Fusion
Jun Yu, Gongpeng Zhao, Yongqi Wang, Zhihong Wei, Zerui Zhang, Zhongpeng Cai, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Shuoping Yang, Yang Zheng, Qingsong Liu, Jiaen Liang
in2IN: Leveraging Individual Information to Generate Human INteractions
Pablo Ruiz-Ponce, Germán Barquero, Cristina Palmero, Sergio Escalera, José García Rodríguez
InVERGe: Intelligent Visual Encoder for Bridging Modalities in Report Generation
Ankan Deria, Komal Kumar, Snehashis Chakraborty, Dwarikanath Mahapatra, Sudipta Roy
Joint Multimodal Transformer for Emotion Recognition in the Wild
Paul Waligora, Muhammad Haseeb Aslam, Muhammad Osama Zeeshan, Soufiane Belharbi, Alessandro Lameiras Koerich, Marco Pedersoli, Simon Bacon, Eric Granger
Joint Physical-Digital Facial Attack Detection via Simulating Spoofing Clues
Xianhua He, Dashuang Liang, Song Yang, Zhanlong Hao, Hui Ma, Binjie Mao, Xi Li, Yao Wang, Pengfei Yan, Ajian Liu
Key Patches Are All You Need: A Multiple Instance Learning Framework for Robust Medical Diagnosis
D. J. Araújo, Maria Rita Verdelho, Alceu Bissoto, Jacinto C. Nascimento, Carlos Santiago, Catarina Barata
LaDiffGAN: Training GANs with Diffusion Supervision in Latent Spaces
Xuhui Liu, Bohan Zeng, Sicheng Gao, Shanglin Li, Yutang Feng, Hong Li, Boyu Liu, Jianzhuang Liu, Baochang Zhang
LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints
Mengmeng Liu, Hao Cheng, Lin Chen, Hellward Broszio, Jiangtao Li, Runjiang Zhao, Monika Sester, Michael Ying Yang
Language-Guided Multi-Modal Emotional Mimicry Intensity Estimation
Feng Qiu, Wei Zhang, Chen Liu, Lincheng Li, Heming Du, Tianchen Guo, Xin Yu
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Taehoon Kim, Mark Marsden, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Alessandra Sala, Seung Hwan Kim
Latent Flow Diffusion for Deepfake Video Generation
Aashish Chandra K, A V Aashutosh, Srijan Das, Abhijit Das
Latent-Based Diffusion Model for Long-Tailed Recognition
Pengxiao Han, Changkun Ye, Jieming Zhou, Jing Zhang, Jie Hong, Xuesong Li
Learning Optimized Low-Light Image Enhancement for Edge Vision Tasks
Sma Sharif, Azamat Myrzabekov, Nodirkhuja Khujaev, Roman Tsoy, Seongwan Kim, Jaeho Lee
Learning Transferable Compound Expressions from Masked AutoEncoder Pretraining
Feng Qiu, Heming Du, Wei Zhang, Chen Liu, Lincheng Li, Tianchen Guo, Xin Yu
Leveraging Large Language Models for Multimodal Search
Oriol Barbany, Michael Huang, Xinliang Zhu, Arnab Dhua
Lift-Attend-Splat: Bird's-Eye-View Camera-LiDAR Fusion Using Transformers
James Gunn, Zygmunt Lenyk, Anuj Sharma, Andrea Donati, Alexandru Buburuzan, John Redford, Romain Mueller
Lifting Multi-View Detection and Tracking to the Bird's Eye View
Torben Teepe, Philipp Wolters, Johannes Gilg, Fabian Herzog, Gerhard Rigoll
Listen Then See: Video Alignment with Speaker Attention
Aviral Agrawal, Carlos Mateo Samudio Lezcano, Iqui Balam Heredia-Marin, Prabhdeep Singh Sethi
LOFI: LOng-Tailed FIne-Grained Network for Food Recognition
Jesús M. Rodríguez-de-Vera, Imanol G. Estepa, Marc Bolaños, Bhalaji Nagarajan, Petia Radeva
Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition
Hasan Abed Al Kader Hammoud, Shuming Liu, Mohammed Alkhrashi, Fahad Albalawi, Bernard Ghanem
MambaPupil: Bidirectional Selective Recurrent Model for Event-Based Eye Tracking
Zhong Wang, Zengyu Wan, Han Han, Bohao Liao, Yuliang Wu, Wei Zhai, Yang Cao, Zheng-Jun Zha
Masked Autoencoders Are Secretly Efficient Learners
Zihao Wei, Chen Wei, Jieru Mei, Yutong Bai, Zeyu Wang, Xianhang Li, Hongru Zhu, Huiyu Wang, Alan L. Yuille, Yuyin Zhou, Cihang Xie
MaskSim: Detection of Synthetic Images by Masked Spectrum Similarity Analysis
Yanhao Li, Quentin Bammey, Marina Gardella, Tina Nikoukhah, Jean-Michel Morel, Miguel Colom, Rafael Grompone von Gioi
Matting Anything
Jiachen Li, Jitesh Jain, Humphrey Shi
Medium Scale Benchmark for Cricket Excited Actions Understanding
Altaf Hussain, Noman Khan, Muhammad Munsif, Min Je Kim, Sung Wook Baik
MIMIC: Masked Image Modeling with Image Correspondences
Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Patrick Howe, Sharan Ranjit S, Anand Bhattad, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna
MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results
Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng, Yongyong Chen, Jingyong Su, Xianyu Guan, Hongyuan Yu, Cheng Wan, Jiamin Lin, Binnan Han, Yajun Zou, Zhuoyuan Wu, Yuan Huang, Yongsheng Yu, Daoan Zhang, Jizhe Li, Xuanwu Yin, Kunlong Zuo, Yunfan Lu, Yijie Xu, Wenzong Ma, Weiyu Guo, Hui Xiong, Wei Yu, Bingchun Luo, Sabari Nathan, Priya Kansal
MIPI 2024 Challenge on Few-Shot RAW Image Denoising: Methods and Results
Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, Wenhan Luo, Zikun Liu, Mingde Qiao, Junjun Jiang, Kui Jiang, Yao Xiao, Chuyang Sun, Jinhui Hu, Weijian Ruan, Yubo Dong, Kai Chen, Hyejeong Jo, Jiahao Qin, Bingjie Han, Pinle Qin, Rui Chai, Pengyuan Wang
MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results
Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu Jin, Guanqun Liu, Chen Change Loy
Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT
Miguel Ortiz del Castillo, Jonathan Morgan, Jack McRobbie, Clint Therakam, Zaher Joukhadar, Robert Mearns, Simon Barraclough, Richard O. Sinnott, Andrew Woods, Chris Bayliss, Kris Ehinger, Benjamin I. P. Rubinstein, James Bailey, Airlie Chapman, Michele Trenti
MMIST-ccRCC: A Real World Medical Dataset for the Development of Multi-Modal Systems
Tiago Mota, Maria Rita Verdelho, Diogo J. Araújo, Alceu Bissoto, Carlos Santiago, Catarina Barata
Mobile Aware Denoiser Network (MADNet) for Quad Bayer Images
Pavan C. Madhusudana, Jing Li, Zeeshan Nadir, Hamid R. Sheikh, Seok-Jun Lee
Model-Guided Contrastive Fine-Tuning for Industrial Anomaly Detection
Aitor Artola, Yannis Kolodziej, Jean-Michel Morel, Thibaud Ehret
Modeling Detailed Human Geometry with Adaptive Local Refinement
Bang Du, Kunyao Chen, Haochen Zhang, Fei Yin, Baichuan Wu, Truong Nguyen
Monitoring Social Insect Activity with Minimal Human Supervision
Tarun Sharma, Julian Morgan Wagner, Sara Beery, William B. Dickson, Michael H. Dickinson, Joseph Parker
Motion-Aware Needle Segmentation in Ultrasound Images
Raghavv Goel, Cecilia G. Morales, Manpreet Singh, Artur Dubrawski, John Galeotti, Howie Choset
Motorcyclist Helmet Violation Detection Framework by Leveraging Robust Ensemble and Augmentation Methods
Thien Van Luong, Huu Si Phuc Nguyen, Duy Khanh Dinh, Viet Hung Duong, Duy Hong Sam Vo, Huan Vu, Minh Tuan Hoang, Tien Cuong Nguyen
MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images
Ke-Lei Wang, Pin-Hsuan Chou, Young-Ching Chou, Chia-Jen Liu, Cheng-Kuan Lin, Yu-Chee Tseng
Multi Model Ensemble for Compound Expression Recognition
Jun Yu, Jichao Zhu, Wangyuan Zhu, Zhongpeng Cai, Gongpeng Zhao, Zhihong Wei, Guochen Xie, Zerui Zhang, Qingsong Liu, Jiaen Liang
Multi-Modal Aerial View Image Challenge: SAR Classification
Spencer Low, Oliver Nina, Dylan Bowald, Angel Domingo Sappa, Nathan Inkawhich, Peter Bruns
Multi-Modal Aerial View Image Challenge: Sensor Domain Translation
Spencer Low, Oliver Nina, Dylan Bowald, Angel Domingo Sappa, Nathan Inkawhich, Peter Bruns
Multi-Modal Arousal and Valence Estimation Under Noisy Conditions
Denis Dresvyanskiy, Maxim Markitantov, Jiawei Yu, Heysem Kaya, Alexey Karpov
Multi-Modal Hit Detection and Positional Analysis in Padel Competitions
Robbe Decorte, Martin Paré, Jelle Vanhaeverbeke, Joachim Taelman, Maarten Slembrouck, Steven Verstockt
Multi-Perspective Traffic Video Description Model with Fine-Grained Refinement Approach
Tuan-An To, Minh-Nam Tran, Trong-Bao Ho, Thien-Loc Ha, Quang-Tan Nguyen, Hoang-Chau Luong, Thanh-Duy Cao, Minh-Triet Tran
Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments
Benoît Gérin, Anaïs Halin, Anthony Cioppa, Maxim Henry, Bernard Ghanem, Benoît Macq, Christophe De Vleeschouwer, Marc Van Droogenbroeck
Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition
Marah Halawa, Florian Blume, Pia Bideau, Martin Maier, Rasha Abdel Rahman, Olaf Hellwich
Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation
Mathis Petrovich, Or Litany, Umar Iqbal, Michael J. Black, Gül Varol, Xue Bin Peng, Davis Rempe
Multi-View Spatial-Temporal Learning for Understanding Unusual Behaviors in Untrimmed Naturalistic Driving Videos
Huy-Hung Nguyen, Chi Dai Tran, Long Hoang Pham, Duong Nguyen-Ngoc Tran, Tai Huu-Phuong Tran, Duong Khac Vu, Quoc Pham-Nam Ho, Ngoc Doan-Minh Huynh, Hyung-Min Jeon, Hyung-Joon Jeon, Jae Wook Jeon
MultiPanoWise: Holistic Deep Architecture for Multi-Task Dense Prediction from a Single Panoramic Image
Uzair Shah, Muhammad Tukur, Mahmood Alzubaidi, Giovanni Pintore, Enrico Gobbetti, Mowafa S. Househ, Jens Schneider, Marco Agus
Must Unsupervised Continual Learning Relies on Previous Information?
Haoyang Cheng, Haitao Wen, Heqian Qiu, Lanxiao Wang, Minjian Zhang, Hongliang Li
MV-Soccer: Motion-Vector Augmented Instance Segmentation for Soccer Player Tracking
Fahad Majeed, Nauman Ullah Gilal, Khaled A. Al-Thelaya, Yin Yang, Marco Agus, Jens Schneider
MvAV-pix2pixHD: Multi-View Aerial View Image Translation
Jun Yu, Keda Lu, Shenshen Du, Lin Xu, Peng Chang, Houde Liu, Bin Lan, Tianyu Liu
NeRF as Pretraining at Scale: Generalizable 3D-Aware Semantic Representation Learning from View Prediction
Wenyan Cong, Hanxue Liang, Zhiwen Fan, Peihao Wang, Yifan Jiang, Dejia Xu, A. Cengiz Öztireli, Zhangyang Wang
Neural Fields for Co-Reconstructing 3D Objects from Incidental 2D Data
Dylan Campbell, Eldar Insafutdinov, João F. Henriques, Andrea Vedaldi
NICE: CVPR 2023 Challenge on Zero-Shot Image Captioning
Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae-Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, Jonghwan Mun, Solgil Oh, Kenan Emir Ak, Gwang-Gook Lee, Yan Xu, Mingwei Shen, Kyomin Hwang, Wonsik Shin, Kamin Lee, Wonhark Park, Dongkwan Lee, Nojun Kwak, Yujin Wang, Yimu Wang, Tiancheng Gu, Xingchang Lv, Mingmao Sun
nnMobileNet: Rethinking CNN for Retinopathy Research
Wenhui Zhu, Peijie Qiu, Xiwen Chen, Xin Li, Natasha Leporé, Oana M. Dumitrascu, Yalin Wang
NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer
Sai Kumar Reddy Manne, Brendan Martin, Tyler Roy, Ryan Neilson, Rebecca Peters, Meghana Chillara, Christine W. Lary, Katherine J. Motyl, Michael Wan
NTIRE 2024 Challenge on Blind Enhancement of Compressed Image: Methods and Results
Ren Yang, Radu Timofte, Bingchen Li, Xin Li, Mengxi Guo, Shijie Zhao, Li Zhang, Zhibo Chen, Dongyang Zhang, Yash Arora, Aditya Arora, Yuanbin Chen, Hui Tang, Tao Wang, Longxuan Zhao, Bin Chen, Tong Tong, Qiao Mo, Jingwei Bao, Jinhua Hao, Yukang Ding, Hantang Li, Ming Sun, Chao Zhou, Shuyuan Zhu, Zhi Jin, Wei Wang, Dandan Zhan, Jiawei Wu, Jiahao Wu, Luwei Tu, Hongyu An, Xinfeng Zhang, Woon-Ha Yeo, Wang-Taek Oh, Young-Il Kim, Han-Cheol Ryu, Long Sun, Mingjun Zhen, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Yapeng Du, Ao Li, Ziyang He, Lei Luo, Ce Zhu, Xin Yao, Sunder Ali Khowaja, Ikhyun Lee, Jaeho Lee, Seongwan Kim, Sma Sharif, Nodirkhuja Khujaev, Roman Tsoy
NTIRE 2024 Challenge on Bracketing Image Restoration and Enhancement: Datasets, Methods and Results
Zhilu Zhang, Shuohao Zhang, Renlong Wu, Wangmeng Zuo, Radu Timofte, Xiaoxia Xing, Hyunhee Park, Sejun Song, Changho Kim, Xiangyu Kong, Jinlong Wu, Jianxing Zhang, Jingfan Tan, Zikun Liu, Wenhan Luo, Wenjie Lin, Chengzhi Jiang, Mingyan Han, Zhen Liu, Ting Jiang, Jinting Luo, Shen Cheng, Linze Li, Xinhan Niu, Shuaicheng Liu, Kexin Dai, Kangzhen Yang, Tao Hu, Xiangyu Chen, Yu Cao, Qingsen Yan, Yanning Zhang, Genggeng Chen, Yongqing Yang, Wei Dong, Xinwei Dai, Yuanbo Zhou, Xintao Qiu, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong, Peng Zhang, Yifei Chen, Wenbo Xiong, Zhijun Song, Pu Cheng, Taolue Feng, Yunqing He, Daiguo Zhou, Ying Huang, Xiaowen Ma, Peng Wu
NTIRE 2024 Challenge on HR Depth from Images of Specular and Transparent Surfaces
Pierluigi Zama Ramirez, Fabio Tosi, Luigi Di Stefano, Radu Timofte, Alex Costanzino, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Yangyang Zhang, Cailin Wu, Zhuangda He, Shuangshuang Yin, Jiaxu Dong, Yangchenxu Liu, Hao Jiang, Jun Shi, Yong A, Yixiang Jin, Dingzhe Li, Bingxin Ke, Anton Obukhov, Tinafu Wang, Nando Metzger, Shengyu Huang, Konrad Schindler, Yachuan Huang, Jiaqi Li, Junrui Zhang, Yiran Wang, Zihao Huang, Tianqi Liu, Zhiguo Cao, Pengzhi Li, Jui-Lin Wang, Wenjie Zhu, Hui Geng, Yuxin Zhang, Long Lan, Kele Xu, Tao Sun, Qisheng Xu, Sourav Saini, Aashray Gupta, Sahaj K. Mistry, Aryan Shukla, Vinit Jakhetiya, Sunil Prasad Jaiswal, Yuejin Sun, Zhuofan Zheng, Yi Ning, Jen-Hao Cheng, Hou-I Liu, Hsiang-Wei Huang, Cheng-Yen Yang, Zhongyu Jiang, Yi-Hao Peng, Aishi Huang, Jenq-Neng Hwang
NTIRE 2024 Challenge on Image Super-Resolution (×4): Methods and Results
Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou, Hongyu An, Xinfeng Zhang, Zhiyuan Song, Ziyue Dong, Qing Zhao, Xiaogang Xu, Pengxu Wei, Zhi-Chao Dou, Gui-Ling Wang, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Cansu Korkmaz, A. Murat Tekalp, Yubin Wei, Xiaole Yan, Binren Li, Haonan Chen, Siqi Zhang, Sihan Chen, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi, Anjali Sarvaiya, Pooja Choksy, Jagrit Joshi, Shubh Kawa, Kishor P. Upla, Sushrut Patwardhan, Raghavendra Ramachandra, Sadat Hossain, Geongi Park, S. M. Nadim Uddin, Hao Xu, Yanhui Guo, Aman Urumbekov, Xingzhuo Yan, Wei Hao, Minghan Fu, Isaac Orais, Samuel Smith, Ying Liu, Wangwang Jia, Qisheng Xu, Kele Xu, Weijun Yuan, Zhan Li, Wenqing Kuang, Ruijin Guan, Ruting Deng, Zhao Zhang, Bo Wang, Suiyi Zhao, Yan Luo, Yanyan Wei, Asif Hussain Khan, Christian Micheloni, Niki Martinel
NTIRE 2024 Challenge on Light Field Image Super-Resolution: Methods and Results
Yingqian Wang, Zhengyu Liang, Qianyu Chen, Longguang Wang, Jungang Yang, Radu Timofte, Yulan Guo, Wentao Chao, Yiming Kan, Xuechun Wang, Fuqing Duan, Guanghui Wang, Wang Xia, Ziqi Wang, Yue Yan, Peiqi Xia, Shunzhou Wang, Yao Lu, Angulia Yang, Kai Jin, Zeqiang Wei, Sha Guo, Mingzhi Gao, Xiuzhuang Zhou, Zhongxin Yu, Shaofei Luo, Cheng Zhong, Shaorui Chen, Long Peng, Yuhong He, Gaosheng Liu, Huanjing Yue, Jingyu Yang, Zhengjian Yao, Jiakui Hu, Lujia Jin, Zhi-Song Liu, Chenhang He, Jun Xiao, Xiuyuan Wang, Zonglin Tian, Yifan Mao, Deyang Liu, Shizheng Li, Ping An
NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results
Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan L. Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin, Yu Zhu, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang, Qingsen Yan, Wenbin Zou, Weipeng Yang, Yunxiang Li, Qiaomu Wei, Tian Ye, Sixiang Chen, Zhao Zhang, Suiyi Zhao, Bo Wang, Yan Luo, Zhichao Zuo, Mingshen Wang, Junhu Wang, Yanyan Wei, Xiaopeng Sun, Yu Gao, Jiancheng Huang, Hongming Chen, Xiang Chen, Hui Tang, Yuanbin Chen, Yuanbo Zhou, Xinwei Dai, Xintao Qiu, Wei Deng, Qinquan Gao, Tong Tong, Mingjia Li, Jin Hu, Xinyu He, Xiaojie Guo, Sabarinathan, K. Uma, A. Sasithradevi, B. Sathya Bama, S. Mohamed Mansoor Roomi, V. Srivatsav, Jinjuan Wang, Long Sun, Qiuying Chen, Jiahong Shao, Yizhi Zhang, Marcos V. Conde, Daniel Feijoo, Juan C. Benito, Álvaro García, Jaeho Lee, Seongwan Kim, Sma Sharif, Nodirkhuja Khujaev, Roman Tsoy, Ali Murtaza, Uswah Khairuddin, Ahmad 'Athif Mohd Faudzi, Sampada Malagi, Amogh Joshi, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Wenyi Lian, Wenjing Lian, Jagadeesh Kalyanshetti, Vijayalaxmi Ashok Aralikatti, Palani Yashaswini, Nitish Upasi, Dikshit Hegde, Ujwala Patil, Sujata C, Xingzhuo Yan, Wei Hao, Minghan Fu, Pooja Choksy, Anjali Sarvaiya, Kishor P. Upla, Kiran B. Raja, Hailong Yan, Yunkai Zhang, Baiang Li, Jingyi Zhang, Huan Zheng
NTIRE 2024 Challenge on Night Photography Rendering
Egor I. Ershov, Artyom Panshin, Oleg Karasev, Sergey Korchagin, Shepelev Lev, Alexandr Startsev, Daniil Vladimirov, Ekaterina Zaychenkova, Nikola Banic, Dmitrii Iarchuk, Maria Efimova, Radu Timofte, Arseniy P. Terekhin
NTIRE 2024 Challenge on Short-Form UGC Video Quality Assessment: Methods and Results
Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie Zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, Wangmeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, Huimin Zheng, Junhao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu
NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results
Longguang Wang, Yulan Guo, Juncheng Li, Hongda Liu, Yang Zhao, Yingqian Wang, Zhi Jin, Shuhang Gu, Radu Timofte
NTIRE 2024 Dense and Non-Homogeneous Dehazing Challenge Report
Codruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, Yidi Liu, Xingbo Wang, Yurui Zhu, Gege Shi, Xin Lu, Xueyang Fu, Zheng-Jun Zha, Wei Dong, Han Zhou, Ruiyi Wang, Xiaohong Liu, Guangtao Zhai, Jun Chen, Wei Song, Yichang Gao, Jiahao Xiong, Hualiang Lin, Xianger Li, Dong Li, Mohab Kishawy, Ruibin Li, Seyed Amirreza Mousavi, Rana Rauf, Yangyi Liu, Huan Liu, Mingsheng Tu, Kele Xu, JiaWen Chen, Qisheng Xu, Tao Sun, Jin Guo, Ben Shao, Tianli Liu, Mohao Wu, Xingzhuo Yan, Minghan Fu, Lehan Yang, Xin Lin, Lu Qi, Jincen Song, Xiaoqian Hu, Linwei Tao, Hongming Chen, Xiang Chen, Chuanlong Xie, Zhao Zhang, Junhu Wang, Yanyan Wei, Suiyi Zhao, Shengeng Tang, Sampada Malagi, Amogh Joshi, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Wenjing Jiang, Jagadeesh Kalyanshetti, Vijayalaxmi Ashok Aralikatti, Palani Yashaswini, Nitish Upasi, Dikshit Hegde, Ujwala Patil, Sujata C
NTIRE 2024 Image Shadow Removal Challenge Report
Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Zongwei Wu, Cailian Chen, Radu Timofte, Wei Dong, Han Zhou, Yuqiong Tian, Jun Chen, Xueyang Fu, Xin Lu, Yurui Zhu, Xi Wang, Dong Li, Jie Xiao, Yunpeng Zhang, Zheng-Jun Zha, Zhao Zhang, Suiyi Zhao, Bo Wang, Yan Luo, Yanyan Wei, Zhihao Zhao, Long Sun, Tingting Yang, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Bilel Benjdira, Mohammed Nassif, Anis Koubaa, Ahmed Elhayek, Anas M. Ali, Kyotaro Tokoro, Kento Kawai, Kaname Yokoyama, Takuya Seno, Yuki Kondo, Norimichi Ukita, Chenghua Li, Bo Yang, Zhiqi Wu, Gao Chen, Yihan Yu, Sixiang Chen, Kai Zhang, Tian Ye, Wenbin Zou, Yunlong Lin, Zhaohu Xing, Jinbin Bai, Wenhao Chai, Lei Zhu, Ritik Maheshwari, Rakshank Verma, Rahul Tekchandani, Praful Hambarde, Satya Narayan Tazi, Santosh Kumar Vipparthi, Subrahmanyam Murala, Jaeho Lee, Seongwan Kim, Sma Sharif, Nodirkhuja Khujaev, Roman Tsoy, Fan Gao, Weidan Yan, Wenze Shao, Dengyin Zhang, Bin Chen, Siqi Zhang, Yanxin Qian, Yuanbin Chen, Yuanbo Zhou, Tong Tong, Rongfeng Wei, Ruiqi Sun, Yue Liu, Nikhil Akalwadi, Amogh Joshi, Sampada Malagi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Ali Murtaza, Uswah Khairuddin, Ahmad 'Athif Mohd Faudzi, Adinath Dukre, Vivek Deshmukh, Shruti S. Phutke, Ashutosh Kulkarni, Anil Gonde, Arun karthik K, Manasa N, Shri Hari Priya, Wei Hao, Xingzhuo Yan, Minghan Fu
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei Zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie Zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang, Yibin Huang, Shuai Liu, Yongqiang Li, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yuxiang Chen, Xiangyu Chen, Qiubo Chen, Fengyu Sun, Mengying Cui, Jiaxu Chen, Zhenyu Hu, Jingyun Liu, Wenzhuo Ma, Ce Wang, Hanyou Zheng, Wanjie Sun, Zhenzhong Chen, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Xiong Dun, Pengzhou Ji, Yujie Xing, Xuquan Wang, Zhanshan Wang, Xinbin Cheng, Jun Xiao, Chenhang He, Xiuyuan Wang, Zhi-Song Liu, Zimeng Miao, Zhicun Yin, Ming Liu, Wangmeng Zuo, Shuai Li
NurtureNet: A Multi-Task Video-Based Approach for Newborn Anthropometry
Yash Khandelwal, Mayur Arvind, Sriram Kumar, Ashish Gupta, Sachin Kumar Danisetty, Piyush Bagad, Anish Madan, Mayank Lunayach, Aditya Annavajjala, Abhishek Maiti, Sansiddh Jain, Aman Dalmia, Namrata Deka, Jerome White, Jigar Doshi, Angjoo Kanazawa, Rahul Panicker, Alpan Raval, Srinivas Rana, Makarand Tapaswi
OccFeat: Self-Supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko, Alexandre Boulch, Spyros Gidaris, Andrei Bursuc, Antonín Vobecký, Patrick Pérez, Renaud Marlet
OmniControlNet: Dual-Stage Integration for Conditional Image Generation
Yilin Wang, Haiyang Xu, Xiang Zhang, Zeyuan Chen, Zhizhou Sha, Zirui Wang, Zhuowen Tu
On the Efficiency of Privacy Attacks in Federated Learning
Nawrin Tabassum, Ka-Ho Chow, Xuyu Wang, Wenbin Zhang, Yanzhao Wu
One Class Classification-Based Quality Assurance of Organs-at-Risk Delineation in Radiotherapy
Yihao Zhao, Cuiyun Yuan, Ying Liang, Yang Li, Chunxia Li, Man Zhao, Jun Hu, Ningze Zhong, Chenbin Liu
Online Multi-Camera People Tracking with Spatial-Temporal Mechanism and Anchor-Feature Hierarchical Clustering
Riu Cherdchusakulchai, Sasin Phimsiri, Visarut Trairattanapa, Suchat Tungjitnob, Wasu Kudisthalert, Pornprom Kiawjak, Ek Thamwiwatthana, Phawat Borisuitsawat, Teepakorn Tosawadi, Pakcheera Choppradit, Kasisdis Mahakijdechachai, Supawit Vatathanavaro, Worawit Saetan, Vasin Suttichaya
Open-World Instance Segmentation: Top-Down Learning with Bottom-up Supervision
Tarun Kalluri, Weiyao Wang, Heng Wang, Manmohan Chandraker, Lorenzo Torresani, Du Tran
OpenStory: A Large-Scale Open-Domain Dataset for Subject-Driven Visual Storytelling
Zilyu Ye, Jinxiu Liu, Jinjin Cao, Zhiyang Chen, Ziwei Xuan, Mingyuan Zhou, Qi Liu, Guo-Jun Qi
OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities
Lasse H. Hansen, Simon Buus Jensen, Mark P. Philipsen, Andreas Møgelmose, Lars Bodum, Thomas B. Moeslund
Optimized Martian Dust Displacement Detection Using Explainable Machine Learning
Ana Lomashvili, Kristin Rammelkamp, Olivier Gasnault, Protim Bhattacharjee, Elise Clavé, Christoph H. Egerland, Susanne Schröder, Begüm Demir, Nina L. Lanza
Our Deep CNN Face Matchers Have Developed Achromatopsia
Aman Bhatta, Domingo Mery, Haiyu Wu, Joyce Annan, Michael C. King, Kevin W. Bowyer
Overlap Suppression Clustering for Offline Multi-Camera People Tracking
Ryuto Yoshida, Junichi Okubo, Junichiro Fujii, Masazumi Amakata, Takayoshi Yamashita
Paediatric Pulse Rate Measurements: A Comparison of Methods Using Remote Photoplethysmography
Simon Wegerif, Ivan Veleslavov, Lieke Dorine van Putten, Kate Emily Bamford, Gauri Misra, Niall Mullen
Physics Based Camera Privacy: Lens and Network Co-Design to the Rescue
Marius Dufraisse, Marcela Carvalho, Pauline Trouvé-Peloux, Frédéric Champagnat
PitcherNet: Powering the Moneyball Evolution in Baseball Video Analytics
Jerrin Bright, Bavesh Balaji, Yuhao Chen, David A. Clausi, John S. Zelek
PointPrompt: A Multi-Modal Prompting Dataset for Segment Anything Model
Jorge Quesada, Mohammad Alotaibi, Mohit Prabhushankar, Ghassan AlRegib
Potential Risk Localization via Weak Labeling Out of Blind Spot
Kota Shimomura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
Probing Conceptual Understanding of Large Visual-Language Models
Madeline Schiappa, Raiyaan Abdullah, Shehreen Azad, Jared Claypoole, Michael Cogswell, Ajay Divakaran, Yogesh S. Rawat
PromptCIR: Blind Compressed Image Restoration with Prompt Learning
Bingchen Li, Xin Li, Yiting Lu, Ruoyu Feng, Mengxi Guo, Shijie Zhao, Li Zhang, Zhibo Chen
Prototype-Based Interpretable Model for Glaucoma Detection
Mohana Singh, B. S. Vivek, Jayavardhana Gubbi, Arpan Pal
Prune Efficiently by Soft Pruning
Parakh Agarwal, Manu Mathew, Kunal Ranjan Patel, Varun Tripathi, Pramod Swami
Pruning as a Binarization Technique
Lukas Frickenstein, Pierpaolo Morì, Shambhavi Balamuthu Sampath, Moritz Thoma, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Christian Unger, Claudio Passerone, Walter Stechele
QAttn: Efficient GPU Kernels for Mixed-Precision Vision Transformers
Piotr Kluska, Adrián Castelló, Florian Scheidegger, A. Cristiano I. Malossi, Enrique S. Quintana-Ortí
Quality-Based Artifact Modeling for Facial Deepfake Detection in Videos
Sara Concas, Simone Maurizio La Cava, Roberto Casula, Giulia Orrù, Giovanni Puglisi, Gian Luca Marcialis
QuantNAS: Quantization-Aware Neural Architecture Search for Efficient Deployment on Mobile Device
Tianxiao Gao, Li Guo, Shanwei Zhao, Peihan Xu, Yukun Yang, Xionghao Liu, Shihao Wang, Shiai Zhu, Dajiang Zhou
Radar Fields: An Extension of Radiance Fields to SAR
Thibaud Ehret, Roger Marí, Dawa Derksen, Nicolas Gasnier, Gabriele Facciolo
Raising the Bar of AI-Generated Image Detection with CLIP
Davide Cozzolino, Giovanni Poggi, Riccardo Corvi, Matthias Nießner, Luisa Verdoliva
Real-Time 4k Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey
Marcos V. Conde, Zhijun Lei, Wen Li, Ioannis Katsavounidis, Radu Timofte, Min Yan, Xin Liu, Qian Wang, Xiaoqian Ye, Zhan Du, Tiansen Zhang, Zhiyuan Li, Hao Wei, Chenyang Ge, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Menghan Zhou, Yiqiang Yan, Kihwan Yoon, Ganzorig Gankhuyag, Jae-Hyeon Lee, Ui-Jin Choi, Hyeon-Cheol Moon, Tae Hyun Jeong, Yoonmo Yang, Jae-Gon Kim, Jinwoo Jeong, Sunjei Kim, Xintao Qiu, Yuanbo Zhou, Kongxian Wu, Xinwei Dai, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong, Long Peng, Jiaming Guo, Xin Di, Bohao Liao, Zhibo Du, Peize Xia, Renjing Pei, Yang Wang, Yang Cao, Zhengjun Zha, Bingnan Han, Hongyuan Yu, Zhuoyuan Wu, Cheng Wan, Yuqing Liu, Haodong Yu, Jizhe Li, Zhijuan Huang, Yuan Huang, Yajun Zou, Xianyu Guan, Qi Jia, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin
Reciprocal Attention Mixing Transformer for Lightweight Image Restoration
Haram Choi, Cheolwoong Na, Jihyeon Oh, Seungjae Lee, Jinseop Kim, Subeen Choe, Jeongmin Lee, Taehoon Kim, Jihoon Yang
Recognize Anything: A Strong Image Tagging Model
Youcai Zhang, Xinyu Huang, Jinyu Ma, Zhaoyang Li, Zhaochuan Luo, Yanchun Xie, Yuzhuo Qin, Tong Luo, Yaqian Li, Shilong Liu, Yandong Guo, Lei Zhang
Red-Teaming Segment Anything Model
Krzysztof Jankowski, Bartlomiej Sobieski, Mateusz Kwiatkowski, Jakub Szulc, Michal Janik, Hubert Baniecki, Przemyslaw Biecek
REFA: Real-Time Egocentric Facial Animations for Virtual Reality
Qiang Zhang, Tong Xiao, Haroun Habeeb, Larissa Laich, Sofien Bouaziz, Patrick Snape, Wenjing Zhang, Matthew Cioffi, Peizhao Zhang, Pavel Pidlypenskyi, Winnie Lin, Luming Ma, Mengjiao Wang, Kunpeng Li, Chengjiang Long, Steven Song, Martin Prazák, Alexander Sjoholm, Ajinkya Deogade, Jaebong Lee, Julio Delgado Mangas, Amaury Aubel
ReMOVE: A Reference-Free Metric for Object Erasure
Aditya Chandrasekar, Goirik Chakrabarty, Jai Bardhan, Ramya Hebbalaguppe, Prathosh Ap
Repurposing the Image Generative Potential: Exploiting GANs to Grade Diabetic Retinopathy
Isabella Poles, Eleonora D'Arnese, Luca G. Cellamare, Marco D. Santambrogio, Darvin Yi
Retina: Low-Power Eye Tracking with Event Camera and Spiking Hardware
Pietro Bonazzi, Sizhen Bian, Giovanni Lippolis, Yawei Li, Sadique Sheik, Michele Magno
RetinaLiteNet: A Lightweight Transformer Based CNN for Retinal Feature Segmentation
Mehwish Mehmood, Majed Alsharari, Shahzaib Iqbal, Ivor T. A. Spence, Muhammad Fahim
ReweightOOD: Loss Reweighting for Distance-Based OOD Detection
Sudarshan Regmi, Bibek Panthi, Yifei Ming, Prashnna K. Gyawali, Danail Stoyanov, Binod Bhattarai
RGB-D Cube R-CNN: 3D Object Detection with Selective Modality Dropout
Jens Piekenbrinck, Alexander Hermans, Narunas Vaskevicius, Timm Linder, Bastian Leibe
Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data
Tarun Kalluri, Jihyeon Lee, Kihyuk Sohn, Sahil Singla, Manmohan Chandraker, Joseph Xu, Jeremiah Z. Liu
Robust Perspective-N-Crater for Crater-Based Camera Pose Estimation
Sofia McLeod, Chee Kheng Chng, Tatsuharu Ono, Yuta Shimizu, Ryodo Hemmi, Lachlan Holden, Matthew Rodda, Feras Dayoub, Hirdy Miyamoto, Yukihiro Takahashi, Yasuko Kasai, Tat-Jun Chin
Robustness Analysis on Foundational Segmentation Models
Madeline Chantry Schiappa, Shehreen Azad, Sachidanand Vs, Yunhao Ge, Ondrej Miksik, Yogesh S. Rawat, Vibhav Vineet
Rugby Scene Classification Enhanced by Vision Language Model
Naoki Nonaka, Ryo Fujihira, Toshiki Koshiba, Akira Maeda, Jun Seita
S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal
Nikolina Kubiak, Armin Mustafa, Graeme Phillipson, Stephen Jolly, Simon Hadfield
SACReg: Scene-Agnostic Coordinate Regression for Visual Localization
Jérôme Revaud, Yohann Cabon, Romain Brégier, JongMin Lee, Philippe Weinzaepfel
SAD-GS: Shape-Aligned Depth-Supervised Gaussian Splatting
Pou-Chun Kung, Seth Isaacson, Ram Vasudevan, Katherine A. Skinner
Salient Object-Aware Background Generation Using Text-Guided Diffusion Models
Amir Erfan Eshratifar, João V. B. Soares, Kapil Thadani, Shaunak Mishra, Mikhail Kuznetsov, Yueh-Ning Ku, Paloma de Juan
SAM-CLIP: Merging Vision Foundation Models Towards Semantic and Spatial Understanding
Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari
Sat2Cap: Mapping Fine-Grained Textual Descriptions from Satellite Images
Aayush Dhakal, Adeel Ahmad, Subash Khanal, Srikumar Sastry, Hannah Kerner, Nathan Jacobs
Scaling Graph Convolutions for Mobile Vision
William Avery, Mustafa Munir, Radu Marculescu
SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations
Jamie Menjay Lin, Jisoo Jeong, Hong Cai, Risheek Garrepalli, Kai Wang, Fatih Porikli
SDFConnect: Neural Implicit Surface Reconstruction of a Sparse Point Cloud with Topological Constraints
Anushrut Jignasu, Aditya Balu, Soumik Sarkar, Chinmay Hegde, Baskar Ganapathysubramanian, Adarsh Krishnamurthy
Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data
Ivan DeAndres-Tame, Ruben Tolosana, Pietro Melzi, Rubén Vera-Rodríguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Aythami Morales, Julian Fiérrez, Javier Ortega-Garcia, Zhizhou Zhong, Yuge Huang, Yuxi Mi, Shouhong Ding, Shuigeng Zhou, Shuai He, Lingzhi Fu, Heng Cong, Rongyu Zhang, Zhihong Xiao, Evgeny Smirnov, Anton Pimenov, Aleksei Grigorev, Denis Timoshenko, Kaleb Mesfin Asfaw, Cheng-Yaw Low, Hao Liu, Chuyi Wang, Qing Zuo, Zhixiang He, Hatef Otroshi-Shahreza, Anjith George, Alexander Unnervik, Parsa Rahimi, Sébastien Marcel, Pedro C. Neto, Marco Huber, Jan Niklas Kolf, Naser Damer, Fadi Boutros, Jaime S. Cardoso, Ana Filipa Sequeira, Andrea Atzori, Gianni Fenu, Mirko Marras, Vitomir Struc, Jiang Yu, Zhangjie Li, Jichun Li, Weisong Zhao, Zhen Lei, Xiangyu Zhu, Xiaoyu Zhang, Bernardo Biesseck, Pedro Vidal, Luiz Coelho, Roger Granada, David Menotti
Seeing the Vibration from Fiber-Optic Cables: Rain Intensity Monitoring Using Deep Frequency Filtering
Zhuocheng Jiang, Yangmin Ding, Junhui Zhao, Yue Tian, Shaobo Han, Sarper Ozharar, Ting Wang, James M. Moore
Segment Anything in Food Images
Saeed S. Alahmari, Michael Gardner, Tawfiq Salem
Segment Anything Model for Road Network Graph Extraction
Congrui Hetang, Haoru Xue, Cindy X. Le, Tianwei Yue, Wenping Wang, Yihui He
Selective Multi-View Deep Model for 3D Object Classification
Mona Saleh Alzahrani, Muhammad Usman, Saeed Anwar, Tarek Helmy
Semi-Stereo: A Universal Stereo Matching Framework for Imperfect Data via Semi-Supervised Learning
Xin Yue, Zongqing Lu, Xiangru Lin, Wenjia Ren, Zhijing Shao, Haonan Hu, Yu Zhang, Qingmin Liao
Shadow Removal via Global Residual Free UNet and Shadow Generation
Dong Li, Xin Lu, Yurui Zhu, Xi Wang, Jie Xiao, Yunpeng Zhang, Xueyang Fu, Zheng-Jun Zha
ShadowRefiner: Towards Mask-Free Shadow Removal via Fast Fourier Transformer
Wei Dong, Han Zhou, Yuqiong Tian, Jingke Sun, Xiaohong Liu, Guangtao Zhai, Jun Chen
SimpliCity: Reconstructing Buildings with Simple Regularized 3D Models
Jean-Philippe Bauchet, Raphael Sulzer, Florent Lafarge, Yuliya Tarabalka
Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection Using Budding Ensemble Architecture for Object Detection
Syed Sha Qutub, Michael Paulitsch, Kay-Ulrich Scholl, Neslihan Köse Cihangir, Korbinian Hagn, Fabian Oboril, Gereon Hinz, Alois Knoll
SkipPLUS: Skip the First Few Layers to Better Explain Vision Transformers
Faridoun Mehri, Mohsen Fayyaz, Mahdieh Soleymani Baghshah, Mohammad Taher Pilehvar
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap
Vladimir Somers, Victor Joos, Anthony Cioppa, Silvio Giancola, Seyed Abolfazl Ghasemzadeh, Floriane Magera, Baptiste Standaert, Amir M. Mansourian, Xin Zhou, Shohreh Kasaei, Bernard Ghanem, Alexandre Alahi, Marc Van Droogenbroeck, Christophe De Vleeschouwer
SoccerNet-Depth: A Scalable Dataset for Monocular Depth Estimation in Sports Videos
Arnaud Leduc, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck Source-Free Domain Adaptation of Weakly-Supervised Object Localization Models for Histology
Alexis Guichemerre, Soufiane Belharbi, Tsiry Mayet, Shakeeb Murtaza, Pourya Shamsolmoali, Luke McCaffrey, Eric Granger SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection
Mathis Kruse, Marco Rudolph, Dominik Woiwode, Bodo Rosenhahn Cite
StampOne: Addressing Frequency Balance in Printer-Proof Steganography
Farhad Shadmand, Iurii Medvedev, Luiz Schirmer, João Marcos, Nuno Gonçalves
StegaNeRV: Video Steganography Using Implicit Neural Representation
Monsij Biswal, Tong Shao, Kenneth Rose, Peng Yin, Sean McCarthy
Structured Sparse Back-Propagation for Lightweight On-Device Continual Learning on Microcontroller Units
Francesco Paissan, Davide Nadalini, Manuele Rusci, Alberto Ancilotto, Francesco Conti, Luca Benini, Elisabetta Farella
Style Transfer for 2D Talking Head Generation
Trong-Thang Pham, Tuong Do, Nhat Le, Ngan Le, Hung Nguyen, Erman Tjiputra, Quang Tran, Anh Nguyen
Super-Resolution of Biomedical Volumes with 2D Supervision
Cheng Jiang, Alexander Gedeon, Yiwei Lyu, Eric Landgraf, Yufeng Zhang, Xinhai Hou, Akhil Kondepudi, Asadur Chowdury, Honglak Lee, Todd C. Hollon
SuperLoRA: Parameter-Efficient Unified Adaptation for Large Vision Models
Xiangyu Chen, Jing Liu, Ye Wang, Pu Perry Wang, Matthew Brand, Guanghui Wang, Toshiaki Koike-Akino
Swift Parameter-Free Attention Network for Efficient Super-Resolution
Cheng Wan, Hongyuan Yu, Zhiqi Li, Yihang Chen, Yajun Zou, Yuqing Liu, Xuanwu Yin, Kunlong Zuo
T2FNorm: Train-Time Feature Normalization for OOD Detection in Image Classification
Sudarshan Regmi, Bibek Panthi, Sakar Dotel, Prashnna K. Gyawali, Danail Stoyanov, Binod Bhattarai
T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences
Taeryung Lee, Fabien Baradel, Thomas Lucas, Kyoung Mu Lee, Grégory Rogez
Table Tennis Ball Spin Estimation with an Event Camera
Thomas Gossard, Julian Krismer, Andreas Ziegler, Jonas Tebbe, Andreas Zell
Tackling Domain Shifts in Person Re-Identification: A Survey and Analysis
Vuong D. Nguyen, Samiha Mirza, Abdollah Zakeri, Ayush Gupta, Khadija Khaldi, Rahma Aloui, Pranav Mantini, Shishir K. Shah, Fatima A. Merchant
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models
Feipeng Ma, Yizhou Zhou, Yueyi Zhang, Siying Wu, Zheyu Zhang, Zilong He, Fengyun Rao, Xiaoyan Sun
TattTRN: Template Reconstruction Network for Tattoo Retrieval
Lázaro Janier González-Soler, Maciej Salwowski, Christian Rathgeb, Daniel Fischer
TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-Pitch Videos
Atom Scott, Ikuma Uchida, Ning Ding, Rikuhei Umemoto, Rory P. Bunker, Ren Kobayashi, Takeshi Koyama, Masaki Onishi, Yoshinari Kameda, Keisuke Fujii
Test Time Training for Industrial Anomaly Segmentation
Alex Costanzino, Pierluigi Zama Ramirez, Mirko Del Moro, Agostino Aiezzo, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano
Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero-Shot Medical Image Segmentation
Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor, Suzanne Little
Test-Time Specialization of Dynamic Neural Networks
Sam Leroux, Dewant Katare, Aaron Yi Ding, Pieter Simoens
The 6th Affective Behavior Analysis In-the-Wild (ABAW) Competition
Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Stefanos Zafeiriou, Irene Kotsia, Alice Baird, Chris Gagne, Chunchang Shao, Guanyu Hu
The 8th AI City Challenge
Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Pranamesh Chakraborty, Sanjita Prajapati, Quan Kong, Norimasa Kobori, Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Fady Alnajjar, Ganzorig Batnasan, Ping-Yang Chen, Jun-Wei Hsieh, Xunlei Wu, Sameer Satish Pusegaonkar, Yizhou Wang, Sujit Biswas, Rama Chellappa
The Myth of the Pyramid
Ramon Izquierdo-Cordova, Walterio W. Mayol-Cuevas
The New Agronomists: Language Models Are Experts in Crop Management
Jing Wu, Zhixin Lai, Suiyao Chen, Ran Tao, Pan Zhao, Naira Hovakimyan
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, Zhengjun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Qian Wang, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Emad Barsoum, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo Wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, Yuhan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
The Penalized Inverse Probability Measure for Conformal Classification
Paul Melki, Lionel Bombrun, Boubacar Diallo, Jérôme Dias, Jean-Pierre Da Costa
The Revenge of BiSeNet: Efficient Multi-Task Image Segmentation
Gabriele Rosi, Claudia Cuttano, Niccolò Cavagnero, Giuseppe Averta, Fabio Cermelli
The Third Monocular Depth Estimation Challenge
Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, YiPing Bao, Xiao Liu, Dohyeong Kim, Jinseong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, Jinqiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora, Pihai Sun, Kui Jiang, Gang Wu, Jian Liu, Xianming Liu, Junjun Jiang, Xidan Zhang, Jianing Wei, Fangjun Wang, Zhiming Tan, Jiabao Wang, Albert Luginov, Muhammad Shahzad, Seyed Hosseini, Aleksander Trajcevski, James H. Elder
Thermal Image Super-Resolution Challenge Results - PBVS 2024
Rafael E. Rivadeneira, Angel Domingo Sappa, Chenyang Wang, Junjun Jiang, Zhiwei Zhong, Peilin Chen, Shiqi Wang
Towards Engineered Safe AI with Modular Concept Models
Lena Heidemann, Iwo Kurzidem, Maureen Monnet, Karsten Roscher, Stephan Günnemann
Towards Quantitative Evaluation Metrics for Image Editing Approaches
Dana Cohen Hochberg, Oron Anschel, Alon Shoshan, Igor Kviatkovsky, Manoj Aggarwal, Gérard Guy Medioni
Tracking and Counting Apples in Orchards Under Intermittent Occlusions and Low Frame Rates
Gonçalo P. Matos, Carlos Santiago, João Paulo Costeira, Ricardo L. Saldanha, Ernesto M. Morgado
Tracklet-Based Explainable Video Anomaly Localization
Ashish Singh, Michael J. Jones, Erik G. Learned-Miller
Transformers for Orbit Determination Anomaly Detection and Classification
Nathan Parrish Ré, Matthew Popplewell, Michael Caudill, Timothy Sullivan, Tyler Hanf, Benjamin Tatman, Kanak Parmar, Tyler Presser, Sai Chikine, Michael Grant, Richard Poulson
Triage of 3D Pathology Data via 2.5D Multiple-Instance Learning to Guide Pathologist Assessments
Gan Gao, Andrew H. Song, Fiona Wang, David Brenes, Rui Wang, Sarah S. L. Chow, Kevin W. Bishop, Lawrence D. True, Faisal Mahmood, Jonathan T. C. Liu
Two Stage Dehazing Framework for Dense and Non-Homogeneous Dehazing
Wei Song, Yichang Gao, Jiahao Xiong, Hualiang Lin, Dong Li, Yun Zhang
UDAC: Under-Display Array Cameras
Chengyu Wang, Jing Li, Pavan C. Madhusudanarao, Jinhan Hu, Jitesh K. Singh, WooJhon Choi, Seok-Jun Lee, Hamid R. Sheikh
Uncertainty Estimation for Tumor Prediction with Unlabeled Data
Juyoung Yun, Shahira Abousamra, Chen Li, Rajarsi Gupta, Tahsin M. Kurç, Dimitris Samaras, Alison L. Van Dyke, Joel H. Saltz, Chao Chen
Uncertainty-Based Forgetting Mitigation for Generalized Few-Shot Object Detection
Karim Guirguis, George Eskandar, Mingyang Wang, Matthias Kayser, Eduardo Monari, Bin Yang, Jürgen Beyerer
Uncovering the Hidden Cost of Model Compression
Diganta Misra, Muawiz Chaudhary, Agam Goyal, Bharat Runwal, Pin-Yu Chen
Unified Physical-Digital Attack Detection Challenge
Haocheng Yuan, Ajian Liu, Junze Zheng, Jun Wan, Jiankang Deng, Sergio Escalera, Hugo Jair Escalante, Isabelle Guyon, Zhen Lei
Unsupervised Microscopy Video Denoising
Mary Damilola Aiyetigbo, Alexander Korte, Ethan Anderson, Reda Chalhoub, Peter Kalivas, Feng Luo, Nianyi Li
UP-NAS: Unified Proxy for Neural Architecture Search
Yi-Cheng Huang, Wei-Hua Li, Chih-Han Tsou, Jun-Cheng Chen, Chu-Song Chen
UVIS: Unsupervised Video Instance Segmentation
Shuaiyi Huang, Saksham Suri, Kamal Gupta, Sai Saketh Rambhatla, Ser-Nam Lim, Abhinav Shrivastava
Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation: A Unified Approach
Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor
Video Based Computational Coding of Movement Anomalies in ASD Children
Priya Singh, Abhishek Pathak, Umer Jon Ganai, Braj Bhushan, Venkatesh K. Subramanian
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images
Ali Nasiri-Sarvi, Vincent Quoc-Huy Trinh, Hassan Rivaz, Mahdi S. Hosseini
Vision-Language Models for Decoding Provider Attention During Neonatal Resuscitation
Felipe Parodi, Jordan K. Matelsky, Alejandra Regla-Vargas, Elizabeth E. Foglia, Charis Lim, Danielle Weinberg, Konrad P. Kording, Heidi M. Herrick, Michael L. Platt
Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning
Xin Xing, Zhexiao Xiong, Abby Stylianou, Srikumar Sastry, Liyu Gong, Nathan Jacobs
ViTKD: Feature-Based Knowledge Distillation for Vision Transformers
Zhendong Yang, Zhe Li, Ailing Zeng, Zexian Li, Chun Yuan, Yu Li
Wake-Sleep Energy Based Models for Continual Learning
Vaibhav Singh, Anna Choromanska, Shuang Li, Yilun Du
Weakly Supervised End2End Deep Visual Odometry
Amin Abouee, Ashwanth Ravi, Lars Hinneburg, Mateusz Dziwulski, Florian Ölsner, Jürgen Hess, Stefan Milz, Patrick Mäder
What Does CLIP Know About Peeling a Banana?
Claudia Cuttano, Gabriele Rosi, Gabriele Trivigno, Giuseppe Averta
What Is Point Supervision Worth in Video Instance Segmentation?
Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, José M. Álvarez, Abhinav Shrivastava, Anima Anandkumar
What Makes Multimodal In-Context Learning Work?
Folco Bertini Baldassini, Mustafa Shukor, Matthieu Cord, Laure Soulier, Benjamin Piwowarski
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Davide Caffagni, Federico Cocchi, Nicholas Moratelli, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
XoFTR: Cross-Modal Feature Matching Transformer
Önder Tuzcuoglu, Aybora Köksal, Bugra Sofu, Sinan Kalkan, A. Aydin Alatan
ZInD-Tell: Towards Translating Indoor Panoramas into Descriptions
Tonmoay Deb, Lichen Wang, Zachary Bessinger, Naji Khosravan, Eric Penner, Sing Bing Kang