WACV 2024
846 papers
360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View
Zhifeng Teng, Jiaming Zhang, Kailun Yang, Kunyu Peng, Hao Shi, Simon Reiß, Ke Cao, Rainer Stiefelhagen 3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image
Junuk Cha, Hansol Lee, Jaewon Kim, Nhat Nguyen Bao Truong, Jaeshin Yoon, Seungryul Baek 3D-Aware Talking-Head Video Motion Transfer
Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang 3SD: Self-Supervised Saliency Detection with No Labels
Rajeev Yasarla, Renliang Weng, Wongun Choi, Vishal M. Patel, Amir Sadeghian A Closer Look at Robustness of Vision Transformers to Backdoor Attacks
Akshayvarun Subramanya, Soroush Abbasi Koohpayegani, Aniruddha Saha, Ajinkya Tejankar, Hamed Pirsiavash A Geometry Loss Combination for 3D Human Pose Estimation
Ai Matsune, Shichen Hu, Guangquan Li, Sihan Wen, Xiantan Zhu, Zhiming Tan A Hybrid Graph Network for Complex Activity Detection in Video
Salman Khan, Izzeddin Teeti, Andrew Bradley, Mohamed Elhoseiny, Fabio Cuzzolin A Visual Active Search Framework for Geospatial Exploration
Anindya Sarkar, Michael Lanier, Scott Alfeld, Jiarui Feng, Roman Garnett, Nathan Jacobs, Yevgeniy Vorobeychik Active Learning for Single-Stage Object Detection in UAV Images
Asma Yamani, Albandari Alyami, Hamzah Luqman, Bernard Ghanem, Silvio Giancola Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-Free Continual Learning
Filip Szatkowski, Mateusz Pyla, Marcin Przewięźlikowski, Sebastian Cygert, Bartłomiej Twardowski, Tomasz Trzciński Adaptive Deep Neural Network Inference Optimization with EENet
Fatih Ilhan, Ka-Ho Chow, Sihao Hu, Tiansheng Huang, Selim Tekin, Wenqi Wei, Yanzhao Wu, Myungjin Lee, Ramana Kompella, Hugo Latapie, Gaowen Liu, Ling Liu Adversarial Likelihood Estimation with One-Way Flows
Omri Ben-Dov, Pravir Singh Gupta, Victoria Abrevaya, Michael J. Black, Partha Ghosh Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation
Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Pradyumna Ym, Akshay Kulkarni, Jogendra Nath Kundu, R. Venkatesh Babu An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning
Grégoire Petit, Michaël Soumm, Eva Feillet, Adrian Popescu, Bertrand Delezoide, David Picard, Céline Hudelot Annotation-Free Audio-Visual Segmentation
Jinxiang Liu, Yu Wang, Chen Ju, Chaofan Ma, Ya Zhang, Weidi Xie AnyStar: Domain Randomized Universal Star-Convex 3D Instance Segmentation
Neel Dey, Mazdak Abulnaga, Benjamin Billot, Esra Abaci Turk, Ellen Grant, Adrian V. Dalca, Polina Golland Appearance-Based Curriculum for Semi-Supervised Learning with Multi-Angle Unlabeled Data
Yuki Tanaka, Shuhei M. Yoshida, Takashi Shibata, Makoto Terao, Takayuki Okatani, Masashi Sugiyama ArcGeo: Localizing Limited Field-of-View Images Using Cross-View Matching
Maxim Shugaev, Ilya Semenov, Kyle Ashley, Michael Klaczynski, Naresh Cuntoor, Mun Wai Lee, Nathan Jacobs Are Natural Domain Foundation Models Useful for Medical Image Classification?
Joana Palés Huix, Adithya Raju Ganeshan, Johan Fredin Haslum, Magnus Söderberg, Christos Matsoukas, Kevin Smith ARNIQA: Learning Distortion Manifold for Image Quality Assessment
Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo Assessing Neural Network Robustness via Adversarial Pivotal Tuning
Peter Ebert Christensen, Vésteinn Snæbjarnarson, Andrea Dittadi, Serge Belongie, Sagie Benaim Asymmetric Image Retrieval with Cross Model Compatible Ensembles
Alon Shoshan, Ori Linial, Nadav Bhonker, Elad Hirsch, Lior Zamir, Igor Kviatkovsky, Gérard Medioni Attention Modules Improve Image-Level Anomaly Detection for Industrial Inspection: A DifferNet Case Study
André Luiz Vieira e Silva, Francisco Simões, Danny Kowerko, Tobias Schlosser, Felipe Battisti, Veronica Teichrieb AvatarOne: Monocular 3D Human Animation
Akash Karthikeyan, Robert Ren, Yash Kant, Igor Gilitschenski Back to Optimization: Diffusion-Based Zero-Shot 3D Human Pose Estimation
Zhongyu Jiang, Zhuoran Zhou, Lei Li, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang Bag of Tricks for Fully Test-Time Adaptation
Saypraseuth Mounsaveng, Florent Chiaroni, Malik Boudiaf, Marco Pedersoli, Ismail Ben Ayed Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness
Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Zachariah Carmichael, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug BEVMap: mAP-Aware BEV Modeling for 3D Perception
Mincheol Chang, Seokha Moon, Reza Mahjourian, Jinkyu Kim Beyond Document Page Classification: Design, Datasets, and Challenges
Jordy Van Landeghem, Sanket Biswas, Matthew Blaschko, Marie-Francine Moens Beyond RGB: A Real World Dataset for Multispectral Imaging in Mobile Devices
Ortal Glatt, Yotam Ater, Woo-Shik Kim, Shira Werman, Oded Berby, Yael Zini, Shay Zelinger, Sangyoon Lee, Heejin Choi, Evgeny Soloveichik Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation
Reza Azad, Leon Niggemeier, Michael Hüttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof Beyond SOT: Tracking Multiple Generic Objects at Once
Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova Bias and Diversity in Synthetic-Based Face Recognition
Marco Huber, Anh Thi Luu, Fadi Boutros, Arjan Kuijper, Naser Damer BPKD: Boundary Privileged Knowledge Distillation for Semantic Segmentation
Liyang Liu, Zihan Wang, Minh Hieu Phan, Bowen Zhang, Jinchao Ge, Yifan Liu Brainomaly: Unsupervised Neurologic Disease Detection Utilizing Unannotated T1-Weighted Brain MR Images
Md Mahfuzur Rahman Siddiquee, Jay Shah, Teresa Wu, Catherine Chong, Todd J. Schwedt, Gina Dumkrieger, Simona Nikolova, Baoxin Li C2AIR: Consolidated Compact Aerial Image Haze Removal
Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala CAD - Contextual Multi-Modal Alignment for Dynamic AVQA
Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa Can CLIP Help Sound Source Localization?
Sooyoung Park, Arda Senocak, Joon Son Chung Can You Even Tell Left from Right? Presenting a New Challenge for VQA
Sai Raam Venkataraman, Rishi Sridhar Rao, S. Balasubramanian, R. Raghunatha Sarma, Chandra Sekhar Vorugunti CARE: Counterfactual-Based Algorithmic Recourse for Explainable Pose Correction
Bhat Dittakavi, Bharathi Callepalli, Aleti Vardhan, Sai Vikas Desai, Vineeth N. Balasubramanian CATS: Combined Activation and Temporal Suppression for Efficient Network Inference
Zeqi Zhu, Arash Pourtaherian, Luc Waeijen, Ibrahim Batuhan Akkaya, Egor Bondarev, Orlando Moreira Causal Analysis for Robust Interpretability of Neural Networks
Ola Ahmad, Nicolas Béreux, Loïc Baret, Vahid Hashemi, Freddy Lecue CHAI: Craters in Historical Aerial Images
Marvin Burges, Sebastian Zambanini, Philipp Pirker CL-MAE: Curriculum-Learned Masked Autoencoders
Neelu Madan, Nicolae-Cătălin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu ClusterFix: A Cluster-Based Debiasing Approach Without Protected-Group Supervision
Giacomo Capitani, Federico Bolelli, Angelo Porrello, Simone Calderara, Elisa Ficarra Co-Speech Gesture Detection Through Multi-Phase Sequence Labeling
Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim Pouw, Peter Uhrig, Judith Holler, Ivan Toni, Aslı Özyürek, Raquel Fernández Collage Diffusion
Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian Consistent Multimodal Generation via a Unified GAN Framework
Zhen Zhu, Yijun Li, Weijie Lyu, Krishna Kumar Singh, Zhixin Shu, Sören Pirk, Derek Hoiem Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks
Yixuan Ren, Jing Shi, Zhifei Zhang, Yifei Fan, Zhe Lin, Bo He, Abhinav Shrivastava Context in Human Action Through Motion Complementarity
Eadom Dessalene, Michael Maynord, Cornelia Fermüller, Yiannis Aloimonos Continual Atlas-Based Segmentation of Prostate MRI
Amin Ranem, Camila González, Daniel Pinto dos Santos, Andreas M. Bucher, Ahmed E. Othman, Anirban Mukhopadhyay Continual Test-Time Domain Adaptation via Dynamic Sample Selection
Yanshuo Wang, Jie Hong, Ali Cheraghian, Shafin Rahman, David Ahmedt-Aristizabal, Lars Petersson, Mehrtash Harandi Contrastive Learning for Multi-Object Tracking with Transformers
Pierre-François De Plaen, Nicola Marinello, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool Controllable Image Synthesis of Industrial Data Using Stable Diffusion
Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneri Controllable Text-to-Image Synthesis for Multi-Modality MR Images
Kyuri Kim, Yoonho Na, Sung-Joon Ye, Jimin Lee, Sung Soo Ahn, Ji Eun Park, Hwiyoung Kim Correlation-Aware Active Learning for Surgery Video Segmentation
Fei Wu, Pablo Márquez-Neila, Mingyi Zheng, Hedyeh Rafii-Tari, Raphael Sznitman CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection
Quanfu Fan, Yilai Li, Yuguang Yao, John Cohn, Sijia Liu, Ziping Xu, Seychelle Vos, Michael Cianfrocco CSAM: A 2.5d Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation
Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Xiaoxi Du, Kaifeng Pang, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung D3GU: Multi-Target Active Domain Adaptation via Enhancing Domain Alignment
Lin Zhang, Linghan Xu, Saman Motamed, Shayok Chakraborty, Fernando De la Torre D4: Detection of Adversarial Diffusion Deepfakes Using Disjoint Ensembles
Ashish Hooda, Neal Mangaokar, Ryan Feng, Kassem Fawaz, Somesh Jha, Atul Prakash Data Augmentation for Object Detection via Controllable Diffusion Models
Haoyang Fang, Boran Han, Shuai Zhang, Su Zhou, Cuixiong Hu, Wen-Ming Ye DDAM-PS: Diligent Domain Adaptive Mixer for Person Search
Mohammed Khaleed Almansoori, Mustansar Fiaz, Hisham Cholakkal Deblur-NSFF: Neural Scene Flow Fields for Blurry Dynamic Scenes
Achleshwar Luthra, Shiva Souhith Gantha, Xiyun Song, Heather Yu, Zongfang Lin, Liang Peng DECDM: Document Enhancement Using Cycle-Consistent Diffusion Models
Jiaxin Zhang, Joy Rimchala, Lalla Mouatadid, Kamalika Das, Sricharan Kumar Defense Against Adversarial Cloud Attack on Remote Sensing Salient Object Detection
Huiming Sun, Lan Fu, Jinlong Li, Qing Guo, Zibo Meng, Tianyun Zhang, Yuewei Lin, Hongkai Yu Design Choices for Enhancing Noisy Student Self-Training
Aswathnarayan Radhakrishnan, Jim Davis, Zachary Rabin, Benjamin Lewis, Matthew Scherreik, Roman Ilin DeVos: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak, Yaroslav Romanus, Bohdan Hlovatskyi, Bohdan Sydor, Oles Dobosevych, Igor Babin, Roman Riazantsev Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava Differentiable JPEG: The Devil Is in the Details
Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar Differentially Private Video Activity Recognition
Zelun Luo, Yuliang Zou, Yijin Yang, Zane Durante, De-An Huang, Zhiding Yu, Chaowei Xiao, Li Fei-Fei, Animashree Anandkumar Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic Diffusion Models Meet Image Counter-Forensics
Matías Tailanián, Marina Gardella, Alvaro Pardo, Pablo Musé Discovering and Mitigating Biases in CLIP-Based Image Editing
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell Disentangled Pre-Training for Image Matting
Yanda Li, Zilong Huang, Gang Yu, Ling Chen, Yunchao Wei, Jianbo Jiao Distortion-Disentangled Contrastive Learning
Jinfeng Wang, Sifan Song, Jionglong Su, S. Kevin Zhou Diverse ImageNet Models Transfer Better
Niv Nayman, Avram Golbert, Asaf Noy, Lihi Zelnik-Manor Do VSR Models Generalize Beyond LRS3?
Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Eustache LeBihan, Haithem Boussaid, Ebtesam Almazrouei, Merouane Debbah DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
Fangchen Yu, Yina Xie, Lei Wu, Yafei Wen, Guozhi Wang, Shuai Ren, Xiaoxin Chen, Jianfeng Mao, Wenye Li Domain Adaptive 3D Shape Retrieval from Monocular Images
Harsh Pal, Ritwik Khandelwal, Shivam Pande, Biplab Banerjee, Srikrishna Karanam Domain Aligned CLIP for Few-Shot Classification
Muhammad Waleed Gondal, Jochen Gast, Inigo Alonso Ruiz, Richard Droste, Tommaso Macri, Suren Kumar, Luitpold Staudigl Domain Generalization by Rejecting Extreme Augmentations
Masih Aminbeidokhti, Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli Domain Generalization with Correlated Style Uncertainty
Zheyuan Zhang, Bin Wang, Debesh Jha, Ugur Demir, Ulas Bagci DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu, Zikai Lin, Donghao Zhou, Yaodong Yang, Xiangyun Liao, Qiong Wang, Bian Wu, Guangyong Chen, Pheng-Ann Heng DR10K: Transfer Learning Using Weak Labels for Grading Diabetic Retinopathy on DR10K Dataset
Mohamed ElHabebe, Shereen ElKordi, Ahmed Gamal ElDin, Noha Adly, Marwan Torki, Ahmed Elmassry, Islam SH Ahmed DREAM: Visual Decoding from Reversing Human Visual System
Weihao Xia, Raoul de Charette, Cengiz Oztireli, Jing-Hao Xue Dynamic Multimodal Information Bottleneck for Multimodality Classification
Yingying Fang, Shuang Wu, Sheng Zhang, Chaoyan Huang, Tieyong Zeng, Xiaodan Xing, Simon Walsh, Guang Yang Dynamic Token-Pass Transformers for Semantic Segmentation
Yuang Liu, Qiang Zhou, Jing Wang, Zhibin Wang, Fan Wang, Jun Wang, Wei Zhang ECSIC: Epipolar Cross Attention for Stereo Image Compression
Matthias Wödlinger, Jan Kotera, Manuel Keglevic, Jan Xu, Robert Sablatnig Effective Restoration of Source Knowledge in Continual Test Time Adaptation
Fahim Faisal Niloy, Sk Miraj Ahmed, Dripta S. Raychaudhuri, Samet Oymak, Amit K. Roy-Chowdhury Efficient MAE Towards Large-Scale Vision Transformers
Qiu Han, Gongjie Zhang, Jiaxing Huang, Peng Gao, Zhang Wei, Shijian Lu Empowering Unsupervised Domain Adaptation with Large-Scale Pre-Trained Vision-Language Models
Zhengfeng Lai, Haoping Bai, Haotian Zhang, Xianzhi Du, Jiulong Shan, Yinfei Yang, Chen-Nee Chuah, Meng Cao ENIGMA-51: Towards a Fine-Grained Understanding of Human Behavior in Industrial Scenarios
Francesco Ragusa, Rosario Leonardi, Michele Mazzamuto, Claudia Bonanno, Rosario Scavo, Antonino Furnari, Giovanni Maria Farinella EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields
Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta Evidential Uncertainty Quantification: A Variance-Based Perspective
Ruxiao Duan, Brian Caffo, Harrison X. Bai, Haris I. Sair, Craig Jones Exploiting the Signal-Leak Bias in Diffusion Models
Martin Nicolas Everaert, Athanasios Fitsios, Marco Bocchio, Sami Arpa, Sabine Süsstrunk, Radhakrishna Achanta FacadeNet: Conditional Facade Synthesis via Selective Editing
Yiangos Georgiou, Marios Loizou, Tom Kelly, Melinos Averkiou Face Identity-Aware Disentanglement in StyleGAN
Adrian Suwała, Bartosz Wójcik, Magdalena Proszewska, Jacek Tabor, Przemysław Spurek, Marek Śmieja FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude
Feng Liu, Ryan Ashbaugh, Nicholas Chimitt, Najmul Hassan, Ali Hassani, Ajay Jaiswal, Minchul Kim, Zhiyuan Mao, Christopher Perry, Zhiyuan Ren, Yiyang Su, Pegah Varghaei, Kai Wang, Xingguang Zhang, Stanley Chan, Arun Ross, Humphrey Shi, Zhangyang Wang, Anil Jain, Xiaoming Liu Fast Sun-Aligned Outdoor Scene Relighting Based on TensoRF
Yeonjin Chang, Yearim Kim, Seunghyeon Seo, Jung Yi, Nojun Kwak FastCLIPstyler: Optimisation-Free Text-Based Image Style Transfer Using Style Representations
Ananda Padhmanabhan Suresh, Sanjana Jain, Pavit Noinongyao, Ankush Ganguly, Ukrit Watchareeruetai, Aubin Samacoits Feed-Forward Latent Domain Adaptation
Ondrej Bohdal, Da Li, Shell Xu Hu, Timothy Hospedales Few-Shot Event Classification in Images Using Knowledge Graphs for Prompting
Golsa Tahmasebzadeh, Matthias Springstein, Ralph Ewerth, Eric Müller-Budack Few-Shot Shape Recognition by Learning Deep Shape-Aware Features
Wenlong Shi, Changsheng Lu, Ming Shao, Yinjie Zhang, Siyu Xia, Piotr Koniusz FG-Net: Facial Action Unit Detection with Generalizable Pyramidal Features
Yufeng Yin, Di Chang, Guoxian Song, Shen Sang, Tiancheng Zhi, Jing Liu, Linjie Luo, Mohammad Soleymani Fine-Grained Alignment for Cross-Modal Recipe Retrieval
Muntasir Wahed, Xiaona Zhou, Tianjiao Yu, Ismini Lourentzou FIRe: Fast Inverse Rendering Using Directional and Signed Distance Functions
Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers FIRE: Food Image to REcipe Generation
Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski FishTrack23: An Ensemble Underwater Dataset for Multi-Object Tracking
Matthew Dawkins, Jack Prior, Bryon Lewis, Robin Faillettaz, Thompson Banez, Mary Salvi, Audrey Rollo, Julien Simon, Matthew Campbell, Matthew Lucero, Aashish Chaudhary, Benjamin Richards, Anthony Hoogs Fixing Overconfidence in Dynamic Neural Networks
Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin FLORA: Fine-Grained Low-Rank Architecture Search for Vision Transformer
Chi-Chih Chang, Yuan-Yao Sung, Shixing Yu, Ning-Chi Huang, Diana Marculescu, Kai-Chiang Wu FocusTune: Tuning Visual Localization Through Focus-Guided Sampling
Son Tung Nguyen, Alejandro Fontan, Michael Milford, Tobias Fischer FPGAN-Control: A Controllable Fingerprint Generator for Training with Synthetic Data
Alon Shoshan, Nadav Bhonker, Emanuel Ben Baruch, Ori Nizan, Igor Kviatkovsky, Joshua Engelsma, Manoj Aggarwal, Gérard Medioni FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation
Wenxuan Wang, Jing Wang, Chen Chen, Jianbo Jiao, Yuanxiu Cai, Shanshan Song, Jiangyun Li Frequency Attention for Knowledge Distillation
Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do GazeGNN: A Gaze-Guided Graph Neural Network for Chest X-Ray Classification
Bin Wang, Hongyi Pan, Armstrong Aboah, Zheyuan Zhang, Elif Keles, Drew Torigian, Baris Turkbey, Elizabeth Krupinski, Jayaram Udupa, Ulas Bagci GC-MVSNet: Multi-View, Multi-Scale, Geometrically-Consistent Multi-View Stereo
Vibhas K. Vats, Sripad Joshi, David J. Crandall, Md. Alimoor Reza, Soon-heung Jung Generalizing to Unseen Domains in Diabetic Retinopathy Classification
Chamuditha Jayanga Galappaththige, Gayal Kuruppu, Muhammad Haris Khan Gradient Coreset for Federated Learning
Durga Sivasubramanian, Lokesh Nagalapatti, Rishabh Iyer, Ganesh Ramakrishnan Grafting Vision Transformers
Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo GraphFill: Deep Image Inpainting Using Graphs
Shashikant Verma, Aman Sharma, Roopa Sheshadri, Shanmuganathan Raman GRIT: GAN Residuals for Paired Image-to-Image Translation
Saksham Suri, Moustafa Meshry, Larry S. Davis, Abhinav Shrivastava Guided Distillation for Semi-Supervised Instance Segmentation
Tariq Berrada, Camille Couprie, Karteek Alahari, Jakob Verbeek HaGRID -- HAnd Gesture Recognition Image Dataset
Alexander Kapitanov, Karina Kvanchiani, Alexander Nagaev, Roman Kraynov, Andrei Makhliarchuk HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information
Heitor Rapela Medeiros, Fidel A. Guerrero Peña, Masih Aminbeidokhti, Thomas Dubail, Eric Granger, Marco Pedersoli Hardware Aware Evolutionary Neural Architecture Search Using Representation Similarity Metric
Nilotpal Sinha, Abd El Rahman Shabayek, Anis Kacem, Peyman Rostami, Carl Shneider, Djamila Aouada HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation
Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis HMP: Hand Motion Priors for Pose and Shape Estimation from Video
Enes Duran, Muhammed Kocabas, Vasileios Choutas, Zicong Fan, Michael J. Black Human Motion Aware Text-to-Video Generation with Explicit Camera Control
Taehoon Kim, ChanHee Kang, JaeHyuk Park, Daun Jeong, ChangHee Yang, Suk-Ju Kang, Kyeongbo Kong iBARLE: imBalance-Aware Room Layout Estimation
Taotao Jing, Lichen Wang, Naji Khosravan, Zhiqiang Wan, Zachary Bessinger, Zhengming Ding, Sing Bing Kang Identifying Label Errors in Object Detection Datasets by Loss Inspection
Marius Schubert, Tobias Riedlinger, Karsten Kahl, Daniel Kröll, Sebastian Schoenen, Siniša Šegvić, Matthias Rottmann Image Denoising and the Generative Accumulation of Photons
Alexander Krull, Hector Basevi, Benjamin Salmon, Andre Zeug, Franziska Müller, Samuel Tonks, Leela Muppala, Aleš Leonardis Image Labels Are All You Need for Coarse Seagrass Segmentation
Scarlett Raine, Ross Marchant, Brano Kusy, Frederic Maire, Tobias Fischer Implicit Neural Representation for Change Detection
Peter Naylor, Diego Di Carlo, Arianna Traviglia, Makoto Yamada, Marco Fiorucci Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths
Ximeng Sun, Rameswar Panda, Chun-Fu Richard Chen, Naigang Wang, Bowen Pan, Aude Oliva, Rogerio Feris, Kate Saenko Improving Fairness in Deepfake Detection
Yan Ju, Shu Hu, Shan Jia, George H. Chen, Siwei Lyu Improving Vision-and-Language Reasoning via Spatial Relations Modeling
Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings
Amirhossein Kazerouni, Reza Azad, Alireza Hosseini, Dorit Merhof, Ulas Bagci Increasing Biases Can Be More Efficient than Increasing Weights
Carlo Metta, Marco Fantozzi, Andrea Papini, Gianluca Amato, Matteo Bergamaschi, Silvia Giulia Galfrè, Alessandro Marchetti, Michelangelo Vegliò, Maurizio Parton, Francesco Morandin InfraParis: A Multi-Modal and Multi-Task Autonomous Driving Dataset
Gianni Franchi, Marwane Hariat, Xuanlong Yu, Nacim Belkhir, Antoine Manzanera, David Filliat Instruct Me More! Random Prompting for Visual In-Context Learning
Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara Interactive Segmentation for Diverse Gesture Types Without Context
Josh Myers-Dean, Yifei Fan, Brian Price, Wilson Chan, Danna Gurari IR-FRestormer: Iterative Refinement with Fourier-Based Restormer for Accelerated MRI Reconstruction
Mohammad Zalbagi Darestani, Vishwesh Nath, Wenqi Li, Yufan He, Holger R. Roth, Ziyue Xu, Daguang Xu, Reinhard Heckel, Can Zhao Iterative Multi-Granular Image Editing Using Diffusion Models
K. J. Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan Joint Depth Prediction and Semantic Segmentation with Multi-View SAM
Mykhailo Shvets, Dongxu Zhao, Marc Niethammer, Roni Sengupta, Alexander C. Berg Kaizen: Practical Self-Supervised Continual Learning with Continual Fine-Tuning
Chi Ian Tang, Lorena Qendro, Dimitris Spathis, Fahim Kawsar, Cecilia Mascolo, Akhil Mathur Latent Feature-Guided Diffusion Models for Shadow Removal
Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration
Ran Liu, Sahil Khose, Jingyun Xiao, Lakshmi Sathidevi, Keerthan Ramnath, Zsolt Kira, Eva L. Dyer LaughTalk: Expressive 3D Talking Head Generation with Laughter
Kim Sung-Bin, Lee Hyun, Da Hye Hong, Suekyeong Nam, Janghoon Ju, Tae-Hyun Oh Layer-Wise Auto-Weighting for Non-Stationary Test-Time Adaptation
Junyoung Park, Jin Kim, Hyeongjun Kwon, Ilhoon Yoon, Kwanghoon Sohn Learning Class and Domain Augmentations for Single-Source Open-Domain Generalization
Prathmesh Bele, Valay Bundele, Avigyan Bhattacharya, Ankit Jha, Gemma Roig, Biplab Banerjee Learning Intra-Class Multimodal Distributions with Orthonormal Matrices
Jumpei Goto, Yohei Nakata, Kiyofumi Abe, Yasunori Ishii, Takayoshi Yamashita Learning Low-Rank Latent Spaces with Simple Deterministic Autoencoder: Theoretical and Empirical Insights
Alokendu Mazumder, Tirthajit Baruah, Bhartendu Kumar, Rishab Sharma, Vishwajeet Pattanaik, Punit Rathore Learning Quality Labels for Robust Image Classification
Xiaosong Wang, Ziyue Xu, Dong Yang, Leo Tam, Holger Roth, Daguang Xu Learning Robust Deep Visual Representations from EEG Brain Recordings
Prajwal Singh, Dwip Dalal, Gautam Vashishtha, Krishna Miyapuram, Shanmuganathan Raman Learning Saliency from Fixations
Yasser Abdelaziz Dahou Djilali, Kevin McGuinness, Noel O’Connor Learning to Compose SuperWeights for Neural Parameter Allocation Search
Piotr Teterwak, Soren Nelson, Nikoli Dryden, Dina Bashkirova, Kate Saenko, Bryan A. Plummer Learning to Generate Training Datasets for Robust Semantic Segmentation
Marwane Hariat, Olivier Laurent, Rémi Kazmierczak, Shihao Zhang, Andrei Bursuc, Angela Yao, Gianni Franchi Learning to Read Analog Gauges from Synthetic Data
Juan Leon-Alcazar, Yazeed Alnumay, Cheng Zheng, Hassane Trigui, Sahejad Patel, Bernard Ghanem Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement
Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava Leveraging Synthetic Data to Learn Video Stabilization Under Adverse Conditions
Abdulrahman Kerim, Washington L. S. Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang LidarCLIP or: How I Learned to Talk to Point Clouds
Georg Hess, Adam Tonderski, Christoffer Petersson, Kalle Åström, Lennart Svensson Lightweight Delivery Detection on Doorbell Cameras
Pirazh Khorramshahi, Zhe Wu, Tianchen Wang, Luke DeLuccia, Hongcheng Wang Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo Link Prediction for Flow-Driven Spatial Networks
Bastian Wittmann, Johannes C. Paetzold, Chinmay Prabhakar, Daniel Rueckert, Bjoern Menze Linking Convolutional Kernel Size to Generalization Bias in Face Analysis CNNs
Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan LipAT: Beyond Style Transfer for Controllable Neural Simulation of Lipstick Using Cosmetic Attributes
Amila Silva, Olga Moskvyak, Alexander Long, Ravi Garg, Stephen Gould, Gil Avraham, Anton van den Hengel LIVENet: A Novel Network for Real-World Low-Light Image Denoising and Enhancement
Dhruv Makwana, Gayatri Deshmukh, Onkar Susladkar, Sparsh Mittal, Sai Chandra Teja R. MACP: Efficient Model Adaptation for Cooperative Perception
Yunsheng Ma, Juanwu Lu, Can Cui, Sicheng Zhao, Xu Cao, Wenqian Ye, Ziran Wang MAdVerse: A Hierarchical Dataset of Multi-Lingual Ads from Diverse Sources and Categories
Amruth Sagar, Rishabh Srivastava, R. T. Rakshitha, Venkata Kesav Venna, Ravi Kiran Sarvadevabhatla MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds
Georg Krispel, David Schinagl, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof MarsLS-Net: Martian Landslides Segmentation Network and Benchmark Dataset
Sidike Paheding, Abel A. Reyes, A. Rajaneesh, K.S. Sajinkumar, Thomas Oommen MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan, Jiageng Zhang, Ali Taalimi, Fan Yang, Xingyi Zhou, Chaochao Yan, Liang-Chieh Chen, Yeqing Li Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation
Fangwen Wu, Jingxuan He, Yufei Yin, Yanbin Hao, Gang Huang, Lechao Cheng Masked Event Modeling: Self-Supervised Pretraining for Event Cameras
Simon Klenk, David Bonello, Lukas Koestler, Nikita Araslanov, Daniel Cremers Meta-Learned Kernel for Blind Super-Resolution Kernel Estimation
Royson Lee, Rui Li, Stylianos Venieris, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane MFT: Long-Term Tracking of Every Pixel
Michal Neoral, Jonáš Šerých, Jiří Matas Mini but Mighty: Finetuning ViTs with Mini Adapters
Imad Eddine Marouf, Enzo Tartaglione, Stéphane Lathuilière Minimizing Layerwise Activation Norm Improves Generalization in Federated Learning
M. Yashwanth, Gaurav Kumar Nayak, Harsh Rangwani, Arya Singh, R. Venkatesh Babu, Anirban Chakraborty MobileNVC: Real-Time 1080p Neural Video Compression on a Mobile Device
Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautiere, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke Wiggers MoP-CLIP: A Mixture of Prompt-Tuned CLIP Models for Domain Incremental Learning
Julien Nicolas, Florent Chiaroni, Imtiaz Ziko, Ola Ahmad, Christian Desrosiers, Jose Dolz MOPA: Modular Object Navigation with PointGoal Agents
Sonia Raychaudhuri, Tommaso Campari, Unnat Jain, Manolis Savva, Angel X. Chang MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video
Renat Bashirov, Alexey Larionov, Evgeniya Ustinova, Mikhail Sidorenko, David Svitov, Ilya Zakharkin, Victor Lempitsky Motion Matters: Neural Motion Transfer for Better Camera Physiological Measurement
Akshay Paruchuri, Xin Liu, Yulu Pan, Shwetak Patel, Daniel McDuff, Soumyadip Sengupta MotionGPT: Human Motion Synthesis with Improved Diversity and Realism via GPT-3 Prompting
Jose Ribeiro-Gomes, Tianhui Cai, Zoltán Á. Milacski, Chen Wu, Aayush Prakash, Shingo Takagi, Amaury Aubel, Daeil Kim, Alexandre Bernardino, Fernando De la Torre Movie Genre Classification by Language Augmentation and Shot Sampling
Zhongping Zhang, Yiwen Gu, Bryan A. Plummer, Xin Miao, Jiayi Liu, Huayan Wang MSCC: Multi-Scale Transformers for Camera Calibration
Xu Song, Hao Kang, Atsunori Moteki, Genta Suzuki, Yoshie Kobayashi, Zhiming Tan Multi-Modal Gaze Following in Conversational Scenarios
Yuqi Hou, Zhongqun Zhang, Nora Horanyi, Jaewon Moon, Yihua Cheng, Hyung Jin Chang Multimodal Deep Learning for Remote Stress Estimation Using CCT-LSTM
Sayyedjavad Ziaratnia, Tipporn Laohakangvalvit, Midori Sugaya, Peeraya Sripian Multimodality-Guided Image Style Transfer Using Cross-Modal GAN Inversion
Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava Multitask Vision-Language Prompt Tuning
Sheng Shen, Shijia Yang, Tianjun Zhang, Bohan Zhai, Joseph E. Gonzalez, Kurt Keutzer, Trevor Darrell Nested Diffusion Processes for Anytime Image Generation
Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad NVAutoNet: Fast and Accurate 360deg 3D Visual Perception for Self Driving
Trung Pham, Mehran Maghoumi, Wanli Jiang, Bala Siva Sashank Jujjavarapu, Mehdi Sajjadi, Xin Liu, Hsuan-Chu Lin, Bor-Jeng Chen, Giang Truong, Chao Fang, Junghyun Kwon, Minwoo Park Object Aware Contrastive Prior for Interactive Image Segmentation
Praful Mathur, Shashi Kumar Parwani, Mrinmoy Sen, Roopa Sheshadri, Aman Sharma Object Re-Identification from Point Clouds
Benjamin Thérien, Chengjie Huang, Adrian Chow, Krzysztof Czarnecki Object-Centric Video Representation for Long-Term Action Anticipation
Ce Zhang, Changcheng Fu, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, Chen Sun OOD Aware Supervised Contrastive Learning
Soroush Seifi, Daniel Olmeda Reino, Nikolay Chumerin, Rahaf Aljundi Open-Set Object Detection by Aligning Known Class Representations
Hiran Sarkar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth N. Balasubramanian Optimizing Long-Term Robot Tracking with Multi-Platform Sensor Fusion
Giuliano Albanese, Arka Mitra, Jan-Nico Zaech, Yupeng Zhao, Ajad Chhatkuli, Luc Van Gool Out-of-Distribution Detection with Logical Reasoning
Konstantin Kirchheim, Tim Gonschorek, Frank Ortmeier OVeNet: Offset Vector Network for Semantic Segmentation
Stamatis Alexandropoulos, Christos Sakaridis, Petros Maragos P2D: Plug and Play Discriminator for Accelerating GAN Frameworks
Min Jin Chong, Krishna Kumar Singh, Yijun Li, Jingwan Lu, David Forsyth Panelformer: Sewing Pattern Reconstruction from 2D Garment Images
Cheng-Hsiu Chen, Jheng-Wei Su, Min-Chun Hu, Chih-Yuan Yao, Hung-Kuo Chu Partial Binarization of Neural Networks for Budget-Aware Efficient Learning
Udbhav Bamba, Neeraj Anand, Saksham Aggarwal, Dilip K. Prasad, Deepak K. Gupta Patch-Based Selection and Refinement for Early Object Detection
Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-Sun Seo, Yu Cao PathLDM: Text Conditioned Latent Diffusion Model for Histopathology
Srikar Yellapragada, Alexandros Graikos, Prateek Prasanna, Tahsin Kurc, Joel Saltz, Dimitris Samaras PDA-RWSR: Pixel-Wise Degradation Adaptive Real-World Super-Resolution
Andreas Aakerberg, Majed El Helou, Kamal Nasrollahi, Thomas Moeslund Permutation-Aware Activity Segmentation via Unsupervised Frame-to-Segment Alignment
Quoc-Huy Tran, Ahmed Mehmood, Muhammad Ahmed, Muhammad Naufil, Anas Zafar, Andrey Konin, Zeeshan Zia Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention
Jianjin Xu, Saman Motamed, Praneetha Vaddamanu, Chen Henry Wu, Christian Haene, Jean-Charles Bazin, Fernando De la Torre Pixel-Grounded Prototypical Part Networks
Zachariah Carmichael, Suhas Lohit, Anoop Cherian, Michael J. Jones, Walter J. Scheirer PMVC: Promoting Multi-View Consistency for 3D Scene Reconstruction
Chushan Zhang, Jinguang Tong, Tao Jun Lin, Chuong Nguyen, Hongdong Li POISE: Pose Guided Human Silhouette Extraction Under Occlusions
Arindam Dutta, Rohit Lal, Dripta S. Raychaudhuri, Calvin-Khang Ta, Amit K. Roy-Chowdhury Polarimetric PatchMatch Multi-View Stereo
Jinyu Zhao, Jumpei Oishi, Yusuke Monno, Masatoshi Okutomi PolyMaX: General Dense Prediction with Mask Transformer
Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen POP-VQA - Privacy Preserving, On-Device, Personalized Visual Question Answering
Pragya Paramita Sahu, Abhishek Raut, Jagdish Singh Samant, Mahesh Gorijala, Vignesh Lakshminarayanan, Pinaki Bhaskar PressureVision++: Estimating Fingertip Pressure from Diverse RGB Images
Patrick Grady, Jeremy A. Collins, Chengcheng Tang, Christopher D. Twigg, Kunal Aneja, James Hays, Charles C. Kemp PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers Using Synthetic Scene Data
Roei Herzig, Ofir Abramovich, Elad Ben Avraham, Assaf Arbelle, Leonid Karlinsky, Ariel Shamir, Trevor Darrell, Amir Globerson Prototype Learning for Explainable Brain Age Prediction
Linde S. Hesse, Nicola K. Dinsdale, Ana I. L. Namburete Prototypical Contrastive Network for Imbalanced Aerial Image Segmentation
Keiller Nogueira, Mayara Maezano Faita-Pinheiro, Ana Paula Marques Ramos, Wesley Nunes Gonçalves, José Marcato Junior, Jefersson A. dos Santos ProxEdit: Improving Tuning-Free Real Image Editing with Proximal Guidance
Ligong Han, Song Wen, Qi Chen, Zhixing Zhang, Kunpeng Song, Mengwei Ren, Ruijiang Gao, Anastasis Stathopoulos, Xiaoxiao He, Yuxiao Chen, Di Liu, Qilong Zhangli, Jindong Jiang, Zhaoyang Xia, Akash Srivastava, Dimitris Metaxas RADIO: Reference-Agnostic Dubbing Video Synthesis
Dongyeun Lee, Chaewon Kim, Sangjoon Yu, Jaejun Yoo, Gyeong-Moon Park Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning
Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel Kochenderfer, Chiho Choi, Behzad Dariush Ray Deformation Networks for Novel View Synthesis of Refractive Objects
Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew Shaffer, Stephen Gould Re-Evaluating LiDAR Scene Flow
Nathaniel Chodosh, Deva Ramanan, Simon Lucey Real-Time 6-DoF Pose Estimation by an Event-Based Camera Using Active LED Markers
Gerald Ebmer, Adam Loch, Minh Nhat Vu, Roberto Mecca, Germain Haessig, Christian Hartl-Nesic, Markus Vincze, Andreas Kugi ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation
Xuefeng Hu, Ke Zhang, Lu Xia, Albert Chen, Jiajia Luo, Yuyin Sun, Ken Wang, Nan Qiao, Xiao Zeng, Min Sun, Cheng-Hao Kuo, Ram Nevatia Recognition of Unseen Bird Species by Learning from Field Guides
Andrés C. Rodríguez, Stefano D'Aronco, Rodrigo Caye Daudt, Jan D. Wegner, Konrad Schindler RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement
Gregor Köhler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jäger, Jörg K.H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein Reference-Based Restoration of Digitized Analog Videotapes
Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo Registered and Segmented Deformable Object Reconstruction from a Single View Point Cloud
Pit Henrich, Balázs Gyenes, Paul Maria Scheikl, Gerhard Neumann, Franziska Mathis-Ullrich Removing the Quality Tax in Controllable Face Generation
Yiwen Huang, Zhiqiu Yu, Xinjie Yi, Yue Wang, James Tompkin Revisiting Token Pruning for Object Detection and Instance Segmentation
Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza RGB-D Mapping and Tracking in a Plenoxel Radiance Field
Andreas L. Teigen, Yeonsoo Park, Annette Stahl, Rudolf Mester RGB-X Object Detection via Scene-Specific Fusion Modules
Sri Aditya Deevi, Connor Lee, Lu Gan, Sushruth Nagesh, Gaurav Pandey, Soon-Jo Chung RMFER: Semi-Supervised Contrastive Learning for Facial Expression Recognition with Reaction Mashup Video
Yunseong Cho, Chanwoo Kim, Hoseong Cho, Yunhoe Ku, Eunseo Kim, Muhammadjon Boboev, Joonseok Lee, Seungryul Baek Robust Category-Level 3D Pose Estimation from Diffusion-Enhanced Synthetic Data
Jiahao Yang, Wufei Ma, Angtian Wang, Xiaoding Yuan, Alan Yuille, Adam Kortylewski Robust Learning via Conditional Prevalence Adjustment
Minh Nguyen, Alan Q. Wang, Heejong Kim, Mert R. Sabuncu Robust Object Detection in Challenging Weather Conditions
Himanshu Gupta, Oleksandr Kotlyar, Henrik Andreasson, Achim J. Lilienthal Robust TRISO-Fueled Pebble Identification by Digit Recognition
Roshan Kenia, Jihane Mendil, Ahmed Jasim, Muthanna Al-Dahhan, Zhaozheng Yin Robust Unsupervised Domain Adaptation Through Negative-View Regularization
Joonhyeok Jang, Sunhyeok Lee, Seonghak Kim, Jung-un Kim, Seonghyun Kim, Daeshik Kim SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology
Dinkar Juyal, Siddhant Shingi, Syed Ashar Javed, Harshith Padigela, Chintan Shah, Anand Sampat, Archit Khosla, John Abel, Amaro Taylor-Weiner SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data
Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente Ordonez Segment Anything, from Space?
Simiao Ren, Francesco Luzi, Saad Lahrichi, Kaleb Kassaw, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof Self-Supervised Denoising Transformer with Gaussian Process
Rajeev Yasarla, Jeya Maria Jose Valanarasu, Vishwanath Sindagi, Vishal M. Patel Self-Supervised Learning for Visual Relationship Detection Through Masked Bounding Box Reconstruction
Zacharias Anastasakis, Dimitrios Mallis, Markos Diomataris, George Alexandridis, Stefanos Kollias, Vassilis Pitsikalis Semantic Generative Augmentations for Few-Shot Counting
Perla Doubinsky, Nicolas Audebert, Michel Crucianu, Hervé Le Borgne Shape from Shading for Robotic Manipulation
Arkadeep Narayan Chaudhury, Leonid Keselman, Christopher G. Atkeson Shape-Guided Diffusion with Inside-Outside Attention
Dong Huk Park, Grace Luo, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell SICKLE: A Multi-Sensor Satellite Imagery Dataset Annotated with Multiple Key Cropping Parameters
Depanshu Sani, Sandeep Mahato, Sourabh Saini, Harsh Kumar Agarwal, Charu Chandra Devshali, Saket Anand, Gaurav Arora, Thiagarajan Jayaraman Simple Token-Level Confidence Improves Caption Correctness
Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach Single Domain Generalization via Normalised Cross-Correlation Based Convolutions
WeiQin Chuah, Ruwan Tennakoon, Reza Hoseinnezhad, David Suter, Alireza Bab-Hadiashar Sketch-Based Video Object Localization
Sangmin Woo, So-Yeong Jeon, Jinyoung Park, Minji Son, Sumin Lee, Changick Kim Small Objects Matters in Weakly-Supervised Semantic Segmentation
Cheolhyun Mun, Sanghuk Lee, Youngjung Uh, Junsuk Choe, Hyeran Byun So You Think You Can Track?
Derek Gloudemans, Gergely Zachár, Yanbing Wang, Junyi Ji, Matt Nice, Matt Bunting, William W. Barbour, Jonathan Sprinkle, Benedetto Piccoli, Maria Laura Delle Monache, Alexandre Bayen, Benjamin Seibold, Daniel B. Work Spectroformer: Multi-Domain Query Cascaded Transformer Network for Underwater Image Enhancement
Raqib Khan, Priyanka Mishra, Nancy Mehta, Shruti S. Phutke, Santosh Kumar Vipparthi, Sukumar Nandi, Subrahmanyam Murala SphereCraft: A Dataset for Spherical Keypoint Detection, Matching and Camera Pose Estimation
Christiano Gava, Yunmin Cho, Federico Raue, Sebastian Palacio, Alain Pagani, Andreas Dengel Spiking Denoising Diffusion Probabilistic Models
Jiahang Cao, Ziqing Wang, Hanzhong Guo, Hao Cheng, Qiang Zhang, Renjing Xu SSP: Semi-Signed Prioritized Neural Fitting for Surface Reconstruction from Unoriented Point Clouds
Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Shi Qiu, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing Fu Steering Prototypes with Prompt-Tuning for Rehearsal-Free Continual Learning
Zhuowei Li, Long Zhao, Zizhao Zhang, Han Zhang, Di Liu, Ting Liu, Dimitris N. Metaxas STEP - Towards Structured Scene-Text Spotting
Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol StyleAvatar: Stylizing Animatable Head Avatars
Juan C. Pérez, Thu Nguyen-Phuoc, Chen Cao, Artsiom Sanakoyeu, Tomas Simon, Pablo Arbeláez, Bernard Ghanem, Ali Thabet, Albert Pumarola StyleGAN-Fusion: Diffusion Guided Domain Adaptation of Image Generators
Kunpeng Song, Ligong Han, Bingchen Liu, Dimitris Metaxas, Ahmed Elgammal StyleGenes: Discrete and Efficient Latent Distributions for GANs
Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Martin Danelljan, Luc Van Gool SupeRVol: Super-Resolution Shape and Reflectance Estimation in Inverse Volume Rendering
Mohammed Brahimi, Bjoern Haefner, Tarun Yenamandra, Bastian Goldluecke, Daniel Cremers Synergizing Contrastive Learning and Optimal Transport for 3D Point Cloud Domain Adaptation
Siddharth Katageri, Arkadipta De, Chaitanya Devaguptapu, Vssv Prasad, Charu Sharma, Manohar Kaul SynthProv: Interpretable Framework for Profiling Identity Leakage
Jaisidh Singh, Harshil Bhatia, Mayank Vatsa, Richa Singh, Aparna Bharati Taming Normalizing Flows
Shimon Malnick, Shai Avidan, Ohad Fried TCP: Triplet Contrastive-Relationship Preserving for Class-Incremental Learning
Shiyao Li, Xuefei Ning, Shanghang Zhang, Lidong Guo, Tianchen Zhao, Huazhong Yang, Yu Wang Text-to-Image Editing by Image Information Removal
Zhongping Zhang, Jian Zheng, Zhiyuan Fang, Bryan A. Plummer Textron: Weakly Supervised Multilingual Text Detection Through Data Programming
Dhruv Kudale, Badri Vishal Kasuba, Venkatapathy Subramanian, Parag Chaudhuri, Ganesh Ramakrishnan Textual Alchemy: CoFormer for Scene Text Understanding
Gayatri Deshmukh, Onkar Susladkar, Dhruv Makwana, Sparsh Mittal, Sai Chandra Teja R. Top-Down Beats Bottom-up in 3D Instance Segmentation
Maksim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich Torque Based Structured Pruning for Deep Neural Network
Arshita Gupta, Tien Bau, Joonsoo Kim, Zhe Zhu, Sumit Jha, Hrishikesh Garud Toward Planet-Wide Traffic Camera Calibration
Khiem Vuong, Robert Tamburo, Srinivasa G. Narasimhan Towards a Dynamic Vision Sensor-Based Insect Camera Trap
Eike Gebauer, Sebastian Thiele, Pierre Ouvrard, Adrien Sicard, Benjamin Risse Towards Diverse and Consistent Typography Generation
Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi Towards More Realistic Membership Inference Attacks on Large Diffusion Models
Jan Dubiński, Antoni Kowalczuk, Stanisław Pawlak, Przemyslaw Rokita, Tomasz Trzciński, Paweł Morawiecki Towards Realistic Generative 3D Face Models
Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando De la Torre Tracking Skiers from the Top to the Bottom
Matteo Dunnhofer, Luca Sordi, Niki Martinel, Christian Micheloni TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
Yue Ruan, Han-Hung Lee, Yiming Zhang, Ke Zhang, Angel X. Chang TriPlaneNet: An Encoder for EG3D Inversion
Ananta R. Bhattarai, Matthias Nießner, Artem Sevastopolsky Triplet Attention Transformer for Spatiotemporal Predictive Learning
Xuesong Nie, Xi Chen, Haoyuan Jin, Zhihang Zhu, Yunfeng Yan, Donglian Qi TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding
Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao Tunable Hybrid Proposal Networks for the Open World
Matthew Inkawhich, Nathan Inkawhich, Hai Li, Yiran Chen U3DS3: Unsupervised 3D Semantic Scene Segmentation
Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, Hubert P. H. Shum UGPNet: Universal Generative Prior for Image Restoration
Hwayoon Lee, Kyoungkook Kang, Hyeongmin Lee, Seung-Hwan Baek, Sunghyun Cho Unified Concept Editing in Diffusion Models
Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, David Bau Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement
Xingchen Zhao, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi, Han-Pang Chiu, Supun Samarasekera Unsupervised Domain Adaptation of MRI Skull-Stripping Trained on Adult Data to Newborns
Abbas Omidi, Aida Mohammadshahi, Neha Gianchandani, Regan King, Lara Leijser, Roberto Souza Unsupervised Event-Based Video Reconstruction
Gereon Fox, Xingang Pan, Ayush Tewari, Mohamed Elgharib, Christian Theobalt Unsupervised Graphic Layout Grouping with Transformers
Jialiang Zhu, Danqing Huang, Chunyu Wang, Mingxi Cheng, Ji Li, Han Hu, Xin Geng, Baining Guo Using Early Readouts to Mediate Featural Bias in Distillation
Rishabh Tiwari, Durga Sivasubramanian, Anmol Mekala, Ganesh Ramakrishnan, Pradeep Shenoy VEATIC: Video-Based Emotion and Affect Tracking in Context Dataset
Zhihang Ren, Jefferson Ortega, Yifan Wang, Zhimin Chen, Yunhui Guo, Stella X. Yu, David Whitney Video Instance Matting
Jiachen Li, Roberto Henschel, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Humphrey Shi Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation
Inkyu Shin, Dahun Kim, Qihang Yu, Jun Xie, Hong-Seok Kim, Bradley Green, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen Visual Narratives: Large-Scale Hierarchical Classification of Art-Historical Images
Matthias Springstein, Stefanie Schneider, Javad Rahnama, Julian Stalter, Maximilian Kristen, Eric Müller-Budack, Ralph Ewerth Visually Guided Audio Source Separation with Meta Consistency Learning
Md Amirul Islam, Seyed Shahabeddin Nabavi, Irina Kezele, Yang Wang, Yuanhao Yu, Jin Tang VMFormer: End-to-End Video Matting with Transformer
Jiachen Li, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Yunchao Wei, Humphrey Shi Volumetric Disentanglement for 3D Scene Manipulation
Sagie Benaim, Frederik Warburg, Peter Ebert Christensen, Serge Belongie WalkFormer: Point Cloud Completion via Guided Walks
Mohang Zhang, Yushi Li, Rong Chen, Yushan Pan, Jia Wang, Yunzhe Wang, Rong Xiang WATCH: Wide-Area Terrestrial Change Hypercube
Connor Greenwell, Jon Crall, Matthew Purri, Kristin Dana, Nathan Jacobs, Armin Hadzic, Scott Workman, Matt Leotta What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised Generic Event Boundary Detection
Sourabh Vasant Gothe, Vibhav Agarwal, Sourav Ghosh, Jayesh Rajkumar Vachhani, Pranay Kashyap, Barath Raj Kandur Raja What's Outside the Intersection? Fine-Grained Error Analysis for Semantic Segmentation Beyond IoU
Maximilian Bernhard, Roberto Amoroso, Yannic Kindermann, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp, Matthias Schubert Wino Vidi Vici: Conquering Numerical Instability of 8-Bit Winograd Convolution for Accurate Inference Acceleration on Edge
Pierpaolo Mori, Lukas Frickenstein, Shambhavi Balamuthu Sampath, Moritz Thoma, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Christian Unger, Walter Stechele, Daniel Mueller-Gritschneder, Claudio Passerone