CVPR 2023

2353 papers

"Seeing" Electric Network Frequency from Events Lexuan Xu, Guang Hua, Haijian Zhang, Lei Yu, Ning Qiao
PDF
(ML)$^2$P-Encoder: On Exploration of Channel-Class Correlation for Multi-Label Zero-Shot Learning Ziming Liu, Song Guo, Xiaocheng Lu, Jingcai Guo, Jiewei Zhang, Yue Zeng, Fushuo Huo
PDF
1% vs 100%: Parameter-Efficient Low Rank Adapter for Dense Predictions Dongshuo Yin, Yiran Yang, Zhechao Wang, Hongfeng Yu, Kaiwen Wei, Xian Sun
PDF
1000 FPS HDR Video with a Spike-RGB Hybrid Camera Yakun Chang, Chu Zhou, Yuchen Hong, Liwen Hu, Chao Xu, Tiejun Huang, Boxin Shi
PDF
2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli, Robby T. Tan
PDF
3D Cinemagraphy from a Single Image Xingyi Li, Zhiguo Cao, Huiqiang Sun, Jianming Zhang, Ke Xian, Guosheng Lin
PDF
3D Concept Learning and Reasoning from Multi-View Images Yining Hong, Chunru Lin, Yilun Du, Zhenfang Chen, Joshua B. Tenenbaum, Chuang Gan
PDF
3D GAN Inversion with Facial Symmetry Prior Fei Yin, Yong Zhang, Xuan Wang, Tengfei Wang, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Oztireli, Yujiu Yang
PDF
3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions Dale Decatur, Itai Lang, Rana Hanocka
PDF
3D Human Keypoints Estimation from Point Clouds in the Wild Without Human Labels Zhenzhen Weng, Alexander S. Gorban, Jingwei Ji, Mahyar Najibi, Yin Zhou, Dragomir Anguelov
PDF
3D Human Mesh Estimation from Virtual Markers Xiaoxuan Ma, Jiajun Su, Chunyu Wang, Wentao Zhu, Yizhou Wang
PDF
3D Human Pose Estimation via Intuitive Physics Shashank Tripathi, Lea Müller, Chun-Hao P. Huang, Omid Taheri, Michael J. Black, Dimitrios Tzionas
PDF
3D Human Pose Estimation with Spatio-Temporal Criss-Cross Attention Zhenhua Tang, Zhaofan Qiu, Yanbin Hao, Richang Hong, Ting Yao
PDF
3D Line Mapping Revisited Shaohui Liu, Yifan Yu, Rémi Pautrat, Marc Pollefeys, Viktor Larsson
PDF
3D Neural Field Generation Using Triplane Diffusion J. Ryan Shue, Eric Ryan Chan, Ryan Po, Zachary Ankner, Jiajun Wu, Gordon Wetzstein
PDF
3D Registration with Maximal Cliques Xiyu Zhang, Jiaqi Yang, Shikun Zhang, Yanning Zhang
PDF
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu, Eric P. Xing
PDF
3D Shape Reconstruction of Semi-Transparent Worms Thomas P. Ilett, Omer Yuval, Thomas Ranner, Netta Cohen, David C. Hogg
PDF
3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud Mingtao Feng, Haoran Hou, Liang Zhang, Zijie Wu, Yulan Guo, Ajmal Mian
PDF
3D Video Loops from Asynchronous Input Li Ma, Xiaoyu Li, Jing Liao, Pedro V. Sander
PDF
3D Video Object Detection with Learnable Object-Centric Global Optimization Jiawei He, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang
PDF
3D-Aware Conditional Image Synthesis Kangle Deng, Gengshan Yang, Deva Ramanan, Jun-Yan Zhu
PDF
3D-Aware Face Swapping Yixuan Li, Chao Ma, Yichao Yan, Wenhan Zhu, Xiaokang Yang
PDF
3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data Libing Zeng, Lele Chen, Wentao Bao, Zhong Li, Yi Xu, Junsong Yuan, Nima Khademi Kalantari
PDF
3D-Aware Multi-Class Image-to-Image Translation with NeRFs Senmao Li, Joost van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang
PDF
3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification Jiazhao Zhang, Liu Dai, Fanpeng Meng, Qingnan Fan, Xuelin Chen, Kai Xu, He Wang
PDF
3D-POP - An Automated Annotation Approach to Facilitate Markerless 2D-3D Tracking of Freely Moving Birds with Marker-Based Motion Capture Hemal Naik, Alex Hoi Hang Chan, Junran Yang, Mathilde Delacoux, Iain D. Couzin, Fumihiro Kano, Máté Nagy
PDF
3DAvatarGAN: Bridging Domains for Personalized Editable Avatars Rameen Abdal, Hsin-Ying Lee, Peihao Zhu, Menglei Chai, Aliaksandr Siarohin, Peter Wonka, Sergey Tulyakov
PDF
3Mformer: Multi-Order Multi-Mode Transformer for Skeletal Action Recognition Lei Wang, Piotr Koniusz
PDF
A Bag-of-Prototypes Representation for Dataset-Level Applications Weijie Tu, Weijian Deng, Tom Gedeon, Liang Zheng
PDF
A Characteristic Function-Based Method for Bottom-up Human Pose Estimation Haoxuan Qu, Yujun Cai, Lin Geng Foo, Ajay Kumar, Jun Liu
PDF
A Data-Based Perspective on Transfer Learning Saachi Jain, Hadi Salman, Alaa Khaddaj, Eric Wong, Sung Min Park, Aleksander Mądry
PDF
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction Xiaotao Hu, Zhewei Huang, Ailin Huang, Jun Xu, Shuchang Zhou
PDF
A General Regret Bound of Preconditioned Gradient Method for DNN Training Hongwei Yong, Ying Sun, Lei Zhang
PDF
A Generalized Framework for Video Instance Segmentation Miran Heo, Sukjun Hwang, Jeongseok Hyun, Hanjung Kim, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim
PDF
A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-the-Wild Images Biwen Lei, Jianqiang Ren, Mengyang Feng, Miaomiao Cui, Xuansong Xie
PDF
A Large-Scale Homography Benchmark Daniel Barath, Dmytro Mishkin, Michal Polic, Wolfgang Förstner, Jiri Matas
PDF
A Large-Scale Robustness Analysis of Video Action Recognition Models Madeline Chantry Schiappa, Naman Biyani, Prudvi Kamtam, Shruti Vyas, Hamid Palangi, Vibhav Vineet, Yogesh S. Rawat
PDF
A Light Touch Approach to Teaching Transformers Multi-View Geometry Yash Bhalgat, João F. Henriques, Andrew Zisserman
PDF
A Light Weight Model for Active Speaker Detection Junhua Liao, Haihan Duan, Kanghui Feng, Wanbing Zhao, Yanbing Yang, Liangyin Chen
PDF
A Loopback Network for Explainable Microvascular Invasion Classification Shengxuming Zhang, Tianqi Shi, Yang Jiang, Xiuming Zhang, Jie Lei, Zunlei Feng, Mingli Song
PDF
A Meta-Learning Approach to Predicting Performance and Data Requirements Achin Jain, Gurumurthy Swaminathan, Paolo Favaro, Hao Yang, Avinash Ravichandran, Hrayr Harutyunyan, Alessandro Achille, Onkar Dabeer, Bernt Schiele, Ashwin Swaminathan, Stefano Soatto
PDF
A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation Hui Tang, Kui Jia
PDF
A New Comprehensive Benchmark for Semi-Supervised Video Anomaly Detection and Anticipation Congqi Cao, Yue Lu, Peng Wang, Yanning Zhang
PDF
A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet Categories Reza Akbarian Bafghi, Danna Gurari
PDF
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning Aishwarya Kamath, Peter Anderson, Su Wang, Jing Yu Koh, Alexander Ku, Austin Waters, Yinfei Yang, Jason Baldridge, Zarana Parekh
PDF
A Practical Stereo Depth System for Smart Glasses Jialiang Wang, Daniel Scharstein, Akash Bapat, Kevin Blackburn-Matzen, Matthew Yu, Jonathan Lehman, Suhib Alsisan, Yanghan Wang, Sam Tsai, Jan-Michael Frahm, Zijian He, Peter Vajda, Michael F. Cohen, Matt Uyttendaele
PDF
A Practical Upper Bound for the Worst-Case Attribution Deviations Fan Wang, Adams Wai-Kin Kong
PDF
A Probabilistic Attention Model with Occlusion-Aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams
PDF
A Probabilistic Framework for Lifelong Test-Time Adaptation Dhanajit Brahma, Piyush Rai
PDF
A Rotation-Translation-Decoupled Solution for Robust and Efficient Visual-Inertial Initialization Yijia He, Bo Xu, Zhanpeng Ouyang, Hongdong Li
PDF
A Simple Baseline for Video Restoration with Grouped Spatial-Temporal Shift Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li
PDF
A Simple Framework for Text-Supervised Semantic Segmentation Muyang Yi, Quan Cui, Hao Wu, Cheng Yang, Osamu Yoshie, Hongtao Lu
PDF
A Soma Segmentation Benchmark in Full Adult Fly Brain Xiaoyu Liu, Bo Hu, Mingxing Li, Wei Huang, Yueyi Zhang, Zhiwei Xiong
PDF
A Strong Baseline for Generalized Few-Shot Semantic Segmentation Sina Hajimiri, Malik Boudiaf, Ismail Ben Ayed, Jose Dolz
PDF
A Unified HDR Imaging Method with Pixel and Patch Level Qingsen Yan, Weiye Chen, Song Zhang, Yu Zhu, Jinqiu Sun, Yanning Zhang
PDF
A Unified Knowledge Distillation Framework for Deep Directed Graphical Models Yizhuo Chen, Kaizhao Liang, Zhe Zeng, Shuochao Yao, Huajie Shao
PDF
A Unified Pyramid Recurrent Network for Video Frame Interpolation Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-hee Hahm
PDF
A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance Xianmin Xu, Yuxin Lin, Haoyang Zhou, Chong Zeng, Yaxin Yu, Kun Zhou, Hongzhi Wu
PDF
A Whac-a-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others Zhiheng Li, Ivan Evtimov, Albert Gordo, Caner Hazirbas, Tal Hassner, Cristian Canton Ferrer, Chenliang Xu, Mark Ibrahim
PDF
A-Cap: Anticipation Captioning with Commonsense Knowledge Duc Minh Vo, Quoc-An Luong, Akihiro Sugimoto, Hideki Nakayama
PDF
A-La-Carte Prompt Tuning (APT): Combining Distinct Data via Composable Prompting Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto
PDF
A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image Changlong Jiang, Yang Xiao, Cunlin Wu, Mingyang Zhang, Jinghong Zheng, Zhiguo Cao, Joey Tianyi Zhou
PDF
ABCD: Arbitrary Bitwise Coefficient for De-Quantization Woo Kyoung Han, Byeonghun Lee, Sang Hyun Park, Kyong Hwan Jin
PDF
ABLE-NeRF: Attention-Based Rendering with Learnable Embeddings for Neural Radiance Field Zhe Jun Tang, Tat-Jen Cham, Haiyu Zhao
PDF
Abstract Visual Reasoning: An Algebraic Approach for Solving Raven's Progressive Matrices Jingyi Xu, Tushar Vaidya, Yufei Wu, Saket Chandra, Zhangsheng Lai, Kai Fong Ernest Chong
PDF
Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and Poses Eric Brachmann, Tommaso Cavallari, Victor Adrian Prisacariu
PDF
Accelerating Dataset Distillation via Model Augmentation Lei Zhang, Jie Zhang, Bowen Lei, Subhabrata Mukherjee, Xiang Pan, Bo Zhao, Caiwen Ding, Yao Li, Dongkuan Xu
PDF
Accelerating Vision-Language Pretraining with Free Language Modeling Teng Wang, Yixiao Ge, Feng Zheng, Ran Cheng, Ying Shan, Xiaohu Qie, Ping Luo
PDF
AccelIR: Task-Aware Image Compression for Accelerating Neural Restoration Juncheol Ye, Hyunho Yeo, Jinwoo Park, Dongsu Han
PDF
Accidental Light Probes Hong-Xing Yu, Samir Agarwala, Charles Herrmann, Richard Szeliski, Noah Snavely, Jiajun Wu, Deqing Sun
PDF
Achieving a Better Stability-Plasticity Trade-Off via Auxiliary Networks in Continual Learning Sanghwan Kim, Lorenzo Noci, Antonio Orvieto, Thomas Hofmann
PDF
ACL-SPC: Adaptive Closed-Loop System for Self-Supervised Point Cloud Completion Sangmin Hong, Mohsen Yavartanoo, Reyhaneh Neshatavar, Kyoung Mu Lee
PDF
ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction Zhengdi Yu, Shaoli Huang, Chen Fang, Toby P. Breckon, Jue Wang
PDF
ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation Kehan Li, Zhennan Wang, Zesen Cheng, Runyi Yu, Yian Zhao, Guoli Song, Chang Liu, Li Yuan, Jie Chen
PDF
Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition Lilang Lin, Jiahang Zhang, Jiaying Liu
PDF
Activating More Pixels in Image Super-Resolution Transformer Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong
PDF
Active Exploration of Multimodal Complementarity for Few-Shot Action Recognition Yuyang Wanyan, Xiaoshan Yang, Chaofan Chen, Changsheng Xu
PDF
Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm Yichen Xie, Han Lu, Junchi Yan, Xiaokang Yang, Masayoshi Tomizuka, Wei Zhan
PDF
ActMAD: Activation Matching to Align Distributions for Test-Time-Training Muhammad Jehanzeb Mirza, Pol Jané Soneira, Wei Lin, Mateusz Kozinski, Horst Possegger, Horst Bischof
PDF
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders Wele Gedara Chaminda Bandara, Naman Patel, Ali Gholami, Mehdi Nikkhah, Motilal Agrawal, Vishal M. Patel
PDF
AdamsFormer for Spatial Action Localization in the Future Hyung-gun Chi, Kwonjoon Lee, Nakul Agarwal, Yi Xu, Karthik Ramani, Chiho Choi
PDF
Adapting Shortcut with Normalizing Flow: An Efficient Tuning Framework for Visual Recognition Yaoming Wang, Bowen Shi, Xiaopeng Zhang, Jin Li, Yuchen Liu, Wenrui Dai, Chenglin Li, Hongkai Xiong, Qi Tian
PDF
Adaptive Annealing for Robust Geometric Estimation Chitturi Sidhartha, Lalit Manam, Venu Madhav Govindu
PDF
Adaptive Assignment for Geometry Aware Local Feature Matching Dihe Huang, Ying Chen, Yong Liu, Jianlin Liu, Shang Xu, Wenlong Wu, Yikang Ding, Fan Tang, Chengjie Wang
PDF
Adaptive Channel Sparsity for Federated Learning Under System Heterogeneity Dongping Liao, Xitong Gao, Yiren Zhao, Cheng-Zhong Xu
PDF
Adaptive Data-Free Quantization Biao Qian, Yang Wang, Richang Hong, Meng Wang
PDF
Adaptive Global Decay Process for Event Cameras Urbano Miguel Nunes, Ryad Benosman, Sio-Hoi Ieng
PDF
Adaptive Graph Convolutional Subspace Clustering Lai Wei, Zhengwei Chen, Jun Yin, Changming Zhu, Rigui Zhou, Jin Liu
PDF
Adaptive Human Matting for Dynamic Videos Chung-Ching Lin, Jiang Wang, Kun Luo, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu
PDF
Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo Yuesong Wang, Zhaojie Zeng, Tao Guan, Wei Yang, Zhuo Chen, Wenkai Liu, Luoyuan Xu, Yawei Luo
PDF
Adaptive Plasticity Improvement for Continual Learning Yan-Shuo Liang, Wu-Jun Li
PDF
Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images Bowei Du, Yecheng Huang, Jiaxin Chen, Di Huang
PDF
Adaptive Sparse Pairwise Loss for Object Re-Identification Xiao Zhou, Yujie Zhong, Zhen Cheng, Fan Liang, Lin Ma
PDF
Adaptive Spot-Guided Transformer for Consistent Local Feature Matching Jiahuan Yu, Jiahao Chang, Jianfeng He, Tianzhu Zhang, Jiyang Yu, Feng Wu
PDF
Adaptive Zone-Aware Hierarchical Planner for Vision-Language Navigation Chen Gao, Xingyu Peng, Mi Yan, He Wang, Lirong Yang, Haibing Ren, Hongsheng Li, Si Liu
PDF
AdaptiveMix: Improving GAN Training via Feature Space Shrinkage Haozhe Liu, Wentian Zhang, Bing Li, Haoqian Wu, Nanjun He, Yawen Huang, Yuexiang Li, Bernard Ghanem, Yefeng Zheng
PDF
Adjustment and Alignment for Unbiased Open Set Domain Adaptation Wuyang Li, Jie Liu, Bo Han, Yixuan Yuan
PDF
Advancing Visual Grounding with Scene Knowledge: Benchmark and Method Zhihong Chen, Ruifei Zhang, Yibing Song, Xiang Wan, Guanbin Li
PDF
Adversarial Counterfactual Visual Explanations Guillaume Jeanneret, Loïc Simon, Frédéric Jurie
PDF
Adversarial Normalization: I Can Visualize Everything (ICE) Hoyoung Choi, Seungwan Jin, Kyungsik Han
PDF
Adversarial Robustness via Random Projection Filters Minjing Dong, Chang Xu
PDF
Adversarially Masking Synthetic to Mimic Real: Adaptive Noise Injection for Point Cloud Segmentation Adaptation Guangrui Li, Guoliang Kang, Xiaohan Wang, Yunchao Wei, Yi Yang
PDF
Adversarially Robust Neural Architecture Search for Graph Neural Networks Beini Xie, Heng Chang, Ziwei Zhang, Xin Wang, Daixin Wang, Zhiqiang Zhang, Rex Ying, Wenwu Zhu
PDF
AeDet: Azimuth-Invariant Multi-View 3D Object Detection Chengjian Feng, Zequn Jie, Yujie Zhong, Xiangxiang Chu, Lin Ma
PDF
Affection: Learning Affective Explanations for Real-World Visual Data Panos Achlioptas, Maks Ovsjanikov, Leonidas Guibas, Sergey Tulyakov
PDF
Affordance Diffusion: Synthesizing Hand-Object Interactions Yufei Ye, Xueting Li, Abhinav Gupta, Shalini De Mello, Stan Birchfield, Jiaming Song, Shubham Tulsiani, Sifei Liu
PDF
Affordance Grounding from Demonstration Video to Target Image Joya Chen, Difei Gao, Kevin Qinghong Lin, Mike Zheng Shou
PDF
Affordances from Human Videos as a Versatile Representation for Robotics Shikhar Bahl, Russell Mendonca, Lili Chen, Unnat Jain, Deepak Pathak
PDF
AGAIN: Adversarial Training with Attribution Span Enlargement and Hybrid Feature Fusion Shenglin Yin, Kelu Yao, Sheng Shi, Yangzhou Du, Zhen Xiao
PDF
Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations Hagay Michaeli, Tomer Michaeli, Daniel Soudry
PDF
Align and Attend: Multimodal Summarization with Dual Contrastive Losses Bo He, Jun Wang, Jielin Qiu, Trung Bui, Abhinav Shrivastava, Zhaowen Wang
PDF
Align Your Latents: High-Resolution Video Synthesis with Latent Diffusion Models Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, Karsten Kreis
PDF
AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training Yifan Jiang, Peter Hedman, Ben Mildenhall, Dejia Xu, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue
PDF
Aligning Bag of Regions for Open-Vocabulary Object Detection Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy
PDF
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations Jiahao Zhang, Anoop Cherian, Yanbin Liu, Yizhak Ben-Shabat, Cristian Rodriguez, Stephen Gould
PDF
All Are Worth Words: A ViT Backbone for Diffusion Models Fan Bao, Shen Nie, Kaiwen Xue, Yue Cao, Chongxuan Li, Hang Su, Jun Zhu
PDF
All in One: Exploring Unified Video-Language Pre-Training Jinpeng Wang, Yixiao Ge, Rui Yan, Yuying Ge, Kevin Qinghong Lin, Satoshi Tsutsui, Xudong Lin, Guanyu Cai, Jianping Wu, Ying Shan, Xiaohu Qie, Mike Zheng Shou
PDF
All-in-Focus Imaging from Event Focal Stack Hanyue Lou, Minggui Teng, Yixin Yang, Boxin Shi
PDF
All-in-One Image Restoration for Unknown Degradations Using Adaptive Discriminative Filters for Specific Degradations Dongwon Park, Byung Hyun Lee, Se Young Chun
PDF
ALOFT: A Lightweight MLP-like Architecture with Dynamic Low-Frequency Transform for Domain Generalization Jintao Guo, Na Wang, Lei Qi, Yinghuan Shi
PDF
ALSO: Automotive LiDAR Self-Supervision by Occupancy Estimation Alexandre Boulch, Corentin Sautier, Björn Michele, Gilles Puy, Renaud Marlet
PDF
AltFreezing for More General Video Face Forgery Detection Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Houqiang Li
PDF
ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction Zhen Wang, Shijie Zhou, Jeong Joon Park, Despoina Paschalidou, Suya You, Gordon Wetzstein, Leonidas Guibas, Achuta Kadambi
PDF
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection Chang Liu, Weiming Zhang, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Xiaomao Li, Errui Ding, Jingdong Wang
PDF
Ambiguous Medical Image Segmentation Using Diffusion Models Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M. Patel
PDF
AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation Zhen Li, Zuo-Liang Zhu, Ling-Hao Han, Qibin Hou, Chun-Le Guo, Ming-Ming Cheng
PDF
An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity Zhao Xie, Tian Gao, Kewei Wu, Jiao Chang
PDF
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu
PDF
An Erudite Fine-Grained Visual Classification Model Dongliang Chang, Yujun Tong, Ruoyi Du, Timothy Hospedales, Yi-Zhe Song, Zhanyu Ma
PDF
An Image Quality Assessment Dataset for Portraits Nicolas Chahine, Stefania Calarasanu, Davide Garcia-Civiero, Théo Cayla, Sira Ferradans, Jean Ponce
PDF
An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions Weijia Li, Saihui Hou, Chunjie Zhang, Chunshui Cao, Xu Liu, Yongzhen Huang, Yao Zhao
PDF
Analyzing and Diagnosing Pose Estimation with Attributions Qiyuan He, Linlin Yang, Kerui Gu, Qiuxia Lin, Angela Yao
PDF
Analyzing Physical Impacts Using Transient Surface Wave Imaging Tianyuan Zhang, Mark Sheinin, Dorian Chan, Mark Rau, Matthew O’Toole, Srinivasa G. Narasimhan
PDF
Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection Shaofei Huang, Zhenwei Shen, Zehao Huang, Zi-han Ding, Jiao Dai, Jizhong Han, Naiyan Wang, Si Liu
PDF
AnchorFormer: Point Cloud Completion from Discriminative Nodes Zhikai Chen, Fuchen Long, Zhaofan Qiu, Ting Yao, Wengang Zhou, Jiebo Luo, Tao Mei
PDF
ANetQA: A Large-Scale Benchmark for Fine-Grained Compositional Reasoning over Untrimmed Videos Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu
PDF
Angelic Patches for Improving Third-Party Object Detector Performance Wenwen Si, Shuo Li, Sangdon Park, Insup Lee, Osbert Bastani
PDF
Annealing-Based Label-Transfer Learning for Open World Object Detection Yuqing Ma, Hainan Li, Zhange Zhang, Jinyang Guo, Shanghang Zhang, Ruihao Gong, Xianglong Liu
PDF
AnyFlow: Arbitrary Scale Optical Flow with Implicit Neural Representation Hyunyoung Jung, Zhuo Hui, Lei Luo, Haitao Yang, Feng Liu, Sungjoo Yoo, Rakesh Ranjan, Denis Demandolx
PDF
Architectural Backdoors in Neural Networks Mikel Bober-Irizar, Ilia Shumailov, Yiren Zhao, Robert Mullins, Nicolas Papernot
PDF
Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning Zixuan Hu, Li Shen, Zhenyi Wang, Tongliang Liu, Chun Yuan, Dacheng Tao
PDF
ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation Zicong Fan, Omid Taheri, Dimitrios Tzionas, Muhammed Kocabas, Manuel Kaufmann, Michael J. Black, Otmar Hilliges
PDF
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning Wei Ji, Renjie Liang, Zhedong Zheng, Wenqiao Zhang, Shengyu Zhang, Juncheng Li, Mengze Li, Tat-seng Chua
PDF
Are Data-Driven Explanations Robust Against Out-of-Distribution Data? Tang Li, Fengchun Qiao, Mengmeng Ma, Xi Peng
PDF
Are Deep Neural Networks SMARTer than Second Graders? Anoop Cherian, Kuan-Chuan Peng, Suhas Lohit, Kevin A. Smith, Joshua B. Tenenbaum
PDF
Are We Ready for Vision-Centric Driving Streaming Perception? the ASAP Benchmark Xiaofeng Wang, Zheng Zhu, Yunpeng Zhang, Guan Huang, Yun Ye, Wenbo Xu, Ziwei Chen, Xingang Wang
PDF
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu
PDF
ARO-Net: Learning Implicit Fields from Anchored Radial Observations Yizhi Wang, Zeyu Huang, Ariel Shamir, Hui Huang, Hao Zhang, Ruizhen Hu
PDF
AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers Zechuan Li, Hongshan Yu, Zhengeng Yang, Tongjia Chen, Naveed Akhtar
PDF
ASPnet: Action Segmentation with Shared-Private Representation of Multiple Data Sources Beatrice van Amsterdam, Abdolrahim Kadkhodamohammadi, Imanol Luengo, Danail Stoyanov
PDF
AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation Takehiko Ohkawa, Kun He, Fadime Sener, Tomas Hodan, Luan Tran, Cem Keskin
PDF
AstroNet: When Astrocyte Meets Artificial Neural Network Mengqiao Han, Liyuan Pan, Xiabi Liu
PDF
AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection Yipeng Gao, Kun-Yu Lin, Junkai Yan, Yaowei Wang, Wei-Shi Zheng
PDF
Asymmetric Feature Fusion for Image Retrieval Hui Wu, Min Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
PDF
Attention-Based Point Cloud Edge Sampling Chengzhi Wu, Junwei Zheng, Julius Pfrommer, Jürgen Beyerer
PDF
AttentionShift: Iteratively Estimated Part-Based Attention mAP for Pointly Supervised Instance Segmentation Mingxiang Liao, Zonghao Guo, Yuze Wang, Peng Yuan, Bailan Feng, Fang Wan
PDF
Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization Simone Barattin, Christos Tzelepis, Ioannis Patras, Nicu Sebe
PDF
AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning Runqi Wang, Xiaoyue Duan, Guoliang Kang, Jianzhuang Liu, Shaohui Lin, Songcen Xu, Jinhu Lü, Baochang Zhang
PDF
Audio-Visual Grouping Network for Sound Localization from Mixtures Shentong Mo, Yapeng Tian
PDF
Augmentation Matters: A Simple-yet-Effective Approach to Semi-Supervised Semantic Segmentation Zhen Zhao, Lihe Yang, Sifan Long, Jimin Pi, Luping Zhou, Jingdong Wang
PDF
AUNet: Learning Relations Between Action Units for Face Forgery Detection Weiming Bai, Yufan Liu, Zhipeng Zhang, Bing Li, Weiming Hu
PDF
Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-Time Mobile Telepresence Yonggan Fu, Yuecheng Li, Chenghui Li, Jason Saragih, Peizhao Zhang, Xiaoliang Dai, Yingyan Lin
PDF
AutoAD: Movie Description in Context Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
PDF
AutoFocusFormer: Image Segmentation Off the Grid Chen Ziwen, Kaushik Patnaik, Shuangfei Zhai, Alvin Wan, Zhile Ren, Alexander G. Schwing, Alex Colburn, Li Fuxin
PDF
AutoLabel: CLIP-Based Framework for Open-Set Video Domain Adaptation Giacomo Zara, Subhankar Roy, Paolo Rota, Elisa Ricci
PDF
Automatic High Resolution Wire Segmentation and Removal Mang Tik Chiu, Xuaner Zhang, Zijun Wei, Yuqian Zhou, Eli Shechtman, Connelly Barnes, Zhe Lin, Florian Kainz, Sohrab Amirghodsi, Humphrey Shi
PDF
Autonomous Manipulation Learning for Similar Deformable Objects via Only One Demonstration Yu Ren, Ronghan Chen, Yang Cong
PDF
AutoRecon: Automated 3D Object Discovery and Reconstruction Yuang Wang, Xingyi He, Sida Peng, Haotong Lin, Hujun Bao, Xiaowei Zhou
PDF
Autoregressive Visual Tracking Xing Wei, Yifan Bai, Yongchao Zheng, Dahu Shi, Yihong Gong
PDF
Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali Thabet, Artsiom Sanakoyeu
PDF
AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction Aggelina Chatziagapi, Dimitris Samaras
PDF
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid
PDF
Azimuth Super-Resolution for FMCW Radar in Autonomous Driving Yu-Jhe Li, Shawn Hunt, Jinhyung Park, Matthew O’Toole, Kris Kitani
PDF
B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution Byeonghyun Pak, Jaewon Lee, Kyong Hwan Jin
PDF
BAAM: Monocular 3D Pose and Shape Reconstruction with Bi-Contextual Attention Module and Attention-Guided Modeling Hyo-Jun Lee, Hanul Kim, Su-Min Choi, Seong-Gyun Jeong, Yeong Jun Koh
PDF
Back to the Source: Diffusion-Driven Adaptation to Test-Time Corruption Jin Gao, Jialing Zhang, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang
PDF
Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger Yi Yu, Yufei Wang, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot
PDF
Backdoor Cleansing with Unlabeled Data Lu Pang, Tao Sun, Haibin Ling, Chao Chen
PDF
Backdoor Defense via Adaptively Splitting Poisoned Dataset Kuofeng Gao, Yang Bai, Jindong Gu, Yong Yang, Shu-Tao Xia
PDF
Backdoor Defense via Deconfounded Representation Learning Zaixi Zhang, Qi Liu, Zhicai Wang, Zepu Lu, Qingyong Hu
PDF
BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields Peng Wang, Lingzhe Zhao, Ruijie Ma, Peidong Liu
PDF
BAEFormer: Bi-Directional and Early Interaction Transformers for Bird's Eye View Semantic Segmentation Cong Pan, Yonghao He, Junran Peng, Qian Zhang, Wei Sui, Zhaoxiang Zhang
PDF
Balanced Energy Regularization Loss for Out-of-Distribution Detection Hyunjun Choi, Hawook Jeong, Jin Young Choi
PDF
Balanced Product of Calibrated Experts for Long-Tailed Recognition Emanuel Sanchez Aimar, Arvi Jonnarth, Michael Felsberg, Marco Kuhlmann
PDF
Balanced Spherical Grid for Egocentric View Synthesis Changwoon Choi, Sang Min Kim, Young Min Kim
PDF
Balancing Logit Variation for Long-Tailed Semantic Segmentation Yuchao Wang, Jingjing Fei, Haochen Wang, Wei Li, Tianpeng Bao, Liwei Wu, Rui Zhao, Yujun Shen
PDF
BASiS: Batch Aligned Spectral Embedding Space Or Streicher, Ido Cohen, Guy Gilboa
PDF
Batch Model Consolidation: A Multi-Task Model Consolidation Framework Iordanis Fostiropoulos, Jiaye Zhu, Laurent Itti
PDF
Bayesian Posterior Approximation with Stochastic Ensembles Oleksandr Balabanov, Bernhard Mehlig, Hampus Linander
PDF
BBDM: Image-to-Image Translation with Brownian Bridge Diffusion Models Bo Li, Kaitao Xue, Bin Liu, Yu-Kun Lai
PDF
BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion Michael J. Black, Priyanka Patel, Joachim Tesch, Jinlong Yang
PDF
Behavioral Analysis of Vision-and-Language Navigation Agents Zijiao Yang, Arjun Majumdar, Stefan Lee
PDF
Behind the Scenes: Density Fields for Single View Reconstruction Felix Wimbauer, Nan Yang, Christian Rupprecht, Daniel Cremers
PDF
Being Comes from Not-Being: Open-Vocabulary Text-to-Motion Generation with Wordless Training Junfan Lin, Jianlong Chang, Lingbo Liu, Guanbin Li, Liang Lin, Qi Tian, Chang-Wen Chen
PDF
Benchmarking Robustness of 3D Object Detection to Common Corruptions Yinpeng Dong, Caixin Kang, Jinlai Zhang, Zijian Zhu, Yikai Wang, Xiao Yang, Hang Su, Xingxing Wei, Jun Zhu
PDF
Benchmarking Self-Supervised Learning on Diverse Pathology Datasets Mingu Kang, Heon Song, Seonwook Park, Donggeun Yoo, Sérgio Pereira
PDF
Best of Both Worlds: Multimodal Contrastive Learning with Tabular and Imaging Data Paul Hager, Martin J. Menten, Daniel Rueckert
PDF
Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution Xuhai Chen, Jiangning Zhang, Chao Xu, Yabiao Wang, Chengjie Wang, Yong Liu
PDF
BEV-Guided Multi-Modality Fusion for Driving Perception Yunze Man, Liang-Yan Gui, Yu-Xiong Wang
PDF
BEV-LaneDet: An Efficient 3D Lane Detection Based on Virtual Camera via Key-Points Ruihao Wang, Jian Qin, Kaiying Li, Yaochen Li, Dong Cao, Jintao Xu
PDF
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks Xiaowei Chi, Jiaming Liu, Ming Lu, Rongyu Zhang, Zhaoqing Wang, Yandong Guo, Shanghang Zhang
PDF
BEV@DC: Bird's-Eye View Assisted Training for Depth Completion Wending Zhou, Xu Yan, Yinghong Liao, Yuankai Lin, Jin Huang, Gangming Zhao, Shuguang Cui, Zhen Li
PDF
BEVFormer V2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai
PDF
BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection Lei Yang, Kaicheng Yu, Tao Tang, Jun Li, Kun Yuan, Li Wang, Xinyu Zhang, Peng Chen
PDF
Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks Weihua Chen, Xianzhe Xu, Jian Jia, Hao Luo, Yaohua Wang, Fan Wang, Rong Jin, Xiuyu Sun
PDF
Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers Sifan Long, Zhen Zhao, Jimin Pi, Shengsheng Wang, Jingdong Wang
PDF
Beyond mAP: Towards Better Evaluation of Instance Segmentation Rohit Jena, Lukas Zhornyak, Nehal Doiphode, Pratik Chaudhari, Vivek Buch, James Gee, Jianbo Shi
PDF
Bi-Directional Distribution Alignment for Transductive Zero-Shot Learning Zhicai Wang, Yanbin Hao, Tingting Mu, Ouxiang Li, Shuo Wang, Xiangnan He
PDF
Bi-Directional Feature Fusion Generative Adversarial Network for Ultra-High Resolution Pathological Image Virtual Re-Staining Kexin Sun, Zhineng Chen, Gongwei Wang, Jun Liu, Xiongjun Ye, Yu-Gang Jiang
PDF
Bi-Level Meta-Learning for Few-Shot Domain Generalization Xiaorong Qin, Xinhang Song, Shuqiang Jiang
PDF
Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection Yingjie Wang, Jiajun Deng, Yao Li, Jinshui Hu, Cong Liu, Yu Zhang, Jianmin Ji, Wanli Ouyang, Yanyong Zhang
PDF
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao
PDF
Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures Eugenia Iofinova, Alexandra Peste, Dan Alistarh
PDF
Bias Mimicking: A Simple Sampling Approach for Bias Mitigation Maan Qraitem, Kate Saenko, Bryan A. Plummer
PDF
Bias-Eliminating Augmentation Learning for Debiased Federated Learning Yuan-Yi Xu, Ci-Siang Lin, Yu-Chiang Frank Wang
PDF
BiasAdv: Bias-Adversarial Augmentation for Model Debiasing Jongin Lim, Youngdong Kim, Byungjai Kim, Chanho Ahn, Jinwoo Shin, Eunho Yang, Seungju Han
PDF
BiasBed - Rigorous Texture Bias Evaluation Nikolai Kalischek, Rodrigo Caye Daudt, Torben Peters, Reinhard Furrer, Jan D. Wegner, Konrad Schindler
PDF
BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency Shuo Yang, Zhaopan Xu, Kai Wang, Yang You, Hongxun Yao, Tongliang Liu, Min Xu
PDF
Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation Yunhao Bai, Duowen Chen, Qingli Li, Wei Shen, Yan Wang
PDF
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-Trained Vision-Language Models Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang
PDF
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4k Video Frame Interpolation Junheum Park, Jintae Kim, Chang-Su Kim
PDF
BiFormer: Vision Transformer with Bi-Level Routing Attention Lei Zhu, Xinjiang Wang, Zhanghan Ke, Wayne Zhang, Rynson W.H. Lau
PDF
Bilateral Memory Consolidation for Continual Learning Xing Nie, Shixiong Xu, Xiyan Liu, Gaofeng Meng, Chunlei Huo, Shiming Xiang
PDF
Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis Xiuwei Xu, Ziwei Wang, Jie Zhou, Jiwen Lu
PDF
Binary Latent Diffusion Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu
PDF
Biomechanics-Guided Facial Action Unit Detection Through Force Modeling Zijun Cui, Chenyi Kuang, Tian Gao, Kartik Talamadupula, Qiang Ji
PDF
BioNet: A Biologically-Inspired Network for Face Recognition Pengyu Li
PDF
Bit-Shrinking: Limiting Instantaneous Sharpness for Improving Post-Training Quantization Chen Lin, Bo Peng, Zheyang Li, Wenming Tan, Ye Ren, Jun Xiao, Shiliang Pu
PDF
BITE: Beyond Priors for Improved Three-D Dog Pose Estimation Nadine Rüegg, Shashank Tripathi, Konrad Schindler, Michael J. Black, Silvia Zuffi
PDF
Bitstream-Corrupted JPEG Images Are Restorable: Two-Stage Compensation and Alignment Framework for Image Restoration Wenyang Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau
PDF
BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos Jennifer J. Sun, Lili Karashchuk, Amil Dravid, Serim Ryou, Sonia Fereidooni, John C. Tuthill, Aggelos Katsaggelos, Bingni W. Brunton, Georgia Gkioxari, Ann Kennedy, Yisong Yue, Pietro Perona
PDF
Black-Box Sparse Adversarial Attack via Multi-Objective Optimisation Phoenix Neale Williams, Ke Li
PDF
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek Lim, Geunyoung Jung, Jiyoung Jung, Hosik Choi, Kyungwoo Song
PDF
Blemish-Aware and Progressive Face Retouching with Limited Paired Data Lianxin Xie, Wen Xue, Zhen Xu, Si Wu, Zhiwen Yu, Hau San Wong
PDF
BlendFields: Few-Shot Example-Driven Facial Modeling Kacper Kania, Stephan J. Garbin, Andrea Tagliasacchi, Virginia Estellers, Kwang Moo Yi, Julien Valentin, Tomasz Trzciński, Marek Kowalski
PDF
Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective Weixia Zhang, Guangtao Zhai, Ying Wei, Xiaokang Yang, Kede Ma
PDF
Blind Video Deflickering by Neural Filtering with a Flawed Atlas Chenyang Lei, Xuanchi Ren, Zhaoxiang Zhang, Qifeng Chen
PDF
Block Selection Method for Using Feature Norm in Out-of-Distribution Detection Yeonguk Yu, Sungho Shin, Seongju Lee, Changhyun Jun, Kyoobin Lee
PDF
Blowing in the Wind: CycleNet for Human Cinemagraphs from Still Images Hugo Bertiche, Niloy J. Mitra, Kuldeep Kulkarni, Chun-Hao P. Huang, Tuanfeng Y. Wang, Meysam Madadi, Sergio Escalera, Duygu Ceylan
PDF
Blur Interpolation Transformer for Real-World Motion from Blur Zhihang Zhong, Mingdeng Cao, Xiang Ji, Yinqiang Zheng, Imari Sato
PDF
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization Chong Yu, Tao Chen, Zhongxue Gan, Jiayuan Fan
PDF
Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation Bo Huang, Mingyang Chen, Yi Wang, Junda Lu, Minhao Cheng, Wei Wang
PDF
Boosting Detection in Crowd Analysis via Underutilized Output Features Shaokai Wu, Fengyu Yang
PDF
Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training with Saliency Prompt Hao Li, Dingwen Zhang, Nian Liu, Lechao Cheng, Yalun Dai, Chao Zhang, Xinggang Wang, Junwei Han
PDF
Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data Yuhao Chen, Xin Tan, Borui Zhao, Zhaowei Chen, Renjie Song, Jiajun Liang, Xuequan Lu
PDF
Boosting Transductive Few-Shot Fine-Tuning with Margin-Based Uncertainty Weighting and Probability Regularization Ran Tao, Hao Chen, Marios Savvides
PDF
Boosting Verified Training for Robust Image Classifications via Abstraction Zhaodi Zhang, Zhiyi Xue, Yang Chen, Si Liu, Yueling Zhang, Jing Liu, Min Zhang
PDF
Boosting Video Object Segmentation via Space-Time Correspondence Learning Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang
PDF
Boosting Weakly-Supervised Temporal Action Localization with Text Information Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Xiaoyu Wang, Xinbo Gao
PDF
Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery Muli Yang, Liancheng Wang, Cheng Deng, Hanwang Zhang
PDF
Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping Long Lian, Zhirong Wu, Stella X. Yu
PDF
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation Xu Zheng, Jinjing Zhu, Yexin Liu, Zidong Cao, Chong Fu, Lin Wang
PDF
Boundary Unlearning: Rapid Forgetting of Deep Networks via Shifting the Decision Boundary Min Chen, Weizhuo Gao, Gaoyang Liu, Kai Peng, Chen Wang
PDF
Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval Tan Pan, Furong Xu, Xudong Yang, Sifeng He, Chen Jiang, Qingpei Guo, Feng Qian, Xiaobo Zhang, Yuan Cheng, Lei Yang, Wei Chu
PDF
Boundary-Enhanced Co-Training for Weakly Supervised Semantic Segmentation Shenghai Rong, Bohai Tu, Zilei Wang, Junjie Li
PDF
Box-Level Active Detection Mengyao Lyu, Jundong Zhou, Hui Chen, Yijie Huang, Dongdong Yu, Yaqian Li, Yandong Guo, Yuchen Guo, Liuyu Xiang, Guiguang Ding
PDF
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu
PDF
Breaching FedMD: Image Recovery via Paired-Logits Inversion Attack Hideaki Takahashi, Jingjing Liu, Yang Liu
PDF
Breaking the "Object" in Video Object Segmentation Pavel Tokmakov, Jie Li, Adrien Gaidon
PDF
Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection Muhammad Akhtar Munir, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan
PDF
Bridging Search Region Interaction with Template for RGB-T Tracking Tianrui Hui, Zizheng Xun, Fengguang Peng, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Jiao Dai, Jizhong Han, Si Liu
PDF
Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification Youngwook Kim, Jae Myung Kim, Jieun Jeong, Cordelia Schmid, Zeynep Akata, Jungwoo Lee
PDF
Bringing Inputs to Shared Domains for 3D Interacting Hands Recovery in the Wild Gyeongsik Moon
PDF
BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration Sheng Ao, Qingyong Hu, Hanyun Wang, Kai Xu, Yulan Guo
PDF
Building Rearticulable Models for Arbitrary 3D Objects from 4D Point Clouds Shaowei Liu, Saurabh Gupta, Shenlong Wang
PDF
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Müller, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield
PDF
BUOL: A Bottom-up Framework with Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction from a Single Image Tao Chu, Pan Zhang, Qiong Liu, Jiaqi Wang
PDF
Burstormer: Burst Image Restoration and Enhancement Transformer Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang
PDF
C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation Nazmul Karim, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi, Han-pang Chiu, Supun Samarasekera, Nazanin Rahnavard
PDF
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input Senmao Tian, Ming Lu, Jiaming Liu, Yandong Guo, Yurong Chen, Shunli Zhang
PDF
CafeBoost: Causal Feature Boost to Eliminate Task-Induced Bias for Class Incremental Learning Benliu Qiu, Hongliang Li, Haitao Wen, Heqian Qiu, Lanxiao Wang, Fanman Meng, Qingbo Wu, Lili Pan
PDF
Camouflaged Instance Segmentation via Explicit De-Camouflaging Naisong Luo, Yuwen Pan, Rui Sun, Tianzhu Zhang, Zhiwei Xiong, Feng Wu
PDF
Camouflaged Object Detection with Feature Decomposition and Edge Reconstruction Chunming He, Kai Li, Yachao Zhang, Longxiang Tang, Yulun Zhang, Zhenhua Guo, Xiu Li
PDF
CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis Juntian Zheng, Qingyuan Zheng, Lixing Fang, Yun Liu, Li Yi
PDF
Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders Zeyang Sha, Xinlei He, Ning Yu, Michael Backes, Yang Zhang
PDF
Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, Madhava Krishna, Srinath Sridhar
PDF
CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer Linfeng Wen, Chengying Gao, Changqing Zou
PDF
CAP: Robust Point Cloud Classification via Semantic and Structural Modeling Daizong Ding, Erling Jiang, Yuanmin Huang, Mi Zhang, Wenxuan Li, Min Yang
PDF
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? Wenhao Wu, Haipeng Luo, Bo Fang, Jingdong Wang, Wanli Ouyang
PDF
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang
PDF
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai
PDF
CaPriDe Learning: Confidential and Private Decentralized Learning Based on Encryption-Friendly Distillation Loss Nurbek Tastan, Karthik Nandakumar
PDF
CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects Nick Heppert, Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu, Rares Andrei Ambrus, Jeannette Bohg, Abhinav Valada, Thomas Kollar
PDF
Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization Mengyuan Chen, Junyu Gao, Changsheng Xu
PDF
Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution Hao-Wei Chen, Yu-Syuan Xu, Min-Fong Hong, Yi-Min Tsai, Hsien-Kai Kuo, Chun-Yi Lee
PDF
CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective Junwen Xiong, Ganglai Wang, Peng Zhang, Wei Huang, Yufei Zha, Guangtao Zhai
PDF
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference Haoran You, Yunyang Xiong, Xiaoliang Dai, Bichen Wu, Peizhao Zhang, Haoqi Fan, Peter Vajda, Yingyan Lin
PDF
CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection Shuailei Ma, Yuefeng Wang, Ying Wei, Jiaqi Fan, Thomas H. Li, Hongli Liu, Fanbing Lv
PDF
Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder Xinmiao Lin, Yikang Li, Jenhao Hsiao, Chiuman Ho, Yu Kong
PDF
Category Query Learning for Human-Object Interaction Classification Chi Xie, Fangao Zeng, Yue Hu, Shuang Liang, Yichen Wei
PDF
Causally-Aware Intraoperative Imputation for Overall Survival Time Prediction Xiang Li, Xuelin Qian, Litian Liang, Lingjie Kong, Qiaole Dong, Jiejun Chen, Dingxia Liu, Xiuzhong Yao, Yanwei Fu
PDF
CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes Harshil Bhatia, Edith Tretschk, Zorah Lähner, Marcel Seelbach Benkner, Michael Moeller, Christian Theobalt, Vladislav Golyanik
PDF
CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Shuang Xu, Zudi Lin, Radu Timofte, Luc Van Gool
PDF
CelebV-Text: A Large-Scale Facial Text-Video Dataset Jianhui Yu, Hao Zhu, Liming Jiang, Chen Change Loy, Weidong Cai, Wayne Wu
PDF
Center Focusing Network for Real-Time LiDAR Panoptic Segmentation Xiaoyan Li, Gang Zhang, Boyue Wang, Yongli Hu, Baocai Yin
PDF
CF-Font: Content Fusion for Few-Shot Font Generation Chi Wang, Min Zhou, Tiezheng Ge, Yuning Jiang, Hujun Bao, Weiwei Xu
PDF
CFA: Class-Wise Calibrated Fair Adversarial Training Zeming Wei, Yifei Wang, Yiwen Guo, Yisen Wang
PDF
Change-Aware Sampling and Contrastive Learning for Satellite Images Utkarsh Mall, Bharath Hariharan, Kavita Bala
PDF
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna Ithapu
PDF
CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning Jianlong Wu, Haozhe Yang, Tian Gan, Ning Ding, Feijun Jiang, Liqiang Nie
PDF
CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution Jiezhang Cao, Qin Wang, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc Van Gool
PDF
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning Yiting Cheng, Fangyun Wei, Jianmin Bao, Dong Chen, Wenqiang Zhang
PDF
CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection Yabo Liu, Jinghua Wang, Chao Huang, Yaowei Wang, Yong Xu
PDF
CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions Ming Yan, Xin Wang, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang
PDF
CIRCLE: Capture in Rich Contextual Environments João Pedro Araújo, Jiaman Li, Karthik Vetrivel, Rishi Agarwal, Jiajun Wu, Deepak Gopinath, Alexander William Clegg, Karen Liu
PDF
CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose Xu Zhang, Wen Wang, Zhe Chen, Yufei Xu, Jing Zhang, Dacheng Tao
PDF
Class Adaptive Network Calibration Bingyuan Liu, Jérôme Rony, Adrian Galdran, Jose Dolz, Ismail Ben Ayed
PDF
Class Attention Transfer Based Knowledge Distillation Ziyao Guo, Haonan Yan, Hui Li, Xiaodong Lin
PDF
Class Balanced Adaptive Pseudo Labeling for Federated Semi-Supervised Learning Ming Li, Qingli Li, Yan Wang
PDF
Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos Rohit Gupta, Anirban Roy, Claire Christensen, Sujeong Kim, Sarah Gerard, Madeline Cincebeaux, Ajay Divakaran, Todd Grindal, Mubarak Shah
PDF
Class Relationship Embedded Learning for Source-Free Unsupervised Domain Adaptation Yixin Zhang, Zilei Wang, Weinan He
PDF
Class-Balancing Diffusion Models Yiming Qin, Huangjie Zheng, Jiangchao Yao, Mingyuan Zhou, Ya Zhang
PDF
Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition Zhipeng Zhou, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Wei Gong
PDF
Class-Incremental Exemplar Compression for Class-Incremental Learning Zilin Luo, Yaoyao Liu, Bernt Schiele, Qianru Sun
PDF
CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, Yi-Zhe Song
PDF
CLIP Is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation Yuqi Lin, Minghao Chen, Wenxiao Wang, Boxi Wu, Ke Li, Binbin Lin, Haifeng Liu, Xiaofei He
PDF
CLIP the Gap: A Single Domain Generalization Approach for Object Detection Vidit Vidit, Martin Engilberge, Mathieu Salzmann
PDF
CLIP-S4: Language-Guided Self-Supervised Semantic Segmentation Wenbin He, Suphanut Jamonnak, Liang Gou, Liu Ren
PDF
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language Aditya Sanghi, Rao Fu, Vivian Liu, Karl D.D. Willis, Hooman Shayani, Amir H. Khasahmadi, Srinath Sridhar, Daniel Ritchie
PDF
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu
PDF
CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search Fahad Shamshad, Muzammal Naseer, Karthik Nandakumar
PDF
CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP Runnan Chen, Youquan Liu, Lingdong Kong, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao, Wenping Wang
PDF
CLIPPING: Distilling CLIP-Based Models with a Student Base for Video-Language Retrieval Renjing Pei, Jianzhuang Liu, Weimian Li, Bin Shao, Songcen Xu, Peng Dai, Juwei Lu, Youliang Yan
PDF
CLIPPO: Image-and-Language Understanding from Pixels Only Michael Tschannen, Basil Mustafa, Neil Houlsby
PDF
CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu
PDF
CLOTH4D: A Dataset for Clothed Human Reconstruction Xingxing Zou, Xintong Han, Waikeung Wong
PDF
Clothed Human Performance Capture with a Double-Layer Neural Radiance Fields Kangkan Wang, Guofeng Zhang, Suxu Cong, Jian Yang
PDF
Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World Yulu Gan, Mingjie Pan, Rongyu Zhang, Zijian Ling, Lingran Zhao, Jiaming Liu, Shanghang Zhang
PDF
Clover: Towards a Unified Video-Language Alignment and Fusion Model Jingjia Huang, Yinan Li, Jiashi Feng, Xinglong Wu, Xiaoshuai Sun, Rongrong Ji
PDF
CNVid-3.5m: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset Tian Gan, Qing Wang, Xingning Dong, Xiangyuan Ren, Liqiang Nie, Qingpei Guo
PDF
Co-Salient Object Detection with Uncertainty-Aware Group Exchange-Masking Yang Wu, Huihui Song, Bo Liu, Kaihua Zhang, Dong Liu
PDF
Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM Hengyi Wang, Jingwen Wang, Lourdes Agapito
PDF
Co-Speech Gesture Synthesis by Reinforcement Learning with Contrastive Pre-Trained Rewards Mingyang Sun, Mengchen Zhao, Yaqing Hou, Minglei Li, Huang Xu, Songcen Xu, Jianye Hao
PDF
Co-Training 2l Submodels for Visual Recognition Hugo Touvron, Matthieu Cord, Maxime Oquab, Piotr Bojanowski, Jakob Verbeek, Hervé Jégou
PDF
Coaching a Teachable Student Jimuyang Zhang, Zanming Huang, Eshed Ohn-Bar
PDF
CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning James Seale Smith, Leonid Karlinsky, Vyshnavi Gutta, Paola Cascante-Bonilla, Donghyun Kim, Assaf Arbelle, Rameswar Panda, Rogerio Feris, Zsolt Kira
PDF
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior Jinbo Xing, Menghan Xia, Yuechen Zhang, Xiaodong Cun, Jue Wang, Tien-Tsin Wong
PDF
Collaboration Helps Camera Overtake LiDAR in 3D Detection Yue Hu, Yifan Lu, Runsheng Xu, Weidi Xie, Siheng Chen, Yanfeng Wang
PDF
Collaborative Diffusion for Multi-Modal Face Generation and Editing Ziqi Huang, Kelvin C.K. Chan, Yuming Jiang, Ziwei Liu
PDF
Collaborative Noisy Label Cleaner: Learning Scene-Aware Trailers for Multi-Modal Highlight Detection in Movies Bei Gan, Xiujun Shu, Ruizhi Qiao, Haoqian Wu, Keyu Chen, Hanjun Li, Bo Ren
PDF
Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding Zihang Lin, Chaolei Tan, Jian-Fang Hu, Zhi Jin, Tiancai Ye, Wei-Shi Zheng
PDF
Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception Junyu Gao, Mengyuan Chen, Changsheng Xu
PDF
Color Backdoor: A Robust Poisoning Attack in Color Space Wenbo Jiang, Hongwei Li, Guowen Xu, Tianwei Zhang
PDF
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation Ruixuan Cong, Da Yang, Rongshan Chen, Sizhe Wang, Zhenglong Cui, Hao Sheng
PDF
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation Fabio Cermelli, Matthieu Cord, Arthur Douillard
PDF
Command-Driven Articulated Object Understanding and Manipulation Ruihang Chu, Zhengzhe Liu, Xiaoqing Ye, Xiao Tan, Xiaojuan Qi, Chi-Wing Fu, Jiaya Jia
PDF
Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories Samarth Sinha, Roman Shapovalov, Jeremy Reizenstein, Ignacio Rocco, Natalia Neverova, Andrea Vedaldi, David Novotny
PDF
Compacting Binary Neural Networks by Sparse Kernel Selection Yikai Wang, Wenbing Huang, Yinpeng Dong, Fuchun Sun, Anbang Yao
PDF
Complementary Intrinsics from Neural Radiance Fields and CNNs for Outdoor Scene Relighting Siqi Yang, Xuanning Cui, Yongjie Zhu, Jiajun Tang, Si Li, Zhaofei Yu, Boxin Shi
PDF
Complete 3D Human Reconstruction from a Single Incomplete Image Junying Wang, Jae Shin Yoon, Tuanfeng Y. Wang, Krishna Kumar Singh, Ulrich Neumann
PDF
Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning Zhuoyang Zhang, Yuhao Dong, Yunze Liu, Li Yi
PDF
CompletionFormer: Depth Completion with Convolutions and Vision Transformers Youmin Zhang, Xianda Guo, Matteo Poggi, Zheng Zhu, Guan Huang, Stefano Mattoccia
PDF
Complexity-Guided Slimmable Decoder for Efficient Deep Video Compression Zhihao Hu, Dong Xu
PDF
Compositor: Bottom-up Clustering and Compositing for Robust Part and Object Segmentation Ju He, Jieneng Chen, Ming-Xian Lin, Qihang Yu, Alan L. Yuille
PDF
Comprehensive and Delicate: An Efficient Transformer for Image Restoration Haiyu Zhao, Yuanbiao Gou, Boyun Li, Dezhong Peng, Jiancheng Lv, Xi Peng
PDF
Compressing Volumetric Radiance Fields to 1 MB Lingzhi Li, Zhen Shen, Zhongshu Wang, Li Shen, Liefeng Bo
PDF
Compression-Aware Video Super-Resolution Yingwei Wang, Takashi Isobe, Xu Jia, Xin Tao, Huchuan Lu, Yu-Wing Tai
PDF
Computational Flash Photography Through Intrinsics Sepideh Sarajian Maralan, Chris Careaga, Yagiz Aksoy
PDF
Computationally Budgeted Continual Learning: What Does Matter? Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet K. Dokania, Philip H.S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi
PDF
Conditional Generation of Audio from Video via Foley Analogies Yuexi Du, Ziyang Chen, Justin Salamon, Bryan Russell, Andrew Owens
PDF
Conditional Image-to-Video Generation with Latent Flow Diffusion Models Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min
PDF
Conditional Text Image Generation with Diffusion Models Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao
PDF
Confidence-Aware Personalized Federated Learning via Variational Expectation Maximization Junyi Zhu, Xingchen Ma, Matthew B. Blaschko
PDF
Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation Zicheng Wang, Zhen Zhao, Xiaoxia Xing, Dong Xu, Xiangyu Kong, Luping Zhou
PDF
Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching Paul Roetzer, Zorah Lähner, Florian Bernard
PDF
Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries Yuanwen Yue, Theodora Kontogianni, Konrad Schindler, Francis Engelmann
PDF
Connecting Vision and Language with Video Localized Narratives Paul Voigtlaender, Soravit Changpinyo, Jordi Pont-Tuset, Radu Soricut, Vittorio Ferrari
PDF
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection Benjin Zhu, Zhe Wang, Shaoshuai Shi, Hang Xu, Lanqing Hong, Hongsheng Li
PDF
Consistent Direct Time-of-Flight Video Depth Super-Resolution Zhanghao Sun, Wei Ye, Jinhui Xiong, Gyeongmin Choe, Jialiang Wang, Shuochen Su, Rakesh Ranjan
PDF
Consistent View Synthesis with Pose-Guided Diffusion Models Hung-Yu Tseng, Qinbo Li, Changil Kim, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf
PDF
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object Detection Xinjiang Wang, Xingyi Yang, Shilong Zhang, Yijiang Li, Litong Feng, Shijie Fang, Chengqi Lyu, Kai Chen, Wayne Zhang
PDF
Constrained Evolutionary Diffusion Filter for Monocular Endoscope Tracking Xiongbiao Luo
PDF
ConStruct-VL: Data-Free Continual Structured VL Concepts Learning James Seale Smith, Paola Cascante-Bonilla, Assaf Arbelle, Donghyun Kim, Rameswar Panda, David Cox, Diyi Yang, Zsolt Kira, Rogerio Feris, Leonid Karlinsky
PDF
Constructing Deep Spiking Neural Networks from Artificial Neural Networks with Knowledge Distillation Qi Xu, Yaxin Li, Jiangrong Shen, Jian K. Liu, Huajin Tang, Gang Pan
PDF
Content-Aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers Chenyang Lu, Daan de Geus, Gijs Dubbelman
PDF
Context De-Confounded Emotion Recognition Dingkang Yang, Zhaoyu Chen, Yuzheng Wang, Shunli Wang, Mingcheng Li, Siao Liu, Xiao Zhao, Shuai Huang, Zhiyan Dong, Peng Zhai, Lihua Zhang
PDF
Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training Zhao Jin, Munawar Hayat, Yuwei Yang, Yulan Guo, Yinjie Lei
PDF
Context-Aware Pretraining for Efficient Blind Image Decomposition Chao Wang, Zhedong Zheng, Ruijie Quan, Yifan Sun, Yi Yang
PDF
Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation Anwesa Choudhuri, Girish Chowdhary, Alexander G. Schwing
PDF
Context-Based Trit-Plane Coding for Progressive Image Compression Seungmin Jeon, Kwang Pyo Choi, Youngo Park, Chang-Su Kim
PDF
Continual Detection Transformer for Incremental Object Detection Yaoyao Liu, Bernt Schiele, Andrea Vedaldi, Christian Rupprecht
PDF
Continual Semantic Segmentation with Automatic Memory Sample Selection Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu
PDF
Continuous Intermediate Token Learning with Implicit Motion Manifold for Keyframe Based Motion Interpolation Clinton A. Mo, Kun Hu, Chengjiang Long, Zhiyong Wang
PDF
Continuous Landmark Detection with 3D Queries Prashanth Chandran, Gaspard Zoss, Paulo Gotardo, Derek Bradley
PDF
Continuous Pseudo-Label Rectified Domain Adaptive Semantic Segmentation with Implicit Neural Representations Rui Gong, Qin Wang, Martin Danelljan, Dengxin Dai, Luc Van Gool
PDF
Continuous Sign Language Recognition with Correlation Network Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng
PDF
ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-Real Novel View Synthesis via Contrastive Learning Hao Yang, Lanqing Hong, Aoxue Li, Tianyang Hu, Zhenguo Li, Gim Hee Lee, Liwei Wang
PDF
Contrastive Grouping with Transformer for Referring Image Segmentation Jiajin Tang, Ge Zheng, Cheng Shi, Sibei Yang
PDF
Contrastive Mean Teacher for Domain Adaptive Object Detectors Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang
PDF
Contrastive Semi-Supervised Learning for Underwater Image Restoration via Reliable Bank Shirui Huang, Keyan Wang, Huan Liu, Jun Chen, Yunsong Li
PDF
Controllable Light Diffusion for Portraits David Futschik, Kelvin Ritland, James Vecore, Sean Fanello, Sergio Orts-Escolano, Brian Curless, Daniel Sýkora, Rohit Pandey
PDF
Controllable Mesh Generation Through Sparse Latent Point Diffusion Models Zhaoyang Lyu, Jinyi Wang, Yuwei An, Ya Zhang, Dahua Lin, Bo Dai
PDF
ConvNeXt V2: Co-Designing and Scaling ConvNets with Masked Autoencoders Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie
PDF
ConZIC: Controllable Zero-Shot Image Captioning by Sampling-Based Polishing Zequn Zeng, Hao Zhang, Ruiying Lu, Dongsheng Wang, Bo Chen, Zhengjue Wang
PDF
Cooperation or Competition: Avoiding Player Domination for Multi-Target Robustness via Adaptive Budgets Yimu Wang, Dinghuai Zhang, Yihan Wu, Heng Huang, Hongyang Zhang
PDF
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li
PDF
CoralStyleCLIP: Co-Optimized Region and Layer Selection for Image Editing Ambareesh Revanur, Debraj Basu, Shradha Agrawal, Dhwanit Agarwal, Deepak Pai
PDF
Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning Sungnyun Kim, Sangmin Bae, Se-Young Yun
PDF
Correlational Image Modeling for Self-Supervised Visual Pre-Training Wei Li, Jiahao Xie, Chen Change Loy
PDF
Correspondence Transformers with Asymmetric Feature Learning and Matching Flow Super-Resolution Yixuan Sun, Dongyang Zhao, Zhangyue Yin, Yiwen Huang, Tao Gui, Wenqiang Zhang, Weifeng Ge
PDF
COT: Unsupervised Domain Adaptation with Clustering and Optimal Transport Yang Liu, Zhipeng Zhou, Baigui Sun
PDF
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation Samir Yitzhak Gadre, Mitchell Wortsman, Gabriel Ilharco, Ludwig Schmidt, Shuran Song
PDF
CP3: Channel Pruning Plug-in for Point-Based Networks Yaomin Huang, Ning Liu, Zhengping Che, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Guixu Zhang, Xinmei Liu, Feifei Feng, Jian Tang
PDF
CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability Fadi Boutros, Meiling Fang, Marcel Klemt, Biying Fu, Naser Damer
PDF
CRAFT: Concept Recursive Activation FacTorization for Explainability Thomas Fel, Agustin Picard, Louis Béthune, Thibaut Boissin, David Vigouroux, Julien Colin, Rémi Cadène, Thomas Serre
PDF
CREPE: Can Vision-Language Foundation Models Reason Compositionally? Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna
PDF
Critical Learning Periods for Multisensory Integration in Deep Networks Michael Kleinman, Alessandro Achille, Stefano Soatto
PDF
CrOC: Cross-View Online Clustering for Dense Visual Representation Learning Thomas Stegmüller, Tim Lebailly, Behzad Bozorgtabar, Tinne Tuytelaars, Jean-Philippe Thiran
PDF
Cross-Domain 3D Hand Pose Estimation with Dual Modalities Qiuxia Lin, Linlin Yang, Angela Yao
PDF
Cross-Domain Image Captioning with Discriminative Finetuning Roberto Dessì, Michele Bevilacqua, Eleonora Gualdoni, Nathanaël Carraz Rakotonirina, Francesca Franzon, Marco Baroni
PDF
Cross-GAN Auditing: Unsupervised Identification of Attribute Level Similarities and Differences Between Pretrained Generative Models Matthew L. Olson, Shusen Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Weng-Keen Wong
PDF
Cross-Guided Optimization of Radiance Fields with Multi-View Image Super-Resolution for High-Resolution Novel View Synthesis Youngho Yoon, Kuk-Jin Yoon
PDF
Cross-Image-Attention for Conditional Embeddings in Deep Metric Learning Dmytro Kotovenko, Pingchuan Ma, Timo Milbich, Björn Ommer
PDF
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval Ding Jiang, Mang Ye
PDF
Crossing the Gap: Domain Generalization for Image Captioning Yuchen Ren, Zhendong Mao, Shancheng Fang, Yan Lu, Tong He, Hao Du, Yongdong Zhang, Wanli Ouyang
PDF
Crowd3D: Towards Hundreds of People Reconstruction from a Single Image Hao Wen, Jing Huang, Huili Cui, Haozhe Lin, Yu-Kun Lai, Lu Fang, Kun Li
PDF
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model Dingkang Liang, Jiahao Xie, Zhikang Zou, Xiaoqing Ye, Wei Xu, Xiang Bai
PDF
CUDA: Convolution-Based Unlearnable Datasets Vinu Sankar Sadasivan, Mahdi Soltanolkotabi, Soheil Feizi
PDF
CUF: Continuous Upsampling Filters Cristina N. Vasconcelos, Cengiz Oztireli, Mark Matthews, Milad Hashemi, Kevin Swersky, Andrea Tagliasacchi
PDF
Curricular Contrastive Regularization for Physics-Aware Single Image Dehazing Yu Zheng, Jiahui Zhan, Shengfeng He, Junyu Dong, Yong Du
PDF
Curricular Object Manipulation in LiDAR-Based Object Detection Ziyue Zhu, Qiang Meng, Xiao Wang, Ke Wang, Liujiang Yan, Jian Yang
PDF
Curvature-Balanced Feature Manifold Learning for Long-Tailed Classification Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Lingling Li
PDF
Cut and Learn for Unsupervised Object Detection and Instance Segmentation Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra
PDF
CutMIB: Boosting Light Field Super-Resolution via Multi-View Image Blending Zeyu Xiao, Yutong Liu, Ruisheng Gao, Zhiwei Xiong
PDF
CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition with Variational Alignment Jiangbin Zheng, Yile Wang, Cheng Tan, Siyuan Li, Ge Wang, Jun Xia, Yidong Chen, Stan Z. Li
PDF
CXTrack: Improving 3D Point Cloud Tracking with Contextual Information Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang
PDF
D2Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based Transformers Jianfeng He, Yuan Gao, Tianzhu Zhang, Zhe Zhang, Feng Wu
PDF
DA Wand: Distortion-Aware Selection Using Neural Mesh Parameterization Richard Liu, Noam Aigerman, Vladimir G. Kim, Rana Hanocka
PDF
DA-DETR: Domain Adaptive Detection Transformer with Information Fusion Jingyi Zhang, Jiaxing Huang, Zhipeng Luo, Gongjie Zhang, Xiaoqin Zhang, Shijian Lu
PDF
DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer Ping Chen, Xingpeng Zhang, Ye Li, Ju Tao, Bin Xiao, Bing Wang, Zongjie Jiang
PDF
DaFKD: Domain-Aware Federated Knowledge Distillation Haozhao Wang, Yichen Li, Wenchao Xu, Ruixuan Li, Yufeng Zhan, Zhigang Zeng
PDF
DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering Zongrui Li, Qian Zheng, Boxin Shi, Gang Pan, Xudong Jiang
PDF
DARE-GRAM: Unsupervised Domain Adaptation Regression by Aligning Inverse Gram Matrices Ismail Nejjar, Qin Wang, Olga Fink
PDF
DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks Samyak Jain, Sravanti Addepalli, Pawan Kumar Sahu, Priyam Dey, R. Venkatesh Babu
PDF
DartBlur: Privacy Preservation with Detection Artifact Suppression Baowei Jiang, Bing Bai, Haozhe Lin, Yu Wang, Yuchen Guo, Lu Fang
PDF
Data-Driven Feature Tracking for Event Cameras Nico Messikommer, Carter Fang, Mathias Gehrig, Davide Scaramuzza
PDF
Data-Efficient Large Scale Place Recognition with Graded Similarity Supervision María Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov
PDF
Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint Shikang Yu, Jiachen Chen, Hu Han, Shuqiang Jiang
PDF
Data-Free Sketch-Based Image Retrieval Abhra Chaudhuri, Ayan Kumar Bhunia, Yi-Zhe Song, Anjan Dutta
PDF
DATE: Domain Adaptive Product Seeker for E-Commerce Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao
PDF
DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model Gwanghyun Kim, Se Young Chun
PDF
DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields Yu Chen, Gim Hee Lee
PDF
DC2: Dual-Camera Defocus Control by Learning to Refocus Hadi Alzayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar
PDF
DCFace: Synthetic Face Generation with Dual Condition Diffusion Model Minchul Kim, Feng Liu, Anil Jain, Xiaoming Liu
PDF
Dealing with Cross-Task Class Discrimination in Online Continual Learning Yiduo Guo, Bing Liu, Dongyan Zhao
PDF
DeAR: Debiasing Vision-Language Models with Additive Residuals Ashish Seth, Mayur Hemani, Chirag Agarwal
PDF
Decentralized Learning with Multi-Headed Distillation Andrey Zhmoginov, Mark Sandler, Nolan Miller, Gus Kristiansen, Max Vladymyrov
PDF
DeCo: Decomposition and Reconstruction for Compositional Temporal Grounding via Coarse-to-Fine Contrastive Ranking Lijin Yang, Quan Kong, Hsuan-Kung Yang, Wadim Kehl, Yoichi Sato, Norimasa Kobori
PDF
Decompose More and Aggregate Better: Two Closer Looks at Frequency Representation Learning for Human Motion Prediction Xuehao Gao, Shaoyi Du, Yang Wu, Yang Yang
PDF
Decompose, Adjust, Compose: Effective Normalization by Playing with Frequency for Domain Generalization Sangrok Lee, Jongseong Bae, Ha Young Kim
PDF
Decomposed Cross-Modal Distillation for RGB-Based Temporal Action Detection Pilhyeon Lee, Taeoh Kim, Minho Shim, Dongyoon Wee, Hyeran Byun
PDF
Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning Xiaocheng Lu, Song Guo, Ziming Liu, Jingcai Guo
PDF
Decoupled Multimodal Distilling for Emotion Recognition Yong Li, Yuanzhi Wang, Zhen Cui
PDF
Decoupled Semantic Prototypes Enable Learning from Diverse Annotation Types for Semi-Weakly Segmentation in Expert-Driven Domains Simon Reiß, Constantin Seibold, Alexander Freytag, Erik Rodner, Rainer Stiefelhagen
PDF
Decoupling Human and Camera Motion from Videos in the Wild Vickie Ye, Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa
PDF
Decoupling Learning and Remembering: A Bilevel Memory Framework with Knowledge Projection for Task-Incremental Learning Wenju Sun, Qingyong Li, Jing Zhang, Wen Wang, Yangli-ao Geng
PDF
Decoupling MaxLogit for Out-of-Distribution Detection Zihan Zhang, Xiang Xiang
PDF
Decoupling-and-Aggregating for Image Exposure Correction Yang Wang, Long Peng, Liang Li, Yang Cao, Zheng-Jun Zha
PDF
Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit Xiaohang Wang, Xuanhong Chen, Bingbing Ni, Hang Wang, Zhengyan Tong, Yutian Liu
PDF
Deep Curvilinear Editing: Commutative and Nonlinear Image Manipulation for Pretrained Deep Generative Model Takehiro Aoshima, Takashi Matsubara
PDF
Deep Depth Estimation from Thermal Image Ukcheol Shin, Jinsun Park, In So Kweon
PDF
Deep Deterministic Uncertainty: A New Simple Baseline Jishnu Mukhoti, Andreas Kirsch, Joost van Amersfoort, Philip H.S. Torr, Yarin Gal
PDF
Deep Discriminative Spatial and Temporal Network for Efficient Video Deblurring Jinshan Pan, Boming Xu, Jiangxin Dong, Jianjun Ge, Jinhui Tang
PDF
Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU Supervision Qi Ming, Lingjuan Miao, Zhe Ma, Lin Zhao, Zhiqiang Zhou, Xuhui Huang, Yuanpei Chen, Yufei Guo
PDF
Deep Factorized Metric Learning Chengkun Wang, Wenzhao Zheng, Junlong Li, Jie Zhou, Jiwen Lu
PDF
Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric Pengxin Zeng, Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Xi Peng
PDF
Deep Frequency Filtering for Domain Generalization Shiqi Lin, Zhizheng Zhang, Zhipeng Huang, Yan Lu, Cuiling Lan, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Amey Parulkar, Viraj Navkal, Zhibo Chen
PDF
Deep Graph Reprogramming Yongcheng Jing, Chongbin Yuan, Li Ju, Yiding Yang, Xinchao Wang, Dacheng Tao
PDF
Deep Graph-Based Spatial Consistency for Robust Non-Rigid Point Cloud Registration Zheng Qin, Hao Yu, Changjian Wang, Yuxing Peng, Kai Xu
PDF
Deep Hashing with Minimal-Distance-Separated Hash Centers Liangdao Wang, Yan Pan, Cong Liu, Hanjiang Lai, Jian Yin, Ye Liu
PDF
Deep Incomplete Multi-View Clustering with Cross-View Partial Sample and Prototype Alignment Jiaqi Jin, Siwei Wang, Zhibin Dong, Xinwang Liu, En Zhu
PDF
Deep Learning of Partial Graph Matching via Differentiable Top-K Runzhong Wang, Ziao Guo, Shaofei Jiang, Xiaokang Yang, Junchi Yan
PDF
Deep Polarization Reconstruction with PDAVIS Events Haiyang Mei, Zuowen Wang, Xin Yang, Xiaopeng Wei, Tobi Delbruck
PDF
Deep Random Projector: Accelerated Deep Image Prior Taihui Li, Hengkang Wang, Zhong Zhuang, Ju Sun
PDF
Deep Semi-Supervised Metric Learning with Mixed Label Propagation Furen Zhuang, Pierre Moulin
PDF
Deep Stereo Video Inpainting Zhiliang Wu, Changchang Sun, Hanyu Xuan, Yan Yan
PDF
DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients Rémi Pautrat, Daniel Barath, Viktor Larsson, Martin R. Oswald, Marc Pollefeys
PDF
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network Xuan Shen, Yaohua Wang, Ming Lin, Yilun Huang, Hao Tang, Xiuyu Sun, Yanzhi Wang
PDF
DeepMapping2: Self-Supervised Large-Scale LiDAR mAP Optimization Chao Chen, Xinhao Liu, Yiming Li, Li Ding, Chen Feng
PDF
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting Maoyuan Ye, Jing Zhang, Shanshan Zhao, Juhua Liu, Tongliang Liu, Bo Du, Dacheng Tao
PDF
DeepVecFont-V2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality Yuqing Wang, Yizhi Wang, Longhui Yu, Yuesheng Zhu, Zhouhui Lian
PDF
DeFeeNet: Consecutive 3D Human Motion Prediction with Deviation Feedback Xiaoning Sun, Huaijiang Sun, Bin Li, Dong Wei, Weiqing Li, Jianfeng Lu
PDF
Defending Against Patch-Based Backdoor Attacks on Self-Supervised Learning Ajinkya Tejankar, Maziar Sanjabi, Qifan Wang, Sinong Wang, Hamed Firooz, Hamed Pirsiavash, Liang Tan
PDF
Defining and Quantifying the Emergence of Sparse Concepts in DNNs Jie Ren, Mingjie Li, Qirui Chen, Huiqi Deng, Quanshi Zhang
PDF
Deformable Mesh Transformer for 3D Human Mesh Recovery Yusuke Yoshiyasu
PDF
DegAE: A New Pretraining Paradigm for Low-Level Vision Yihao Liu, Jingwen He, Jinjin Gu, Xiangtao Kong, Yu Qiao, Chao Dong
PDF
DeGPR: Deep Guided Posterior Regularization for Multi-Class Cell Detection and Counting Aayush Kumar Tyagi, Chirag Mohapatra, Prasenjit Das, Govind Makharia, Lalita Mehra, Prathosh Ap, Mausam
PDF
DejaVu: Conditional Regenerative Learning to Enhance Dense Prediction Shubhankar Borse, Debasmit Das, Hyojin Park, Hong Cai, Risheek Garrepalli, Fatih Porikli
PDF
Delivering Arbitrary-Modal Semantic Segmentation Jiaming Zhang, Ruiping Liu, Hao Shi, Kailun Yang, Simon Reiß, Kunyu Peng, Haodong Fu, Kaiwei Wang, Rainer Stiefelhagen
PDF
Delving into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling Yulin Liu, Haoran Liu, Yingda Yin, Yang Wang, Baoquan Chen, He Wang
PDF
Delving into Shape-Aware Zero-Shot Semantic Segmentation Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou
PDF
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint Hongyu Liu, Yibing Song, Qifeng Chen
PDF
Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression Junho Kim, Byung-Kwan Lee, Yong Man Ro
PDF
Dense Distinct Query for End-to-End Object Detection Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen
PDF
Dense Network Expansion for Class Incremental Learning Zhiyuan Hu, Yunsheng Li, Jiancheng Lyu, Dashan Gao, Nuno Vasconcelos
PDF
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline Tiantian Geng, Teng Wang, Jinming Duan, Runmin Cong, Feng Zheng
PDF
Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection Qianjiang Hu, Daizong Liu, Wei Hu
PDF
DepGraph: Towards Any Structural Pruning Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
PDF
Depth Estimation from Camera Image and mmWave Radar Point Cloud Akash Deep Singh, Yunhao Ba, Ankur Sarker, Howard Zhang, Achuta Kadambi, Stefano Soatto, Mani Srivastava, Alex Wong
PDF
Depth Estimation from Indoor Panoramas with Neural Scene Representation Wenjie Chang, Yueyi Zhang, Zhiwei Xiong
PDF
DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection Xuan Zhang, Shiyu Li, Xi Li, Ping Huang, Jiulong Shan, Ting Chen
PDF
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-Training via Word-Region Alignment Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu
PDF
Detecting and Grounding Multi-Modal Media Manipulation Rui Shao, Tianxing Wu, Ziwei Liu
PDF
Detecting Backdoors During the Inference Stage Based on Corruption Robustness Consistency Xiaogeng Liu, Minghui Li, Haoyu Wang, Shengshan Hu, Dengpan Ye, Hai Jin, Libing Wu, Chaowei Xiao
PDF
Detecting Backdoors in Pre-Trained Encoders Shiwei Feng, Guanhong Tao, Siyuan Cheng, Guangyu Shen, Xiangzhe Xu, Yingqi Liu, Kaiyuan Zhang, Shiqing Ma, Xiangyu Zhang
PDF
Detecting Everything in the Open World: Towards Universal Object Detection Zhenyu Wang, Yali Li, Xi Chen, Ser-Nam Lim, Antonio Torralba, Hengshuang Zhao, Shengjin Wang
PDF
Detecting Human-Object Contact in Images Yixin Chen, Sai Kumar Dwivedi, Michael J. Black, Dimitrios Tzionas
PDF
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang
PDF
Detection of Out-of-Distribution Samples Using Binary Neuron Activation Patterns Bartłomiej Olber, Krystian Radlak, Adam Popowicz, Michal Szczepankiewicz, Krystian Chachuła
PDF
DETR with Additional Global Aggregation for Cross-Domain Weakly Supervised Object Detection Zongheng Tang, Yifan Sun, Si Liu, Yi Yang
PDF
DETRs with Hybrid Matching Ding Jia, Yuhui Yuan, Haodi He, Xiaopei Wu, Haojun Yu, Weihong Lin, Lei Sun, Chao Zhang, Han Hu
PDF
Devil Is in the Queries: Advancing Mask Transformers for Real-World Medical Image Segmentation and Out-of-Distribution Localization Mingze Yuan, Yingda Xia, Hexin Dong, Zifan Chen, Jiawen Yao, Mingyan Qiu, Ke Yan, Xiaoli Yin, Yu Shi, Xin Chen, Zaiyi Liu, Bin Dong, Jingren Zhou, Le Lu, Ling Zhang, Li Zhang
PDF
Devil's on the Edges: Selective Quad Attention for Scene Graph Generation Deunsol Jung, Sanghyun Kim, Won Hwa Kim, Minsu Cho
PDF
DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated Objects Chen Bao, Helin Xu, Yuzhe Qin, Xiaolong Wang
PDF
DF-Platter: Multi-Face Heterogeneous Deepfake Dataset Kartik Narayan, Harsh Agarwal, Kartik Thakral, Surbhi Mittal, Mayank Vatsa, Richa Singh
PDF
DiffCollage: Parallel Generation of Large Content with Diffusion Models Qinsheng Zhang, Jiaming Song, Xun Huang, Yongxin Chen, Ming-Yu Liu
PDF
Differentiable Architecture Search with Random Features Xuanyang Zhang, Yonggang Li, Xiangyu Zhang, Yongtao Wang, Jian Sun
PDF
Differentiable Shadow Mapping for Efficient Inverse Graphics Markus Worchel, Marc Alexa
PDF
Difficulty-Based Sampling for Debiased Contrastive Representation Learning Taeuk Jang, Xiaoqian Wang
PDF
DiffPose: Toward More Reliable 3D Pose Estimation Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu
PDF
DiffRF: Rendering-Guided 3D Radiance Field Diffusion Norman Müller, Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder, Matthias Nießner
PDF
DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion Wenliang Zhao, Yongming Rao, Weikang Shi, Zuyan Liu, Jie Zhou, Jiwen Lu
PDF
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation Shuai Shen, Wenliang Zhao, Zibin Meng, Wanhua Li, Zheng Zhu, Jie Zhou, Jiwen Lu
PDF
Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, Tom Goldstein
PDF
Diffusion Probabilistic Model Made Slim Xingyi Yang, Daquan Zhou, Jiashi Feng, Xinchao Wang
PDF
Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang
PDF
Diffusion-Based Generation, Optimization, and Planning in 3D Scenes Siyuan Huang, Zan Wang, Puhao Li, Baoxiong Jia, Tengyu Liu, Yixin Zhu, Wei Liang, Song-Chun Zhu
PDF
Diffusion-Based Signed Distance Fields for 3D Shape Generation Jaehyeok Shim, Changwoo Kang, Kyungdon Joo
PDF
Diffusion-SDF: Text-to-Shape via Voxelized Diffusion Muheng Li, Yueqi Duan, Jie Zhou, Jiwen Lu
PDF
DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models Jamie Wynn, Daniyar Turmukhambetov
PDF
DiffusionRig: Learning Personalized Priors for Facial Appearance Editing Zheng Ding, Xuaner Zhang, Zhihao Xia, Lars Jebe, Zhuowen Tu, Xiuming Zhang
PDF
DIFu: Depth-Guided Implicit Function for Clothed Human Reconstruction Dae-Young Song, HeeKyung Lee, Jeongil Seo, Donghyeon Cho
PDF
DiGA: Distil to Generalize and Then Adapt for Domain Adaptive Semantic Segmentation Fengyi Shen, Akhil Gurram, Ziyuan Liu, He Wang, Alois Knoll
PDF
DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection Jiawei Ma, Yulei Niu, Jincheng Xu, Shiyuan Huang, Guangxing Han, Shih-Fu Chang
PDF
Dimensionality-Varying Diffusion Process Han Zhang, Ruili Feng, Zhantao Yang, Lianghua Huang, Yu Liu, Yifei Zhang, Yujun Shen, Deli Zhao, Jingren Zhou, Fan Cheng
PDF
DINER: Depth-Aware Image-Based NEural Radiance Fields Malte Prinzler, Otmar Hilliges, Justus Thies
PDF
DINER: Disorder-Invariant Implicit Neural Representation Shaowen Xie, Hao Zhu, Zhen Liu, Qi Zhang, You Zhou, Xun Cao, Zhan Ma
PDF
DINN360: Deformable Invertible Neural Network for Latitude-Aware 360deg Image Rescaling Yichen Guo, Mai Xu, Lai Jiang, Leonid Sigal, Yunjin Chen
PDF
Dionysus: Recovering Scene Structures by Dividing into Semantic Pieces Likang Wang, Lei Chen
PDF
DIP: Dual Incongruity Perceiving Network for Sarcasm Detection Changsong Wen, Guoli Jia, Jufeng Yang
PDF
Directional Connectivity-Based Segmentation of Medical Images Ziyun Yang, Sina Farsiu
PDF
DISC: Learning from Noisy Labels via Dynamic Instance-Specific Selection and Correction Yifan Li, Hu Han, Shiguang Shan, Xilin Chen
PDF
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training Yihao Chen, Xianbiao Qi, Jianan Wang, Lei Zhang
PDF
DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis Yinghao Xu, Menglei Chai, Zifan Shi, Sida Peng, Ivan Skorokhodov, Aliaksandr Siarohin, Ceyuan Yang, Yujun Shen, Hsin-Ying Lee, Bolei Zhou, Sergey Tulyakov
PDF
Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering Chuanqi Zang, Hanqing Wang, Mingtao Pei, Wei Liang
PDF
Discrete Point-Wise Attack Is Not Enough: Generalized Manifold Adversarial Attack for Face Recognition Qian Li, Yuxiao Hu, Ye Liu, Dongxiao Zhang, Xin Jin, Yuntian Chen
PDF
Discriminating Known from Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder Aming Wu, Cheng Deng
PDF
Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection Long Li, Junwei Han, Ni Zhang, Nian Liu, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan
PDF
Discriminator-Cooperated Feature mAP Distillation for GAN Compression Tie Hu, Mingbao Lin, Lizhou You, Fei Chao, Rongrong Ji
PDF
Disentangled Representation Learning for Unsupervised Neural Quantization Haechan Noh, Sangeek Hyun, Woojin Jeong, Hanshin Lim, Jae-Pil Heo
PDF
Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness Zhijie Shen, Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Shuai Zheng, Yao Zhao
PDF
Disentangling Writer and Character Styles for Handwriting Generation Gang Dai, Yifan Zhang, Qingfeng Wang, Qing Du, Zhuliang Yu, Zhuoman Liu, Shuangping Huang
PDF
Distilling Cross-Temporal Contexts for Continuous Sign Language Recognition Leming Guo, Wanli Xue, Qing Guo, Bo Liu, Kaihua Zhang, Tiantian Yuan, Shengyong Chen
PDF
Distilling Focal Knowledge from Imperfect Expert for 3D Object Detection Jia Zeng, Li Chen, Hanming Deng, Lewei Lu, Junchi Yan, Yu Qiao, Hongyang Li
PDF
Distilling Neural Fields for Real-Time Articulated Shape Reconstruction Jeff Tan, Gengshan Yang, Deva Ramanan
PDF
Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification & Segmentation Dahyun Kang, Piotr Koniusz, Minsu Cho, Naila Murray
PDF
Distilling Vision-Language Pre-Training to Collaborate with Weakly-Supervised Temporal Action Localization Chen Ju, Kunhao Zheng, Jinxiang Liu, Peisen Zhao, Ya Zhang, Jianlong Chang, Qi Tian, Yanfeng Wang
PDF
DistilPose: Tokenized Pose Regression with Heatmap Distillation Suhang Ye, Yingyi Zhang, Jie Hu, Liujuan Cao, Shengchuan Zhang, Lei Shen, Jun Wang, Shouhong Ding, Rongrong Ji
PDF
DistractFlow: Improving Optical Flow Estimation via Realistic Distractions and Pseudo-Labeling Jisoo Jeong, Hong Cai, Risheek Garrepalli, Fatih Porikli
PDF
Distribution Shift Inversion for Out-of-Distribution Prediction Runpeng Yu, Songhua Liu, Xingyi Yang, Xinchao Wang
PDF
DisWOT: Student Architecture Search for Distillation WithOut Training Peijie Dong, Lujun Li, Zimian Wei
PDF
DivClust: Controlling Diversity in Deep Clustering Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, Ioannis Patras
PDF
Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement Xingqun Qi, Chen Liu, Muyi Sun, Lincheng Li, Changjie Fan, Xin Yu
PDF
Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-Identification Yukang Zhang, Hanzi Wang
PDF
Diversity-Aware Meta Visual Prompting Qidong Huang, Xiaoyi Dong, Dongdong Chen, Weiming Zhang, Feifei Wang, Gang Hua, Nenghai Yu
PDF
Diversity-Measurable Anomaly Detection Wenrui Liu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen
PDF
Divide and Adapt: Active Domain Adaptation via Customized Learning Duojun Huang, Jichang Li, Weikai Chen, Junshi Huang, Zhenhua Chai, Guanbin Li
PDF
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning Shi Chen, Qi Zhao
PDF
DKM: Dense Kernelized Feature Matching for Geometry Estimation Johan Edstedt, Ioannis Athanasiadis, Mårten Wadenbäck, Michael Felsberg
PDF
DKT: Diverse Knowledge Transfer Transformer for Class Incremental Learning Xinyuan Gao, Yuhang He, Songlin Dong, Jie Cheng, Xing Wei, Yihong Gong
PDF
DLBD: A Self-Supervised Direct-Learned Binary Descriptor Bin Xiao, Yang Hu, Bo Liu, Xiuli Bi, Weisheng Li, Xinbo Gao
PDF
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos Qi Zhao, M. Salman Asif, Zhan Ma
PDF
DNF: Decouple and Feedback Network for Seeing in the Dark Xin Jin, Ling-Hao Han, Zhen Li, Chun-Le Guo, Zhi Chai, Chongyi Li
PDF
Document Image Shadow Removal Guided by Color-Aware Background Ling Zhang, Yinghao He, Qing Zhang, Zheng Liu, Xiaolong Zhang, Chunxia Xiao
PDF
Domain Expansion of Image Generators Yotam Nitzan, Michaël Gharbi, Richard Zhang, Taesung Park, Jun-Yan Zhu, Daniel Cohen-Or, Eli Shechtman
PDF
Domain Generalized Stereo Matching via Hierarchical Visual Transformation Tianyu Chang, Xun Yang, Tianzhu Zhang, Meng Wang
PDF
Don't Lie to Me! Robust and Efficient Explainability with Verified Perturbation Analysis Thomas Fel, Melanie Ducoffe, David Vigouroux, Rémi Cadène, Mikaël Capelle, Claire Nicodème, Thomas Serre
PDF
DoNet: Deep De-Overlapping Network for Cytology Instance Segmentation Hao Jiang, Rushan Zhang, Yanning Zhou, Yumeng Wang, Hao Chen
PDF
Doubly Right Object Recognition: A Why Prompt for Visual Rationales Chengzhi Mao, Revant Teotia, Amrutha Sundar, Sachit Menon, Junfeng Yang, Xin Wang, Carl Vondrick
PDF
DP-NeRF: Deblurred Neural Radiance Field with Physical Scene Priors Dogyoon Lee, Minhyeok Lee, Chajin Shin, Sangyoun Lee
PDF
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing Youxin Pang, Yong Zhang, Weize Quan, Yanbo Fan, Xiaodong Cun, Ying Shan, Dong-Ming Yan
PDF
DPF: Learning Dense Prediction Fields with Weak Supervision Xiaoxue Chen, Yuhang Zheng, Yupeng Zheng, Qiang Zhou, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
PDF
DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration Zhixin Wang, Ziying Zhang, Xiaoyun Zhang, Huangjie Zheng, Mingyuan Zhou, Ya Zhang, Yanfeng Wang
PDF
DrapeNet: Garment Generation and Self-Supervised Draping Luca De Luigi, Ren Li, Benoît Guillard, Mathieu Salzmann, Pascal Fua
PDF
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao
PDF
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman
PDF
DropKey for Vision Transformer Bonan Li, Yinhan Hu, Xuecheng Nie, Congying Han, Xiangjian Jiang, Tiande Guo, Luoqi Liu
PDF
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks Qiangqiang Wu, Tianyu Yang, Ziquan Liu, Baoyuan Wu, Ying Shan, Antoni B. Chan
PDF
DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment Heyuan Li, Bo Wang, Yu Cheng, Mohan Kankanhalli, Robby T. Tan
PDF
DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets Haiyang Wang, Chen Shi, Shaoshuai Shi, Meng Lei, Sen Wang, Di He, Bernt Schiele, Liwei Wang
PDF
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval Xiaoshuai Hao, Wanqian Zhang, Dayan Wu, Fei Zhu, Bo Li
PDF
Dual-Bridging with Adversarial Noise Generation for Domain Adaptive rPPG Estimation Jingda Du, Si-Qi Liu, Bochao Zhang, Pong C. Yuen
PDF
Dual-Path Adaptation from Image to Video Transformers Jungin Park, Jiyoung Lee, Kwanghoon Sohn
PDF
DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium Antyanta Bangunharcana, Ahmed Magd, Kyung-Soo Kim
PDF
DualRel: Semi-Supervised Mitochondria Segmentation from a Prototype Perspective Huayu Mai, Rui Sun, Tianzhu Zhang, Zhiwei Xiong, Feng Wu
PDF
DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation Ying-Tian Liu, Zhifei Zhang, Yuan-Chen Guo, Matthew Fisher, Zhaowen Wang, Song-Hai Zhang
PDF
DyLiN: Making Light Field Networks Dynamic Heng Yu, Joel Julin, Zoltán Á. Milacski, Koichiro Niinuma, László A. Jeni
PDF
DynaFed: Tackling Client Data Heterogeneity with Global Dynamics Renjie Pi, Weizhong Zhang, Yueqi Xie, Jiahui Gao, Xiaoyu Wang, Sunghun Kim, Qifeng Chen
PDF
DynaMask: Dynamic Mask Selection for Instance Segmentation Ruihuang Li, Chenhang He, Shuai Li, Yabin Zhang, Lei Zhang
PDF
Dynamic Aggregated Network for Gait Recognition Kang Ma, Ying Fu, Dezhi Zheng, Chunshui Cao, Xuecai Hu, Yongzhen Huang
PDF
Dynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection Chang Xu, Jian Ding, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia
PDF
Dynamic Conceptional Contrastive Learning for Generalized Category Discovery Nan Pu, Zhun Zhong, Nicu Sebe
PDF
Dynamic Focus-Aware Positional Queries for Semantic Segmentation Haoyu He, Jianfei Cai, Zizheng Pan, Jing Liu, Jing Zhang, Dacheng Tao, Bohan Zhuang
PDF
Dynamic Generative Targeted Attacks with Pattern Injection Weiwei Feng, Nanqing Xu, Tianzhu Zhang, Yongdong Zhang
PDF
Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang
PDF
Dynamic Graph Learning with Content-Guided Spatial-Frequency Relation Reasoning for Deepfake Detection Yuan Wang, Kun Yu, Chen Chen, Xiyuan Hu, Silong Peng
PDF
Dynamic Inference with Grounding Based Vision and Language Models Burak Uzkent, Amanmeet Garg, Wentao Zhu, Keval Doshi, Jingru Yi, Xiaolong Wang, Mohamed Omar
PDF
Dynamic Neural Network for Multi-Task Learning Searching Across Diverse Network Topologies Wonhyeok Choi, Sunghoon Im
PDF
Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation Wei Wang, Zhun Zhong, Weijie Wang, Xi Chen, Charles Ling, Boyu Wang, Nicu Sebe
PDF
DynamicDet: A Unified Dynamic Architecture for Object Detection Zhihao Lin, Yongtao Wang, Jinhe Zhang, Xiaojie Chu
PDF
DynamicStereo: Consistent Dynamic Depth from Stereo Videos Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht
PDF
DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata Ehsan Pajouheshgar, Yitao Xu, Tong Zhang, Sabine Süsstrunk
PDF
DynIBaR: Neural Dynamic Image-Based Rendering Zhengqi Li, Qianqian Wang, Forrester Cole, Richard Tucker, Noah Snavely
PDF
E2PN: Efficient SE(3)-Equivariant Point Network Minghan Zhu, Maani Ghaffari, William A. Clark, Huei Peng
PDF
EC2: Emergent Communication for Embodied Control Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan
PDF
ECON: Explicit Clothed Humans Optimized via Normal Integration Yuliang Xiu, Jinlong Yang, Xu Cao, Dimitrios Tzionas, Michael J. Black
PDF
EcoTTA: Memory-Efficient Continual Test-Time Adaptation via Self-Distilled Regularization Junha Song, Jungsoo Lee, In So Kweon, Sungha Choi
PDF
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding Yanmin Wu, Xinhua Cheng, Renrui Zhang, Zesen Cheng, Jian Zhang
PDF
Edge-Aware Regional Message Passing Controller for Image Forgery Localization Dong Li, Jiaying Zhu, Menglu Wang, Jiawei Liu, Xueyang Fu, Zheng-Jun Zha
PDF
EDGE: Editable Dance Generation from Music Jonathan Tseng, Rodrigo Castellon, Karen Liu
PDF
Edges to Shapes to Concepts: Adversarial Augmentation for Robust Vision Aditay Tripathi, Rishubh Singh, Anirban Chakraborty, Pradeep Shenoy
PDF
EDICT: Exact Diffusion Inversion via Coupled Transformations Bram Wallace, Akash Gokul, Nikhil Naik
PDF
EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points Chengwei Zheng, Wenbin Lin, Feng Xu
PDF
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision Jiahui Lei, Congyue Deng, Karl Schmeckpeper, Leonidas Guibas, Kostas Daniilidis
PDF
Effective Ambiguity Attack Against Passport-Based DNN Intellectual Property Protection Schemes Through Fully Connected Layer Substitution Yiming Chen, Jinyu Tian, Xiangyu Chen, Jiantao Zhou
PDF
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration Yawei Li, Yuchen Fan, Xiaoyu Xiang, Denis Demandolx, Rakesh Ranjan, Radu Timofte, Luc Van Gool
PDF
Efficient Frequency Domain-Based Transformers for High-Quality Image Deblurring Lingshun Kong, Jiangxin Dong, Jianjun Ge, Mingqiang Li, Jinshan Pan
PDF
Efficient Hierarchical Entropy Model for Learned Point Cloud Compression Rui Song, Chunyang Fu, Shan Liu, Ge Li
PDF
Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based Attacks Yunrui Yu, Cheng-Zhong Xu
PDF
Efficient mAP Sparsification Based on 2D and 3D Discretized Grids Xiaoyu Zhang, Yun-Hui Liu
PDF
Efficient Mask Correction for Click-Based Interactive Image Segmentation Fei Du, Jianlong Yuan, Zhibin Wang, Fan Wang
PDF
Efficient Movie Scene Detection Using State-Space Transformers Md Mohaiminul Islam, Mahmudul Hasan, Kishan Shamsundar Athrey, Tony Braskich, Gedas Bertasius
PDF
Efficient Multimodal Fusion via Interactive Prompting Yaowei Li, Ruijie Quan, Linchao Zhu, Yi Yang
PDF
Efficient On-Device Training via Gradient Filtering Yuedong Yang, Guihong Li, Radu Marculescu
PDF
Efficient RGB-T Tracking via Cross-Modality Distillation Tianlu Zhang, Hongyuan Guo, Qiang Jiao, Qiang Zhang, Jungong Han
PDF
Efficient Robust Principal Component Analysis via Block Krylov Iteration and CUR Decomposition Shun Fang, Zhengqin Xu, Shiqian Wu, Shoulie Xie
PDF
Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis Thuan Hoang Nguyen, Thanh Van Le, Anh Tran
PDF
Efficient Second-Order Plane Adjustment Lipu Zhou
PDF
Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos Yubin Hu, Yuze He, Yanghao Li, Jisheng Li, Yuxing Han, Jiangtao Wen, Yong-Jin Liu
PDF
Efficient Verification of Neural Networks Against LVM-Based Specifications Harleen Hanspal, Alessio Lomuscio
PDF
Efficient View Synthesis and 3D-Based Multi-Frame Denoising with Multiplane Feature Representations Thomas Tanay, Aleš Leonardis, Matteo Maggioni
PDF
EfficientSCI: Densely Connected Network with Space-Time Factorization for Large-Scale Video Snapshot Compressive Imaging Lishun Wang, Miao Cao, Xin Yuan
PDF
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention Xinyu Liu, Houwen Peng, Ningxin Zheng, Yuqing Yang, Han Hu, Yixuan Yuan
PDF
Ego-Body Pose Estimation via Ego-Head Pose Estimation Jiaman Li, Karen Liu, Jiajun Wu
PDF
Egocentric Audio-Visual Object Localization Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu
PDF
Egocentric Auditory Attention Localization in Conversations Fiona Ryan, Hao Jiang, Abhinav Shukla, James M. Rehg, Vamsi Krishna Ithapu
PDF
Egocentric Video Task Translation Zihui Xue, Yale Song, Kristen Grauman, Lorenzo Torresani
PDF
Elastic Aggregation for Federated Optimization Dengsheng Chen, Jie Hu, Vince Junkai Tan, Xiaoming Wei, Enhua Wu
PDF
EMT-NAS:Transferring Architectural Knowledge Between Tasks from Different Datasets Peng Liao, Yaochu Jin, Wenli Du
PDF
End-to-End 3D Dense Captioning with Vote2Cap-DETR Sijin Chen, Hongyuan Zhu, Xin Chen, Yinjie Lei, Gang Yu, Tao Chen
PDF
End-to-End Vectorized HD-mAP Construction with Piecewise Bezier Curve Limeng Qiao, Wenjie Ding, Xi Qiu, Chi Zhang
PDF
End-to-End Video Matting with Trimap Propagation Wei-Lun Huang, Ming-Sui Lee
PDF
Endpoints Weight Fusion for Class Incremental Semantic Segmentation Jia-Wen Xiao, Chang-Bin Zhang, Jiekang Feng, Xialei Liu, Joost van de Weijer, Ming-Ming Cheng
PDF
Energy-Efficient Adaptive 3D Sensing Brevin Tilmon, Zhanghao Sun, Sanjeev J. Koppal, Yicheng Wu, Georgios Evangelidis, Ramzi Zahreddine, Gurunandan Krishnan, Sizhuo Ma, Jian Wang
PDF
Enhanced Multimodal Representation Learning with Cross-Modal KD Mengxi Chen, Linyu Xing, Yu Wang, Ya Zhang
PDF
Enhanced Stable View Synthesis Nishant Jain, Suryansh Kumar, Luc Van Gool
PDF
Enhanced Training of Query-Based Object Detection via Selective Query Recollection Fangyi Chen, Han Zhang, Kai Hu, Yu-Kai Huang, Chenchen Zhu, Marios Savvides
PDF
Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints Guilherme Potje, Felipe Cadar, André Araujo, Renato Martins, Erickson R. Nascimento
PDF
Enhancing Multiple Reliability Measures via Nuisance-Extended Information Bottleneck Jongheon Jeong, Sihyun Yu, Hankook Lee, Jinwoo Shin
PDF
Enhancing the Self-Universality for Transferable Targeted Attacks Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
PDF
Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition Jun Cen, Shiwei Zhang, Xiang Wang, Yixuan Pei, Zhiwu Qing, Yingya Zhang, Qifeng Chen
PDF
Ensemble-Based Blackbox Attacks on Dense Prediction Zikui Cai, Yaoteng Tan, M. Salman Asif
PDF
EqMotion: Equivariant Multi-Agent Motion Prediction with Invariant Interaction Reasoning Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang
PDF
Equiangular Basis Vectors Yang Shen, Xuhao Sun, Xiu-Shen Wei
PDF
Equivalent Transformation and Dual Stream Network Construction for Mobile Image Super-Resolution Jiahao Chao, Zhou Zhou, Hongfan Gao, Jiali Gong, Zhengfeng Yang, Zhenbing Zeng, Lydia Dehbi
PDF
ERM-KTP: Knowledge-Level Machine Unlearning via Knowledge Transfer Shen Lin, Xiaoyu Zhang, Chenyang Chen, Xiaofeng Chen, Willy Susilo
PDF
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts Zhida Feng, Zhenyu Zhang, Xintong Yu, Yewei Fang, Lanxin Li, Xuyi Chen, Yuxiang Lu, Jiaxiang Liu, Weichong Yin, Shikun Feng, Yu Sun, Li Chen, Hao Tian, Hua Wu, Haifeng Wang
PDF
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari, Camilla Carta, François Fleuret
PDF
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale Yuxin Fang, Wen Wang, Binhui Xie, Quan Sun, Ledell Wu, Xinggang Wang, Tiejun Huang, Xinlong Wang, Yue Cao
PDF
Evading DeepFake Detectors via Adversarial Statistical Consistency Yang Hou, Qing Guo, Yihao Huang, Xiaofei Xie, Lei Ma, Jianjun Zhao
PDF
Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces Fahad Shamshad, Koushik Srivatsan, Karthik Nandakumar
PDF
EVAL: Explainable Video Anomaly Localization Ashish Singh, Michael J. Jones, Erik G. Learned-Miller
PDF
Event-Based Blurry Frame Interpolation Under Blind Exposure Wenming Weng, Yueyi Zhang, Zhiwei Xiong
PDF
Event-Based Frame Interpolation with Ad-Hoc Deblurring Lei Sun, Christos Sakaridis, Jingyun Liang, Peng Sun, Jiezhang Cao, Kai Zhang, Qi Jiang, Kaiwei Wang, Luc Van Gool
PDF
Event-Based Shape from Polarization Manasi Muglikar, Leonard Bauersfeld, Diederik Paul Moeys, Davide Scaramuzza
PDF
Event-Based Video Frame Interpolation with Cross-Modal Asymmetric Bidirectional Motion Fields Taewoo Kim, Yujeong Chae, Hyun-Kurl Jang, Kuk-Jin Yoon
PDF
Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning Chengzhi Cao, Xueyang Fu, Hongjian Liu, Yukun Huang, Kunyu Wang, Jiebo Luo, Zheng-Jun Zha
PDF
EventNeRF: Neural Radiance Fields from a Single Colour Event Camera Viktor Rudnev, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik
PDF
Evolved Part Masking for Self-Supervised Learning Zhanzhou Feng, Shiliang Zhang
PDF
EvShutter: Transforming Events for Unconstrained Rolling Shutter Correction Julius Erbach, Stepan Tulyakov, Patricia Vitoria, Alfredo Bochicchio, Yuanyou Li
PDF
Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields Brian K. S. Isaac-Medina, Chris G. Willcocks, Toby P. Breckon
PDF
EXCALIBUR: Encouraging and Evaluating Embodied Exploration Hao Zhu, Raghav Kapoor, So Yeon Min, Winson Han, Jiatai Li, Kaiwen Geng, Graham Neubig, Yonatan Bisk, Aniruddha Kembhavi, Luca Weihs
PDF
Executing Your Commands via Motion Diffusion in Latent Space Xin Chen, Biao Jiang, Wen Liu, Zilong Huang, Bin Fu, Tao Chen, Gang Yu
PDF
Exemplar-FreeSOLO: Enhancing Unsupervised Instance Segmentation with Exemplars Taoseef Ishtiak, Qing En, Yuhong Guo
PDF
EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata Chenhao Zheng, Ayush Shrivastava, Andrew Owens
PDF
Explaining Image Classifiers with Multiscale Directional Image Representation Stefan Kolek, Robert Windesheim, Hector Andrade-Loarca, Gitta Kutyniok, Ron Levie
PDF
Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection Xincheng Yao, Ruoqi Li, Jing Zhang, Jun Sun, Chongyang Zhang
PDF
Explicit Visual Prompting for Low-Level Structure Segmentations Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun
PDF
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection Chen Zhang, Guorong Li, Yuankai Qi, Shuhui Wang, Laiyun Qing, Qingming Huang, Ming-Hsuan Yang
PDF
Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR Aneeshan Sain, Ayan Kumar Bhunia, Subhadeep Koley, Pinaki Nath Chowdhury, Soumitri Chattopadhyay, Tao Xiang, Yi-Zhe Song
PDF
Exploring and Exploiting Uncertainty for Incomplete Multi-View Classification Mengyao Xie, Zongbo Han, Changqing Zhang, Yichen Bai, Qinghua Hu
PDF
Exploring and Utilizing Pattern Imbalance Shibin Mei, Chenglong Zhao, Shengchao Yuan, Bingbing Ni
PDF
Exploring Data Geometry for Continual Learning Zhi Gao, Chen Xu, Feng Li, Yunde Jia, Mehrtash Harandi, Yuwei Wu
PDF
Exploring Discontinuity for Video Frame Interpolation Sangjin Lee, Hyeongmin Lee, Chajin Shin, Hanbin Son, Sangyoun Lee
PDF
Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation Yunqing Zhao, Chao Du, Milad Abdollahzadeh, Tianyu Pang, Min Lin, Shuicheng Yan, Ngai-Man Cheung
PDF
Exploring Intra-Class Variation Factors with Learnable Cluster Prompts for Semi-Supervised Image Synthesis Yunfei Zhang, Xiaoyang Huo, Tianyi Chen, Si Wu, Hau San Wong
PDF
Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation Kun Zhou, Wenbo Li, Xiaoguang Han, Jiangbo Lu
PDF
Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels Zixuan Ding, Ao Wang, Hui Chen, Qiang Zhang, Pengzhang Liu, Yongjun Bao, Weipeng Yan, Jungong Han
PDF
Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language Chuanhao Li, Zhen Li, Chenchen Jing, Yunde Jia, Yuwei Wu
PDF
Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization Aishan Liu, Shiyu Tang, Siyuan Liang, Ruihao Gong, Boxi Wu, Xianglong Liu, Dacheng Tao
PDF
expOSE: Accurate Initialization-Free Projective Factorization Using Exponential Regularization José Pedro Iglesias, Amanda Nilsson, Carl Olsson
PDF
Extracting Class Activation Maps from Non-Discriminative Features as Well Zhaozheng Chen, Qianru Sun
PDF
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, Limin Wang
PDF
F2-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories Peng Wang, Yuan Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang
PDF
FAC: 3D Representation Learning via Foreground Aware Feature Contrast Kangcheng Liu, Aoran Xiao, Xiaoqin Zhang, Shijian Lu, Ling Shao
PDF
FaceLit: Neural 3D Relightable Faces Anurag Ranjan, Kwang Moo Yi, Jen-Hao Rick Chang, Oncel Tuzel
PDF
Fair Federated Medical Image Segmentation via Client Contribution Estimation Meirui Jiang, Holger R. Roth, Wenqi Li, Dong Yang, Can Zhao, Vishwesh Nath, Daguang Xu, Qi Dou, Ziyue Xu
PDF
Fair Scratch Tickets: Finding Fair Sparse Networks Without Weight Training Pengwei Tang, Wei Yao, Zhicong Li, Yong Liu
PDF
Fake It till You Make It: Learning Transferable Representations from Synthetic ImageNet Clones Mert Bülent Sarıyıldız, Karteek Alahari, Diane Larlus, Yannis Kalantidis
PDF
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang
PDF
Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete Counterparts Nikolas Lamb, Cameron Palmer, Benjamin Molloy, Sean Banerjee, Natasha Kholgade Banerjee
PDF
FashionSAP: Symbols and Attributes Prompt for Fine-Grained Fashion Vision-Language Pre-Training Yunpeng Han, Lisai Zhang, Qingcai Chen, Zhijian Chen, Zhonghua Li, Jianxin Yang, Zhao Cao
PDF
Fast Contextual Scene Graph Generation with Unbiased Context Augmentation Tianlei Jin, Fangtai Guo, Qiwei Meng, Shiqiang Zhu, Xiangming Xi, Wen Wang, Zonghao Mu, Wei Song
PDF
Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids Wei Dong, Christopher Choy, Charles Loop, Or Litany, Yuke Zhu, Anima Anandkumar
PDF
Fast Point Cloud Generation with Straight Flows Lemeng Wu, Dilin Wang, Chengyue Gong, Xingchao Liu, Yunyang Xiong, Rakesh Ranjan, Raghuraman Krishnamoorthi, Vikas Chandra, Qiang Liu
PDF
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation Junjie He, Pengyu Li, Yifeng Geng, Xuansong Xie
PDF
FCC: Feature Clusters Compression for Long-Tailed Visual Recognition Jian Li, Ziyao Meng, Daqian Shi, Rui Song, Xiaolei Diao, Jingwen Wang, Hao Xu
PDF
FeatER: An Efficient Network for Human Reconstruction via Feature mAP-Based TransformER Ce Zheng, Matias Mendieta, Taojiannan Yang, Guo-Jun Qi, Chen Chen
PDF
Feature Aggregated Queries for Transformer-Based Video Object Detectors Yiming Cui
PDF
Feature Alignment and Uniformity for Test Time Adaptation Shuai Wang, Daoan Zhang, Zipei Yan, Jianguo Zhang, Rui Li
PDF
Feature Representation Learning with Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition Zhijun Zhai, Jianhui Zhao, Chengjiang Long, Wenju Xu, Shuangjiang He, Huijuan Zhao
PDF
Feature Separation and Recalibration for Adversarial Robustness Woo Jae Kim, Yoonki Cho, Junsik Jung, Sung-Eui Yoon
PDF
Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers Zhou Huang, Hang Dai, Tian-Zhu Xiang, Shuo Wang, Huai-Xin Chen, Jie Qin, Huan Xiong
PDF
FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang, Zeyu Liu, Yu Hu, Wei Xi, Wenxian Yu, Danping Zou
PDF
FedDM: Iterative Distribution Matching for Communication-Efficient Federated Learning Yuanhao Xiong, Ruochen Wang, Minhao Cheng, Felix Yu, Cho-Jui Hsieh
PDF
Federated Domain Generalization with Generalization Adjustment Ruipeng Zhang, Qinwei Xu, Jiangchao Yao, Ya Zhang, Qi Tian, Yanfeng Wang
PDF
Federated Incremental Semantic Segmentation Jiahua Dong, Duzhen Zhang, Yang Cong, Wei Cong, Henghui Ding, Dengxin Dai
PDF
Federated Learning with Data-Agnostic Distribution Fusion Jian-hui Duan, Wenzhong Li, Derun Zou, Ruichen Li, Sanglu Lu
PDF
FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation Jiaxu Miao, Zongxin Yang, Leilei Fan, Yi Yang
PDF
FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-Tail Trajectory Prediction Yuning Wang, Pu Zhang, Lei Bai, Jianru Xue
PDF
Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation Linglan Zhao, Jing Lu, Yunlu Xu, Zhanzhan Cheng, Dashan Guo, Yi Niu, Xiangzhong Fang
PDF
Few-Shot Geometry-Aware Keypoint Localization Xingzhe He, Gaurav Bharaj, David Ferman, Helge Rhodin, Pablo Garrido
PDF
Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment Runqi Wang, Hao Zheng, Xiaoyue Duan, Jianzhuang Liu, Yuning Lu, Tian Wang, Songcen Xu, Baochang Zhang
PDF
Few-Shot Non-Line-of-Sight Imaging with Signal-Surface Collaborative Regularization Xintong Liu, Jianyu Wang, Leping Xiao, Xing Fu, Lingyun Qiu, Zuoqiang Shi
PDF
Few-Shot Referring Relationships in Videos Yogesh Kumar, Anand Mishra
PDF
Few-Shot Semantic Image Synthesis with Class Affinity Transfer Marlène Careil, Jakob Verbeek, Stéphane Lathuilière
PDF
FFCV: Accelerating Training by Removing Data Bottlenecks Guillaume Leclerc, Andrew Ilyas, Logan Engstrom, Sung Min Park, Hadi Salman, Aleksander Mądry
PDF
FFF: Fragment-Guided Flexible Fitting for Building Complete Protein Structures Weijie Chen, Xinyan Wang, Yuhang Wang
PDF
FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction Haoran Bai, Di Kang, Haoxian Zhang, Jinshan Pan, Linchao Bao
PDF
FIANCEE: Faster Inference of Adversarial Networks via Conditional Early Exits Polina Karpikova, Ekaterina Radionova, Anastasia Yaschenko, Andrei Spiridonov, Leonid Kostyushko, Riccardo Fabbricatore, Aleksei Ivakhnenko
PDF
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training Filip Radenovic, Abhimanyu Dubey, Abhishek Kadian, Todor Mihaylov, Simon Vandenhende, Yash Patel, Yi Wen, Vignesh Ramanathan, Dhruv Mahajan
PDF
Finding Geometric Models by Clustering in the Consensus Space Daniel Barath, Denys Rozumnyi, Ivan Eichhardt, Levente Hajder, Jiri Matas
PDF
Fine-Grained Audible Video Description Xuyang Shen, Dong Li, Jinxing Zhou, Zhen Qin, Bowen He, Xiaodong Han, Aixuan Li, Yuchao Dai, Lingpeng Kong, Meng Wang, Yu Qiao, Yiran Zhong
PDF
Fine-Grained Classification with Noisy Labels Qi Wei, Lei Feng, Haoliang Sun, Ren Wang, Chenhui Guo, Yilong Yin
PDF
Fine-Grained Face Swapping via Regional GAN Inversion Zhian Liu, Maomao Li, Yong Zhang, Cairong Wang, Qi Zhang, Jue Wang, Yongwei Nie
PDF
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network Zhengxin Pan, Fangyu Wu, Bailing Zhang
PDF
Fine-Tuned CLIP Models Are Efficient Video Learners Hanoona Rasheed, Muhammad Uzair Khattak, Muhammad Maaz, Salman Khan, Fahad Shahbaz Khan
PDF
Finetune like You Pretrain: Improved Finetuning of Zero-Shot Vision Models Sachin Goyal, Ananya Kumar, Sankalp Garg, Zico Kolter, Aditi Raghunathan
PDF
FitMe: Deep Photorealistic 3D Morphable Model Avatars Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Baris Gecer, Jiankang Deng, Stefanos Zafeiriou
PDF
Fix the Noise: Disentangling Source Feature for Controllable Domain Translation Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Jaejun Yoo, Junmo Kim
PDF
FJMP: Factorized Joint Multi-Agent Motion Prediction over Learned Directed Acyclic Interaction Graphs Luke Rowe, Martin Ethier, Eli-Henry Dykhne, Krzysztof Czarnecki
PDF
FLAG3D: A 3D Fitness Activity Dataset with Language Instruction Yansong Tang, Jinpeng Liu, Aoyang Liu, Bin Yang, Wenxun Dai, Yongming Rao, Jiwen Lu, Jie Zhou, Xiu Li
PDF
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer Zhijian Liu, Xinyu Yang, Haotian Tang, Shang Yang, Song Han
PDF
FLEX: Full-Body Grasping Without Full-Body Grasps Purva Tendulkar, Dídac Surís, Carl Vondrick
PDF
Flexible-Cm GAN: Towards Precise 3D Dose Prediction in Radiotherapy Riqiang Gao, Bin Lou, Zhoubing Xu, Dorin Comaniciu, Ali Kamen
PDF
FlexiViT: One Model for All Patch Sizes Lucas Beyer, Pavel Izmailov, Alexander Kolesnikov, Mathilde Caron, Simon Kornblith, Xiaohua Zhai, Matthias Minderer, Michael Tschannen, Ibrahim Alabdulmohsin, Filip Pavetic
PDF
FlexNeRF: Photorealistic Free-Viewpoint Rendering of Moving Humans from Sparse Views Vinoj Jayasundara, Amit Agrawal, Nicolas Heron, Abhinav Shrivastava, Larry S. Davis
PDF
Flow Supervision for Deformable NeRF Chaoyang Wang, Lachlan Ewen MacDonald, László A. Jeni, Simon Lucey
PDF
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
PDF
FlowGrad: Controlling the Output of Generative ODEs with Gradients Xingchao Liu, Lemeng Wu, Shujian Zhang, Chengyue Gong, Wei Ping, Qiang Liu
PDF
Focus on Details: Online Multi-Object Tracking with Diverse Fine-Grained Representation Hao Ren, Shoudong Han, Huilin Ding, Ziwen Zhang, Hongwei Wang, Faquan Wang
PDF
Focused and Collaborative Feedback Integration for Interactive Image Segmentation Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong
PDF
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation Chaohui Yu, Qiang Zhou, Jingliang Li, Jianlong Yuan, Zhibin Wang, Fan Wang
PDF
Four-View Geometry with Unknown Radial Distortion Petr Hruby, Viktor Korotynskiy, Timothy Duff, Luke Oeding, Marc Pollefeys, Tomas Pajdla, Viktor Larsson
PDF
Frame Flexible Network Yitian Zhang, Yue Bai, Chang Liu, Huan Wang, Sheng Li, Yun Fu
PDF
Frame Interpolation Transformer and Uncertainty Guidance Markus Plack, Karlis Martins Briedis, Abdelaziz Djelouah, Matthias B. Hullin, Markus Gross, Christopher Schroers
PDF
Frame-Event Alignment and Fusion Network for High Frame Rate Tracking Jiqing Zhang, Yuanchen Wang, Wenxi Liu, Meng Li, Jinpeng Bai, Baocai Yin, Xin Yang
PDF
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson Cothren, Khoa Luu
PDF
FreeNeRF: Improving Few-Shot Neural Rendering with Free Frequency Regularization Jiawei Yang, Marco Pavone, Yue Wang
PDF
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation Jie Qin, Jie Wu, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang
PDF
Freestyle Layout-to-Image Synthesis Han Xue, Zhiwu Huang, Qianru Sun, Li Song, Wenjun Zhang
PDF
Frequency-Modulated Point Cloud Rendering with Easy Editing Yi Zhang, Xiaoyang Huang, Bingbing Ni, Teng Li, Wenjun Zhang
PDF
Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection Tomoki Ichikawa, Yoshiki Fukao, Shohei Nobuhara, Ko Nishino
PDF
From Images to Textual Prompts: Zero-Shot Visual Question Answering with Frozen Large Language Models Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li, Dacheng Tao, Steven Hoi
PDF
From Node Interaction to Hop Interaction: New Effective and Scalable Graph Learning Paradigm Jie Chen, Zilong Li, Yin Zhu, Junping Zhang, Jian Pu
PDF
Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning Qiang He, Huangyuan Su, Jieyu Zhang, Xinwen Hou
PDF
FrustumFormer: Adaptive Instance-Aware Resampling for Multi-View 3D Detection Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang
PDF
Full or Weak Annotations? an Adaptive Strategy for Budget-Constrained Annotation Campaigns Javier Gamazo Tejero, Martin S. Zinkernagel, Sebastian Wolf, Raphael Sznitman, Pablo Márquez-Neila
PDF
Fully Self-Supervised Depth Estimation from Defocus Clue Haozhe Si, Bin Zhao, Dong Wang, Yunpeng Gao, Mulin Chen, Zhigang Wang, Xuelong Li
PDF
Fusing Pre-Trained Language Models with Multimodal Prompts Through Reinforcement Learning Youngjae Yu, Jiwan Chung, Heeseung Yun, Jack Hessel, Jae Sung Park, Ximing Lu, Rowan Zellers, Prithviraj Ammanabrolu, Ronan Le Bras, Gunhee Kim, Yejin Choi
PDF
Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation Pengchong Qiao, Zhidan Wei, Yu Wang, Zhennan Wang, Guoli Song, Fan Xu, Xiangyang Ji, Chang Liu, Jie Chen
PDF
G-MSM: Unsupervised Multi-Shape Matching with Graph-Based Affinity Priors Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Daniel Cremers
PDF
GaitGCI: Generative Counterfactual Intervention for Gait Recognition Huanzhang Dou, Pengyi Zhang, Wei Su, Yunlong Yu, Yining Lin, Xi Li
PDF
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-per-Second Vincent-Pierre Berges, Andrew Szot, Devendra Singh Chaplot, Aaron Gokaslan, Roozbeh Mottaghi, Dhruv Batra, Eric Undersander
PDF
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis Ming Tao, Bing-Kun Bao, Hao Tang, Changsheng Xu
PDF
GamutMLP: A Lightweight MLP for Color Loss Recovery Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown
PDF
GANHead: Towards Generative Animatable Neural Head Avatars Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai
PDF
GANmouflage: 3D Object Nondetection with Texture Fields Rui Guo, Jasmine Collins, Oscar de Lima, Andrew Owens
PDF
GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts Haoran Geng, Helin Xu, Chengyang Zhao, Chao Xu, Li Yi, Siyuan Huang, He Wang
PDF
GarmentTracking: Category-Level Garment Pose Tracking Han Xue, Wenqiang Xu, Jieyi Zhang, Tutian Tang, Yutong Li, Wenxin Du, Ruolin Ye, Cewu Lu
PDF
Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan
PDF
Gated Stereo: Joint Depth Estimation from Gated and Wide-Baseline Active Stereo Cues Stefanie Walz, Mario Bijelic, Andrea Ramazzina, Amanpreet Walia, Fahim Mannan, Felix Heide
PDF
Gaussian Label Distribution Learning for Spherical Image Object Detection Hang Xu, Xinyuan Liu, Qiang Zhao, Yike Ma, Chenggang Yan, Feng Dai
PDF
Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention Sounak Mondal, Zhibo Yang, Seoyoung Ahn, Dimitris Samaras, Gregory Zelinsky, Minh Hoai
PDF
GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, Shalini De Mello, Hyung Jin Chang, Xucong Zhang, Otmar Hilliges
PDF
GCFAgg: Global and Cross-View Feature Aggregation for Multi-View Clustering Weiqing Yan, Yuanyang Zhang, Chenlei Lv, Chang Tang, Guanghui Yue, Liang Liao, Weisi Lin
PDF
GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds Honghui Yang, Tong He, Jiaheng Liu, Hua Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wanli Ouyang
PDF
GEN: Pushing the Limits of SoftMax-Based Out-of-Distribution Detection Xixi Liu, Yaroslava Lochman, Christopher Zach
PDF
GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze, Nicolas Carion, Ishan Misra
PDF
Generalist: Decoupling Natural and Robust Generalization Hongjun Wang, Yisen Wang
PDF
Generalizable Implicit Neural Representations via Instance Pattern Composers Chiheon Kim, Doyup Lee, Saehoon Kim, Minsu Cho, Wook-Shin Han
PDF
Generalizable Local Feature Pre-Training for Deformable Shape Analysis Souhaib Attaiki, Lei Li, Maks Ovsjanikov
PDF
Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation Tianli Zhang, Mengqi Xue, Jiangtao Zhang, Haofei Zhang, Yu Wang, Lechao Cheng, Jie Song, Mingli Song
PDF
Generalized Decoding for Pixel, Image, and Language Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao
PDF
Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process Yuhan Li, Yishun Dou, Xuanhong Chen, Bingbing Ni, Yilin Sun, Yutian Liu, Fuzhen Wang
PDF
Generalized Relation Modeling for Transformer Tracking Shenyuan Gao, Chunluan Zhou, Jun Zhang
PDF
Generalized UAV Object Detection via Frequency Domain Disentanglement Kunyu Wang, Xueyang Fu, Yukun Huang, Chengzhi Cao, Gege Shi, Zheng-Jun Zha
PDF
Generalizing Dataset Distillation via Deep Generative Prior George Cazenavette, Tongzhou Wang, Antonio Torralba, Alexei A. Efros, Jun-Yan Zhu
PDF
Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Jinwei Gu, Chen Change Loy
PDF
Generating Anomalies for Video Anomaly Detection with Prompt-Based Feature Mapping Zuhao Liu, Xiao-Ming Wu, Dian Zheng, Kun-Yu Lin, Wei-Shi Zheng
PDF
Generating Features with Increased Crop-Related Diversity for Few-Shot Object Detection Jingyi Xu, Hieu Le, Dimitris Samaras
PDF
Generating Holistic 3D Human Motion from Speech Hongwei Yi, Hualin Liang, Yifei Liu, Qiong Cao, Yandong Wen, Timo Bolkart, Dacheng Tao, Michael J. Black
PDF
Generating Human Motion from Textual Descriptions with Discrete Representations Jianrong Zhang, Yangsong Zhang, Xiaodong Cun, Yong Zhang, Hongwei Zhao, Hongtao Lu, Xi Shen, Ying Shan
PDF
Generating Part-Aware Editable 3D Shapes Without 3D Supervision Konstantinos Tertikas, Despoina Paschalidou, Boxiao Pan, Jeong Joon Park, Mikaela Angelina Uy, Ioannis Emiris, Yannis Avrithis, Leonidas Guibas
PDF
Generative Bias for Robust Visual Question Answering Jae Won Cho, Dong-Jin Kim, Hyeonggon Ryu, In So Kweon
PDF
Generative Diffusion Prior for Unified Image Restoration and Enhancement Ben Fei, Zhaoyang Lyu, Liang Pan, Junzhe Zhang, Weidong Yang, Tianyue Luo, Bo Zhang, Bo Dai
PDF
Generative Semantic Segmentation Jiaqi Chen, Jiachen Lu, Xiatian Zhu, Li Zhang
PDF
Generic-to-Specific Distillation of Masked Autoencoders Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye
PDF
Genie: Show Me the Data for Quantization Yongkweon Jeon, Chungman Lee, Ho-young Kim
PDF
GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction Chuwei Luo, Changxu Cheng, Qi Zheng, Cong Yao
PDF
GeoMAE: Masked Geometric Target Prediction for Self-Supervised Point Cloud Pre-Training Xiaoyu Tian, Haoxi Ran, Yue Wang, Hang Zhao
PDF
Geometric Visual Similarity Learning in 3D Medical Image Self-Supervised Pre-Training Yuting He, Guanyu Yang, Rongjun Ge, Yang Chen, Jean-Louis Coatrieux, Boyu Wang, Shuo Li
PDF
Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation Yuwei Yang, Munawar Hayat, Zhao Jin, Chao Ren, Yinjie Lei
PDF
GeoMVSNet: Learning Multi-View Stereo with Geometry Perception Zhe Zhang, Rui Peng, Yuxi Hu, Ronggang Wang
PDF
GeoNet: Benchmarking Unsupervised Adaptation Across Geographies Tarun Kalluri, Wangdong Xu, Manmohan Chandraker
PDF
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation Jingyang Huo, Qiang Sun, Boyan Jiang, Haitao Lin, Yanwei Fu
PDF
GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments Zhengxi Hu, Yuxue Yang, Xiaolin Zhai, Dingye Yang, Bohan Zhou, Jingtai Liu
PDF
GFPose: Learning 3D Human Pose Prior with Gradient Fields Hai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang
PDF
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild Bokui Shen, Xinchen Yan, Charles R. Qi, Mahyar Najibi, Boyang Deng, Leonidas Guibas, Yin Zhou, Dragomir Anguelov
PDF
GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods Da Yin, Feng Gao, Govind Thattai, Michael Johnston, Kai-Wei Chang
PDF
GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental Task Huiping Zhuang, Zhenyu Weng, Run He, Zhiping Lin, Ziqian Zeng
PDF
GlassesGAN: Eyewear Personalization Using Synthetic Appearance Discovery and Targeted Subspace Modeling Richard Plesh, Peter Peer, Vitomir Struc
PDF
GLeaD: Improving GANs with a Generator-Leading Task Qingyan Bai, Ceyuan Yang, Yinghao Xu, Xihui Liu, Yujiu Yang, Yujun Shen
PDF
GLIGEN: Open-Set Grounded Text-to-Image Generation Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee
PDF
Global and Local Mixture Consistency Cumulative Learning for Long-Tailed Visual Recognitions Fei Du, Peng Yang, Qi Jia, Fengtao Nan, Xiaoting Chen, Yun Yang
PDF
Global Vision Transformer Pruning with Hessian-Aware Saliency Huanrui Yang, Hongxu Yin, Maying Shen, Pavlo Molchanov, Hai Li, Jan Kautz
PDF
Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation Xiaolong Shen, Zongxin Yang, Xiaohan Wang, Jianxin Ma, Chang Zhou, Yi Yang
PDF
Glocal Energy-Based Learning for Few-Shot Open-Set Recognition Haoyu Wang, Guansong Pang, Peng Wang, Lei Zhang, Wei Wei, Yanning Zhang
PDF
Gloss Attention for Gloss-Free Sign Language Translation Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao
PDF
GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields from Multi-View Images Jianchuan Chen, Wentao Yi, Liqian Ma, Xu Jia, Huchuan Lu
PDF
Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification Zhengwei Yang, Meng Lin, Xian Zhong, Yu Wu, Zheng Wang
PDF
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning Zhenyu Xie, Zaiyu Huang, Xin Dong, Fuwei Zhao, Haoye Dong, Xijin Zhang, Feida Zhu, Xiaodan Liang
PDF
Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions Yun He, Danhang Tang, Yinda Zhang, Xiangyang Xue, Yanwei Fu
PDF
GradICON: Approximate Diffeomorphisms via Gradient Inverse Consistency Lin Tian, Hastings Greer, François-Xavier Vialard, Roland Kwitt, Raúl San José Estépar, Richard Jarrett Rushmore, Nikolaos Makris, Sylvain Bouix, Marc Niethammer
PDF
Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization Xingxuan Zhang, Renzhe Xu, Han Yu, Hao Zou, Peng Cui
PDF
Gradient-Based Uncertainty Attribution for Explainable Bayesian Deep Learning Hanjing Wang, Dhiraj Joshi, Shiqiang Wang, Qiang Ji
PDF
GradMA: A Gradient-Memory-Based Accelerated Federated Learning with Alleviated Catastrophic Forgetting Kangyang Luo, Xiang Li, Yunshi Lan, Ming Gao
PDF
Graph Representation for Order-Aware Visual Transformation Yue Qiu, Yanjun Sun, Fumiya Matsuzawa, Kenji Iwata, Hirokatsu Kataoka
PDF
Graph Transformer GANs for Graph-Constrained House Generation Hao Tang, Zhenyu Zhang, Humphrey Shi, Bo Li, Ling Shao, Nicu Sebe, Radu Timofte, Luc Van Gool
PDF
Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D Images Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zhaoxiang Zhang, Zhen Lei
PDF
GraVoS: Voxel Selection for 3D Point-Cloud Detection Oren Shrout, Yizhak Ben-Shabat, Ayellet Tal
PDF
GRES: Generalized Referring Expression Segmentation Chang Liu, Henghui Ding, Xudong Jiang
PDF
Grid-Guided Neural Radiance Fields for Large Urban Scenes Linning Xu, Yuanbo Xiangli, Sida Peng, Xingang Pan, Nanxuan Zhao, Christian Theobalt, Bo Dai, Dahua Lin
PDF
Ground-Truth Free Meta-Learning for Deep Compressive Sampling Xinran Qin, Yuhui Quan, Tongyao Pang, Hui Ji
PDF
Grounding Counterfactual Explanation of Image Classifiers to Textual Concept Space Siwon Kim, Jinoh Oh, Sungjin Lee, Seunghak Yu, Jaeyoung Do, Tara Taghavi
PDF
GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds Zihui Zhang, Bo Yang, Bing Wang, Bo Li
PDF
gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction Zerui Chen, Shizhe Chen, Cordelia Schmid, Ivan Laptev
PDF
Guided Depth Super-Resolution by Deep Anisotropic Diffusion Nando Metzger, Rodrigo Caye Daudt, Konrad Schindler
PDF
Guided Recommendation for Model Fine-Tuning Hao Li, Charless Fowlkes, Hao Yang, Onkar Dabeer, Zhuowen Tu, Stefano Soatto
PDF
Guiding Pseudo-Labels with Uncertainty Estimation for Source-Free Unsupervised Domain Adaptation Mattia Litrico, Alessio Del Bue, Pietro Morerio
PDF
H2ONet: Hand-Occlusion-and-Orientation-Aware Network for Real-Time 3D Hand Mesh Reconstruction Hao Xu, Tianyu Wang, Xiao Tang, Chi-Wing Fu
PDF
HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning Chia-Wen Kuo, Zsolt Kira
PDF
Habitat-Matterport 3D Semantics Dataset Karmesh Yadav, Ram Ramrakhya, Santhosh Kumar Ramakrishnan, Theo Gervet, John Turner, Aaron Gokaslan, Noah Maestre, Angel Xuan Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot
PDF
HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling Yujian Zheng, Zirong Jin, Moran Li, Haibin Huang, Chongyang Ma, Shuguang Cui, Xiaoguang Han
PDF
HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions Anshul Shah, Aniket Roy, Ketul Shah, Shlok Mishra, David Jacobs, Anoop Cherian, Rama Chellappa
PDF
Ham2Pose: Animating Sign Language Notation into Pose Sequences Rotem Shalev Arkushin, Amit Moryossef, Ohad Fried
PDF
Hand Avatar: Free-Pose Hand Animation and Rendering from Monocular Video Xingyu Chen, Baoyuan Wang, Heung-Yeung Shum
PDF
HandNeRF: Neural Radiance Fields for Animatable Interacting Hands Zhiyang Guo, Wengang Zhou, Min Wang, Li Li, Houqiang Li
PDF
HandsOff: Labeled Dataset Generation with No Additional Human Annotations Austin Xu, Mariya I. Vasileva, Achal Dave, Arjun Seshadri
PDF
Handwritten Text Generation from Visual Archetypes Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara
PDF
Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model Rolandos Alexandros Potamias, Stylianos Ploumpis, Stylianos Moschoglou, Vasileios Triantafyllou, Stefanos Zafeiriou
PDF
Hard Patches Mining for Masked Image Modeling Haochen Wang, Kaiyou Song, Junsong Fan, Yuxi Wang, Jin Xie, Zhaoxiang Zhang
PDF
Hard Sample Matters a Lot in Zero-Shot Quantization Huantong Li, Xiangmiao Wu, Fanbing Lv, Daihai Liao, Thomas H. Li, Yonggang Zhang, Bo Han, Mingkui Tan
PDF
Harmonious Feature Learning for Interactive Hand-Object Pose Estimation Zhifeng Lin, Changxing Ding, Huan Yao, Zengsheng Kuang, Shaoli Huang
PDF
Harmonious Teacher for Cross-Domain Object Detection Jinhong Deng, Dongli Xu, Wen Li, Lixin Duan
PDF
HARP: Personalized Hand Reconstruction from a Monocular RGB Video Korrawe Karunratanakul, Sergey Prokudin, Otmar Hilliges, Siyu Tang
PDF
HDR Imaging with Spatially Varying Signal-to-Noise Ratios Yiheng Chi, Xingguang Zhang, Stanley H. Chan
PDF
Heat Diffusion Based Multi-Scale and Geometric Structure-Aware Transformer for Mesh Segmentation Chi-Chong Wong
PDF
HelixSurf: A Robust and Efficient Neural Implicit Surface Learning of Indoor Scenes with Iterative Intertwined Regularization Zhihao Liang, Zhangjin Huang, Changxing Ding, Kui Jia
PDF
Heterogeneous Continual Learning Divyam Madaan, Hongxu Yin, Wonmin Byeon, Jan Kautz, Pavlo Molchanov
PDF
HexPlane: A Fast Representation for Dynamic Scenes Ang Cao, Justin Johnson
PDF
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation Jian Ding, Nan Xue, Gui-Song Xia, Bernt Schiele, Dengxin Dai
PDF
HGNet: Learning Hierarchical Geometry from Points, Edges, and Surfaces Ting Yao, Yehao Li, Yingwei Pan, Tao Mei
PDF
Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image Ensemble Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani
PDF
Hi4D: 4D Instance Segmentation of Close Human Interaction Yifei Yin, Chen Guo, Manuel Kaufmann, Juan Jose Zarate, Jie Song, Otmar Hilliges
PDF
Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision Fangqiang Ding, Andras Palffy, Dariu M. Gavrila, Chris Xiaoxuan Lu
PDF
HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization Sungyeon Kim, Boseung Jeong, Suha Kwak
PDF
Hierarchical B-Frame Video Coding Using Two-Layer CANF Without Motion Coding David Alexandre, Hsueh-Ming Hang, Wen-Hsiao Peng
PDF
Hierarchical Dense Correlation Distillation for Few-Shot Segmentation Bohao Peng, Zhuotao Tian, Xiaoyang Wu, Chengyao Wang, Shu Liu, Jingyong Su, Jiaya Jia
PDF
Hierarchical Discriminative Learning Improves Visual Representations of Biomedical Microscopy Cheng Jiang, Xinhai Hou, Akhil Kondepudi, Asadur Chowdury, Christian W. Freudiger, Daniel A. Orringer, Honglak Lee, Todd C. Hollon
PDF
Hierarchical Fine-Grained Image Forgery Detection and Localization Xiao Guo, Xiaohong Liu, Zhiyuan Ren, Steven Grosz, Iacopo Masi, Xiaoming Liu
PDF
Hierarchical Neural Memory Network for Low Latency Event Processing Ryuhei Hamaguchi, Yasutaka Furukawa, Masaki Onishi, Ken Sakurada
PDF
Hierarchical Prompt Learning for Multi-Task Learning Yajing Liu, Yuning Lu, Hao Liu, Yaozu An, Zhuoran Xu, Zhuokun Yao, Baofeng Zhang, Zhiwei Xiong, Chenguang Gui
PDF
Hierarchical Semantic Contrast for Scene-Aware Video Anomaly Detection Shengyang Sun, Xiaojin Gong
PDF
Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding Chaolei Tan, Zihang Lin, Jian-Fang Hu, Wei-Shi Zheng, Jianhuang Lai
PDF
Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection Chuandong Liu, Chenqiang Gao, Fangcen Liu, Pengcheng Li, Deyu Meng, Xinbo Gao
PDF
Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos Yilin Wen, Hao Pan, Lei Yang, Jia Pan, Taku Komura, Wenping Wang
PDF
Hierarchical Video-Moment Retrieval and Step-Captioning Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal
PDF
HierVL: Learning Hierarchical Video-Language Embeddings Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman
PDF
High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition Tianyu Luan, Yuanhao Zhai, Jingjing Meng, Zhong Li, Zhang Chen, Yi Xu, Junsong Yuan
PDF
High-Fidelity 3D Face Generation from Natural Language Descriptions Menghua Wu, Hao Zhu, Linjia Huang, Yiyu Zhuang, Yuanxun Lu, Xun Cao
PDF
High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization Jiaxin Xie, Hao Ouyang, Jingtan Piao, Chenyang Lei, Qifeng Chen
PDF
High-Fidelity 3D Human Digitization from Single 2k Resolution Images Sang-Hun Han, Min-Gyu Park, Ju Hong Yoon, Ju-Mi Kang, Young-Jae Park, Hae-Gon Jeon
PDF
High-Fidelity and Freely Controllable Talking Head Video Generation Yue Gao, Yuan Zhou, Jinglu Wang, Xiao Li, Xiang Ming, Yan Lu
PDF
High-Fidelity Clothed Avatar Reconstruction from a Single Image Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei
PDF
High-Fidelity Event-Radiance Recovery via Transient Event Frequency Jin Han, Yuta Asano, Boxin Shi, Yinqiang Zheng, Imari Sato
PDF
High-Fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors Yunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan
PDF
High-Fidelity Generalized Emotional Talking Face Generation with Multi-Modal Emotion Space Learning Chao Xu, Junwei Zhu, Jiangning Zhang, Yue Han, Wenqing Chu, Ying Tai, Chengjie Wang, Zhifeng Xie, Yong Liu
PDF
High-Fidelity Guided Image Synthesis with Latent Diffusion Models Jaskirat Singh, Stephen Gould, Liang Zheng
PDF
High-Frequency Stereo Matching Network Haoliang Zhao, Huizhou Zhou, Yongjun Zhang, Jie Chen, Yitong Yang, Yong Zhao
PDF
High-Res Facial Appearance Capture from Polarized Smartphone Images Dejan Azinović, Olivier Maury, Christophe Hery, Matthias Nießner, Justus Thies
PDF
High-Resolution Image Reconstruction with Latent Diffusion Models from Human Brain Activity Yu Takagi, Shinji Nishimoto
PDF
Highly Confident Local Structure Based Consensus Graph Learning for Incomplete Multi-View Clustering Jie Wen, Chengliang Liu, Gehui Xu, Zhihao Wu, Chao Huang, Lunke Fei, Yong Xu
PDF
Hint-Aug: Drawing Hints from Foundation Vision Transformers Towards Boosted Few-Shot Parameter-Efficient Tuning Zhongzhi Yu, Shang Wu, Yonggan Fu, Shunyao Zhang, Yingyan Lin
PDF
Histopathology Whole Slide Image Analysis with Heterogeneous Graph Representation Learning Tsai Hor Chan, Fernando Julio Cendra, Lan Ma, Guosheng Yin, Lequan Yu
PDF
HNeRV: A Hybrid Neural Representation for Videos Hao Chen, Matthew Gwilliam, Ser-Nam Lim, Abhinav Shrivastava
PDF
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models Shan Ning, Longtian Qiu, Yongfei Liu, Xuming He
PDF
HOLODIFFUSION: Training a 3D Diffusion Model Using 2D Images Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy J. Mitra
PDF
HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics Artur Grigorev, Michael J. Black, Otmar Hilliges
PDF
HOTNAS: Hierarchical Optimal Transport for Neural Architecture Search Jiechao Yang, Yong Liu, Hongteng Xu
PDF
HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising Mohammad Amin Shabani, Sepidehsadat Hosseini, Yasutaka Furukawa
PDF
How Can Objects Help Action Recognition? Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid
PDF
How to Backdoor Diffusion Models? Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho
PDF
How to Prevent the Continuous Damage of Noises to Model Training? Xiaotian Yu, Yang Jiang, Tianqi Shi, Zunlei Feng, Yuexuan Wang, Mingli Song, Li Sun
PDF
How to Prevent the Poor Performance Clients for Personalized Federated Learning? Zhe Qu, Xingyu Li, Xiao Han, Rui Duan, Chengchao Shen, Lixing Chen
PDF
How You Feelin'? Learning Emotions and Mental States in Movie Scenes Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi
PDF
HRDFuse: Monocular 360deg Depth Estimation by Collaboratively Learning Holistic-with-Regional Depth Distributions Hao Ai, Zidong Cao, Yan-Pei Cao, Ying Shan, Lin Wang
PDF
HS-Pose: Hybrid Scope Feature Extraction for Category-Level Object Pose Estimation Linfang Zheng, Chen Wang, Yinghan Sun, Esha Dasgupta, Hua Chen, Aleš Leonardis, Wei Zhang, Hyung Jin Chang
PDF
Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-Shot Learning with Hyperspherical Embeddings Daniel J. Trosten, Rwiddhi Chakraborty, Sigurd Løkse, Kristoffer Knutsen Wickstrøm, Robert Jenssen, Michael C. Kampffmeyer
PDF
Human Body Shape Completion with Implicit Shape and Flow Learning Boyao Zhou, Di Meng, Jean-Sébastien Franco, Edmond Boyer
PDF
Human Guided Ground-Truth Generation for Realistic Image Super-Resolution Du Chen, Jie Liang, Xindong Zhang, Ming Liu, Hui Zeng, Lei Zhang
PDF
Human Pose as Compositional Tokens Zigang Geng, Chunyu Wang, Yixuan Wei, Ze Liu, Houqiang Li, Han Hu
PDF
Human Pose Estimation in Extremely Low-Light Conditions Sohyun Lee, Jaesung Rim, Boseung Jeong, Geonu Kim, Byungju Woo, Haechan Lee, Sunghyun Cho, Suha Kwak
PDF
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes Xuan Ju, Ailing Zeng, Jianan Wang, Qiang Xu, Lei Zhang
PDF
HumanBench: Towards General Human-Centric Perception with Projector Assisted Pretraining Shixiang Tang, Cheng Chen, Qingsong Xie, Meilin Chen, Yizhou Wang, Yuanzheng Ci, Lei Bai, Feng Zhu, Haiyang Yang, Li Yi, Rui Zhao, Wanli Ouyang
PDF
HumanGen: Generating Human Radiance Fields with Explicit Priors Suyi Jiang, Haoran Jiang, Ziyu Wang, Haimin Luo, Wenzheng Chen, Lan Xu
PDF
HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation Akash Sengupta, Ignas Budvytis, Roberto Cipolla
PDF
Humans as Light Bulbs: 3D Human Reconstruction from Thermal Reflection Ruoshi Liu, Carl Vondrick
PDF
Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation Xiaoyang Wang, Bingfeng Zhang, Limin Yu, Jimin Xiao
PDF
Hybrid Active Learning via Deep Clustering for Video Action Detection Aayush J. Rana, Yogesh S. Rawat
PDF
Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur Peng Dai, Yinda Zhang, Xin Yu, Xiaoyang Lyu, Xiaojuan Qi
PDF
Hyperbolic Contrastive Learning for Visual Representations Beyond Objects Songwei Ge, Shlok Mishra, Simon Kornblith, Chun-Liang Li, David Jacobs
PDF
HyperCUT: Video Sequence from a Single Blurry Image Using Unsupervised Ordering Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai
PDF
HyperMatch: Noise-Tolerant Semi-Supervised Learning via Relaxed Contrastive Constraint Beitong Zhou, Jing Lu, Kerui Liu, Yunlu Xu, Zhanzhan Cheng, Yi Niu
PDF
HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling Benjamin Attal, Jia-Bin Huang, Christian Richardt, Michael Zollhöfer, Johannes Kopf, Matthew O’Toole, Changil Kim
PDF
Hyperspherical Embedding for Point Cloud Completion Junming Zhang, Haomeng Zhang, Ram Vasudevan, Matthew Johnson-Roberson
PDF
HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic Fusion Sijie Wang, Qiyu Kang, Rui She, Wei Wang, Kai Zhao, Yang Song, Wee Peng Tay
PDF
I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang
PDF
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification Muhammad Ferjad Naeem, Muhammad Gul Zain Ali Khan, Yongqin Xian, Muhammad Zeshan Afzal, Didier Stricker, Luc Van Gool, Federico Tombari
PDF
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-Training for Visual Recognition Yixuan Wei, Yue Cao, Zheng Zhang, Houwen Peng, Zhuliang Yao, Zhenda Xie, Han Hu, Baining Guo
PDF
Identity-Preserving Talking Face Generation with Landmark and Appearance Priors Weizhi Zhong, Chaowei Fang, Yinqi Cai, Pengxu Wei, Gangming Zhao, Liang Lin, Guanbin Li
PDF
IDGI: A Framework to Eliminate Explanation Noise from Integrated Gradients Ruo Yang, Binghui Wang, Mustafa Bilgic
PDF
iDisc: Internal Discretization for Monocular Depth Estimation Luigi Piccinelli, Christos Sakaridis, Fisher Yu
PDF
IFSeg: Image-Free Semantic Segmentation via Vision-Language Model Sukmin Yun, Seong Hyeon Park, Paul Hongsuck Seo, Jinwoo Shin
PDF
Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes Jihyun Lee, Minhyuk Sung, Honggyu Choi, Tae-Kyun Kim
PDF
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei
PDF
Image Cropping with Spatial-Aware Feature and Rank Consistency Chao Wang, Li Niu, Bo Zhang, Liqing Zhang
PDF
Image Quality-Aware Diagnosis via Meta-Knowledge Co-Embedding Haoxuan Che, Siyu Chen, Hao Chen
PDF
Image Super-Resolution Using T-Tetromino Pixels Simon Grosche, Andy Regensky, Jürgen Seiler, André Kaup
PDF
ImageBind: One Embedding Space to Bind Them All Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra
PDF
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting Su Wang, Chitwan Saharia, Ceslee Montgomery, Jordi Pont-Tuset, Shai Noy, Stefano Pellegrini, Yasumasa Onoe, Sarah Laszlo, David J. Fleet, Radu Soricut, Jason Baldridge, Mohammad Norouzi, Peter Anderson, William Chan
PDF
ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing Xiaodan Li, Yuefeng Chen, Yao Zhu, Shuhui Wang, Rong Zhang, Hui Xue
PDF
Images Speak in Images: A Generalist Painter for In-Context Visual Learning Xinlong Wang, Wen Wang, Yue Cao, Chunhua Shen, Tiejun Huang
PDF
Imagic: Text-Based Real Image Editing with Diffusion Models Bahjat Kawar, Shiran Zada, Oran Lang, Omer Tov, Huiwen Chang, Tali Dekel, Inbar Mosseri, Michal Irani
PDF
Imitation Learning as State Matching via Differentiable Physics Siwei Chen, Xiao Ma, Zhongwen Xu
PDF
IMP: Iterative Matching and Pose Estimation with Adaptive Pooling Fei Xue, Ignas Budvytis, Roberto Cipolla
PDF
Implicit 3D Human Mesh Recovery Using Consistency with Pose and Shape from Unseen-View Hanbyel Cho, Yooshin Cho, Jaesung Ahn, Junmo Kim
PDF
Implicit Diffusion Models for Continuous Super-Resolution Sicheng Gao, Xuhui Liu, Bohan Zeng, Sheng Xu, Yanjing Li, Xiaoyan Luo, Jianzhuang Liu, Xiantong Zhen, Baochang Zhang
PDF
Implicit Identity Driven Deepfake Face Swapping Detection Baojin Huang, Zhongyuan Wang, Jifan Yang, Jiaxin Ai, Qin Zou, Qian Wang, Dengpan Ye
PDF
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization Shichao Dong, Jin Wang, Renhe Ji, Jiajun Liang, Haoqiang Fan, Zheng Ge
PDF
Implicit Neural Head Synthesis via Controllable Local Deformation Fields Chuhan Chen, Matthew O’Toole, Gaurav Bharaj, Pablo Garrido
PDF
Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving Ben Agro, Quinlan Sykora, Sergio Casas, Raquel Urtasun
PDF
Implicit Surface Contrastive Clustering for LiDAR Point Clouds Zaiwei Zhang, Min Bai, Erran Li
PDF
Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates Avinash Paliwal, Andrii Tsarov, Nima Khademi Kalantari
PDF
Improved Distribution Matching for Dataset Condensation Ganlong Zhao, Guanbin Li, Yipeng Qin, Yizhou Yu
PDF
Improved Test-Time Adaptation for Domain Generalization Liang Chen, Yong Zhang, Yibing Song, Ying Shan, Lingqiao Liu
PDF
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao
PDF
Improving Cross-Modal Retrieval with Set of Diverse Embeddings Dongwon Kim, Namyup Kim, Suha Kwak
PDF
Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues Xingyu Ren, Jiankang Deng, Chao Ma, Yichao Yan, Xiaokang Yang
PDF
Improving Generalization of Meta-Learning with Inverted Regularization at Inner-Level Lianzhe Wang, Shiji Zhou, Shanghang Zhang, Xu Chu, Heng Chang, Wenwu Zhu
PDF
Improving Generalization with Domain Convex Game Fangrui Lv, Jian Liang, Shuang Li, Jinming Zhang, Di Liu
PDF
Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering Nan Zhang, Zhiyi Pan, Thomas H. Li, Wei Gao, Ge Li
PDF
Improving Image Recognition by Retrieving from Web-Scale Image-Text Data Ahmet Iscen, Alireza Fathi, Cordelia Schmid
PDF
Improving Robust Generalization by Direct PAC-Bayesian Bound Minimization Zifan Wang, Nan Ding, Tomer Levinboim, Xi Chen, Radu Soricut
PDF
Improving Robustness of Semantic Segmentation to Motion-Blur Using Class-Centric Augmentation Aakanksha, A. N. Rajagopalan
PDF
Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions Yong Guo, David Stutz, Bernt Schiele
PDF
Improving Selective Visual Question Answering by Learning from Your Peers Corentin Dancette, Spencer Whitehead, Rishabh Maheshwary, Ramakrishna Vedantam, Stefan Scherer, Xinlei Chen, Matthieu Cord, Marcus Rohrbach
PDF
Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling Yongshuai Huang, Ning Lu, Dapeng Chen, Yibo Li, Zecheng Xie, Shenggao Zhu, Liangcai Gao, Wei Peng
PDF
Improving the Transferability of Adversarial Samples by Path-Augmented Method Jianping Zhang, Jen-tse Huang, Wenxuan Wang, Yichen Li, Weibin Wu, Xiaosen Wang, Yuxin Su, Michael R. Lyu
PDF
Improving Vision-and-Language Navigation by Generating Future-View Image Semantics Jialu Li, Mohit Bansal
PDF
Improving Visual Grounding by Encouraging Consistent Gradient-Based Explanations Ziyan Yang, Kushal Kafle, Franck Dernoncourt, Vicente Ordonez
PDF
Improving Visual Representation Learning Through Perceptual Understanding Samyakh Tukra, Frederick Hoffman, Ken Chatfield
PDF
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels Jingqiu Zhou, Linjiang Huang, Liang Wang, Si Liu, Hongsheng Li
PDF
Improving Zero-Shot Generalization and Robustness of Multi-Modal Models Yunhao Ge, Jie Ren, Andrew Gallagher, Yuxiao Wang, Ming-Hsuan Yang, Hartwig Adam, Laurent Itti, Balaji Lakshminarayanan, Jiaping Zhao
PDF
In-Hand 3D Object Scanning from an RGB Sequence Shreyas Hampali, Tomas Hodan, Luan Tran, Lingni Ma, Cem Keskin, Vincent Lepetit
PDF
Incremental 3D Semantic Scene Graph Prediction from RGB Sequences Shun-Cheng Wu, Keisuke Tateno, Nassir Navab, Federico Tombari
PDF
Incrementer: Transformer for Class-Incremental Semantic Segmentation with Knowledge Distillation Focusing on Old Class Chao Shang, Hongliang Li, Fanman Meng, Qingbo Wu, Heqian Qiu, Lanxiao Wang
PDF
Independent Component Alignment for Multi-Task Learning Dmitry Senushkin, Nikolay Patakin, Arseny Kuznetsov, Anton Konushin
PDF
Indescribable Multi-Modal Spatial Evaluator Lingke Kong, X. Sharon Qi, Qijin Shen, Jiacheng Wang, Jingyi Zhang, Yanle Hu, Qichao Zhou
PDF
Indiscernible Object Counting in Underwater Scenes Guolei Sun, Zhaochong An, Yun Liu, Ce Liu, Christos Sakaridis, Deng-Ping Fan, Luc Van Gool
PDF
Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis Yuxiang Wei, Zhilong Ji, Xiaohe Wu, Jinfeng Bai, Lei Zhang, Wangmeng Zuo
PDF
Infinite Photorealistic Worlds Using Procedural Generation Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang, Alejandro Newell, Hei Law, Ankit Goyal, Kaiyu Yang, Jia Deng
PDF
Ingredient-Oriented Multi-Degradation Learning for Image Restoration Jinghao Zhang, Jie Huang, Mingde Yao, Zizheng Yang, Hu Yu, Man Zhou, Feng Zhao
PDF
Initialization Noise in Image Gradients and Saliency Maps Ann-Christin Woerl, Jan Disselhoff, Michael Wand
PDF
Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection Vibashan Vs, Poojan Oza, Vishal M. Patel
PDF
Instance-Aware Domain Generalization for Face Anti-Spoofing Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Ran Yi, Shouhong Ding, Lizhuang Ma
PDF
Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation Zhen Zhao, Sifan Long, Jimin Pi, Jingdong Wang, Luping Zhou
PDF
Instant Domain Augmentation for LiDAR Semantic Segmentation Kwonyoung Ryu, Soonmin Hwang, Jaesik Park
PDF
Instant Multi-View Head Capture Through Learnable Registration Timo Bolkart, Tianye Li, Michael J. Black
PDF
Instant Volumetric Head Avatars Wojciech Zielonka, Timo Bolkart, Justus Thies
PDF
Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions from Monocular RGBD Stream Yuheng Jiang, Kaixin Yao, Zhuo Su, Zhehao Shen, Haimin Luo, Lan Xu
PDF
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds Tianjian Jiang, Xu Chen, Jie Song, Otmar Hilliges
PDF
InstMove: Instance Motion for Object-Centric Video Segmentation Qihao Liu, Junfeng Wu, Yi Jiang, Xiang Bai, Alan L. Yuille, Song Bai
PDF
InstructPix2Pix: Learning to Follow Image Editing Instructions Tim Brooks, Aleksander Holynski, Alexei A. Efros
PDF
Integral Neural Networks Kirill Solodskikh, Azim Kurbanov, Ruslan Aydarkhanov, Irina Zhelavskaya, Yury Parfenov, Dehua Song, Stamatios Lefkimmiatis
PDF
Integrally Pre-Trained Transformer Pyramid Networks Yunjie Tian, Lingxi Xie, Zhaozhi Wang, Longhui Wei, Xiaopeng Zhang, Jianbin Jiao, Yaowei Wang, Qi Tian, Qixiang Ye
PDF
Interactive and Explainable Region-Guided Radiology Report Generation Tim Tanida, Philip Müller, Georgios Kaissis, Daniel Rueckert
PDF
Interactive Cartoonization with Controllable Perceptual Factors Namhyuk Ahn, Patrick Kwon, Jihye Back, Kibeom Hong, Seungkwon Kim
PDF
Interactive Segmentation as Gaussion Process Classification Minghao Zhou, Hong Wang, Qian Zhao, Yuexiang Li, Yawen Huang, Deyu Meng, Yefeng Zheng
PDF
Interactive Segmentation of Radiance Fields Rahul Goel, Dhawal Sirikonda, Saurabh Saini, P. J. Narayanan
PDF
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao
PDF
Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images Tiancheng Lin, Zhimiao Yu, Hongyu Hu, Yi Xu, Chang-Wen Chen
PDF
Intrinsic Physical Concepts Discovery with Object-Centric Predictive Models Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang
PDF
Introducing Competition to Boost the Transferability of Targeted Adversarial Examples Through Clean Feature Mixup Junyoung Byun, Myung-Joon Kwon, Seungju Cho, Yoonji Kim, Changick Kim
PDF
Inverse Rendering of Translucent Objects Using Physical and Neural Renderers Chenhao Li, Trung Thanh Ngo, Hajime Nagahara
PDF
Inversion-Based Style Transfer with Diffusion Models Yuxin Zhang, Nisha Huang, Fan Tang, Haibin Huang, Chongyang Ma, Weiming Dong, Changsheng Xu
PDF
Invertible Neural Skinning Yash Kant, Aliaksandr Siarohin, Riza Alp Guler, Menglei Chai, Jian Ren, Sergey Tulyakov, Igor Gilitschenski
PDF
Inverting the Imaging Process by Learning an Implicit Camera Model Xin Huang, Qi Zhang, Ying Feng, Hongdong Li, Qing Wang
PDF
IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction Dekai Zhu, Guangyao Zhai, Yan Di, Fabian Manhardt, Hendrik Berkemeyer, Tuan Tran, Nassir Navab, Federico Tombari, Benjamin Busam
PDF
iQuery: Instruments as Queries for Audio-Visual Sound Separation Jiaben Chen, Renrui Zhang, Dongze Lian, Jiaqi Yang, Ziyao Zeng, Jianbo Shi
PDF
Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding Morris Alper, Michael Fiman, Hadar Averbuch-Elor
PDF
IS-GGT: Iterative Scene Graph Generation with Generative Transformers Sanjoy Kundu, Sathyanarayanan N. Aakur
PDF
ISBNet: A 3D Point Cloud Instance Segmentation Network with Instance-Aware Sampling and Box-Aware Dynamic Convolution Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen
PDF
Iterative Geometry Encoding Volume for Stereo Matching Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin Yang
PDF
Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections Alexander Gillert, Giulia Resente, Alba Anadon-Rosell, Martin Wilmking, Uwe Freiherr von Lukas
PDF
Iterative Proposal Refinement for Weakly-Supervised Video Grounding Meng Cao, Fangyun Wei, Can Xu, Xiubo Geng, Long Chen, Can Zhang, Yuexian Zou, Tao Shen, Daxin Jiang
PDF
Iterative Vision-and-Language Navigation Jacob Krantz, Shurjo Banerjee, Wang Zhu, Jason Corso, Peter Anderson, Stefan Lee, Jesse Thomason
PDF
IterativePFN: True Iterative Point Cloud Filtering Dasith de Silva Edirimuni, Xuequan Lu, Zhiwen Shao, Gang Li, Antonio Robles-Kelly, Ying He
PDF
itKD: Interchange Transfer-Based Knowledge Distillation for 3D Object Detection Hyeon Cho, Junyong Choi, Geonwoo Baek, Wonjun Hwang
PDF
JacobiNeRF: NeRF Shaping with Mutual Information Gradients Xiaomeng Xu, Yanchao Yang, Kaichun Mo, Boxiao Pan, Li Yi, Leonidas Guibas
PDF
JAWS: Just a Wild Shot for Cinematic Transfer in Neural Radiance Fields Xi Wang, Robin Courant, Jinglei Shi, Eric Marchand, Marc Christie
PDF
Jedi: Entropy-Based Localization and Removal of Adversarial Patches Bilel Tarchoun, Anouar Ben Khalifa, Mohamed Ali Mahjoub, Nael Abu-Ghazaleh, Ihsen Alouani
PDF
Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction Bin Fan, Yuxin Mao, Yuchao Dai, Zhexiong Wan, Qi Liu
PDF
Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset Shuaizheng Liu, Xindong Zhang, Lingchen Sun, Zhetong Liang, Hui Zeng, Lei Zhang
PDF
Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers Siyuan Wei, Tianzhu Ye, Shen Zhang, Yao Tang, Jiajun Liang
PDF
Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time Wei Shang, Dongwei Ren, Yi Yang, Hongzhi Zhang, Kede Ma, Wangmeng Zuo
PDF
Joint Visual Grounding and Tracking with Natural Language Specification Li Zhou, Zikun Zhou, Kaige Mao, Zhenyu He
PDF
JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking Edward Vendrow, Duy Tho Le, Jianfei Cai, Hamid Rezatofighi
PDF
K-Planes: Explicit Radiance Fields in Space, Time, and Appearance Sara Fridovich-Keil, Giacomo Meanti, Frederik Rahbæk Warburg, Benjamin Recht, Angjoo Kanazawa
PDF
K3DN: Disparity-Aware Kernel Estimation for Dual-Pixel Defocus Deblurring Yan Yang, Liyuan Pan, Liu Liu, Miaomiao Liu
PDF
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation Kaiwen Cui, Yingchen Yu, Fangneng Zhan, Shengcai Liao, Shijian Lu, Eric P. Xing
PDF
KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation Xiangyang Li, Zihan Wang, Jiahao Yang, Yaowei Wang, Shuqiang Jiang
PDF
Kernel Aware Resampler Michael Bernasconi, Abdelaziz Djelouah, Farnood Salehi, Markus Gross, Christopher Schroers
PDF
KiUT: Knowledge-Injected U-Transformer for Radiology Report Generation Zhongzhen Huang, Xiaofan Zhang, Shaoting Zhang
PDF
Knowledge Combination to Learn Rotated Detection Without Rotated Annotation Tianyu Zhu, Bryce Ferenczi, Pulak Purkait, Tom Drummond, Hamid Rezatofighi, Anton van den Hengel
PDF
Knowledge Distillation for 6d Pose Estimation by Aligning Distributions of Local Predictions Shuxuan Guo, Yinlin Hu, Jose M. Alvarez, Mathieu Salzmann
PDF
L-CoIns: Language-Based Colorization with Instance Awareness Zheng Chang, Shuchen Weng, Peixuan Zhang, Yu Li, Si Li, Boxin Shi
PDF
Label Information Bottleneck for Label Enhancement Qinghai Zheng, Jihua Zhu, Haoyu Tang
PDF
Label-Free Liver Tumor Segmentation Qixin Hu, Yixiong Chen, Junfei Xiao, Shuwen Sun, Jieneng Chen, Alan L. Yuille, Zongwei Zhou
PDF
LANA: A Language-Capable Navigator for Instruction Following and Generation Xiaohan Wang, Wenguan Wang, Jiayi Shao, Yi Yang
PDF
Language Adaptive Weight Generation for Multi-Task Visual Grounding Wei Su, Peihan Miao, Huanzhang Dou, Gaoang Wang, Liang Qiao, Zheyang Li, Xi Li
PDF
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification Yue Yang, Artemis Panagopoulou, Shenghao Zhou, Daniel Jin, Chris Callison-Burch, Mark Yatskar
PDF
Language-Guided Audio-Visual Source Separation via Trimodal Consistency Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko
PDF
Language-Guided Music Recommendation for Video via Prompt Analogies Daniel McKee, Justin Salamon, Josef Sivic, Bryan Russell
PDF
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data Jihye Park, Sunwoo Kim, Soohyun Kim, Seokju Cho, Jaejun Yoo, Youngjung Uh, Seungryong Kim
PDF
Large-Capacity and Flexible Video Steganography via Invertible Neural Network Chong Mou, Youmin Xu, Jiechong Song, Chen Zhao, Bernard Ghanem, Jian Zhang
PDF
Large-Scale Training Data Search for Object Re-Identification Yue Yao, Tom Gedeon, Liang Zheng
PDF
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia
PDF
LaserMix for Semi-Supervised LiDAR Semantic Segmentation Lingdong Kong, Jiawei Ren, Liang Pan, Ziwei Liu
PDF
LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models Adrian Bulat, Georgios Tzimiropoulos
PDF
Latency Matters: Real-Time Action Forecasting Transformer Harshayu Girase, Nakul Agarwal, Chiho Choi, Karttikeya Mangalam
PDF
Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures Gal Metzer, Elad Richardson, Or Patashnik, Raja Giryes, Daniel Cohen-Or
PDF
LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang
PDF
Layout-Based Causal Inference for Object Navigation Sixian Zhang, Xinhang Song, Weijie Li, Yubing Bai, Xinyao Yu, Shuqiang Jiang
PDF
LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation Guangcong Zheng, Xianpan Zhou, Xuewei Li, Zhongang Qi, Ying Shan, Xi Li
PDF
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi
PDF
LayoutDM: Transformer-Based Diffusion Model for Layout Generation Shang Chai, Liansheng Zhuang, Fengying Yan
PDF
LayoutFormer++: Conditional Graphic Layout Generation via Constraint Serialization and Decoding Space Restriction Zhaoyun Jiang, Jiaqi Guo, Shizhao Sun, Huayu Deng, Zhongkai Wu, Vuksan Mijovic, Zijiang James Yang, Jian-Guang Lou, Dongmei Zhang
PDF
Leapfrog Diffusion Model for Stochastic Trajectory Prediction Weibo Mao, Chenxin Xu, Qi Zhu, Siheng Chen, Yanfeng Wang
PDF
Learnable Skeleton-Aware 3D Point Cloud Sampling Cheng Wen, Baosheng Yu, Dacheng Tao
PDF
Learned Image Compression with Mixed Transformer-CNN Architectures Jinming Liu, Heming Sun, Jiro Katto
PDF
Learned Two-Plane Perspective Prior Based Image Resampling for Efficient Object Detection Anurag Ghosh, N. Dinesh Reddy, Christoph Mertz, Srinivasa G. Narasimhan
PDF
Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li
PDF
Learning 3D Scene Priors with 2D Supervision Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner
PDF
Learning 3D-Aware Image Synthesis with Unknown Pose Distribution Zifan Shi, Yujun Shen, Yinghao Xu, Sida Peng, Yiyi Liao, Sheng Guo, Qifeng Chen, Dit-Yan Yeung
PDF
Learning a 3D Morphable Face Reflectance Model from Low-Cost Data Yuxuan Han, Zhibo Wang, Feng Xu
PDF
Learning a Deep Color Difference Metric for Photographic Images Haoyu Chen, Zhihua Wang, Yang Yang, Qilin Sun, Kede Ma
PDF
Learning a Depth Covariance Function Eric Dexheimer, Andrew J. Davison
PDF
Learning a Practical SDR-to-HDRTV Up-Conversion Using New Dataset and Degradation Models Cheng Guo, Leidong Fan, Ziyu Xue, Xiuhua Jiang
PDF
Learning a Simple Low-Light Image Enhancer from Paired Low-Light Instances Zhenqi Fu, Yan Yang, Xiaotong Tu, Yue Huang, Xinghao Ding, Kai-Kuang Ma
PDF
Learning a Sparse Transformer Network for Effective Image Deraining Xiang Chen, Hao Li, Mingqiang Li, Jinshan Pan
PDF
Learning Accurate 3D Shape Based on Stereo Polarimetric Imaging Tianyu Huang, Haoang Li, Kejing He, Congying Sui, Bin Li, Yun-Hui Liu
PDF
Learning Action Changes by Measuring Verb-Adverb Textual Relationships Davide Moltisanti, Frank Keller, Hakan Bilen, Laura Sevilla-Lara
PDF
Learning Adaptive Dense Event Stereo from the Image Domain Hoonhee Cho, Jegyeong Cho, Kuk-Jin Yoon
PDF
Learning Analytical Posterior Probability for Human Mesh Recovery Qi Fang, Kang Chen, Yinghui Fan, Qing Shuai, Jiefeng Li, Weidong Zhang
PDF
Learning Anchor Transformations for 3D Garment Animation Fang Zhao, Zekun Li, Shaoli Huang, Junwu Weng, Tianfei Zhou, Guo-Sen Xie, Jue Wang, Ying Shan
PDF
Learning and Aggregating Lane Graphs for Urban Automated Driving Martin Büchner, Jannik Zürn, Ion-George Todoran, Abhinav Valada, Wolfram Burgard
PDF
Learning Articulated Shape with Keypoint Pseudo-Labels from Web Images Anastasis Stathopoulos, Georgios Pavlakos, Ligong Han, Dimitris N. Metaxas
PDF
Learning Attention as Disentangler for Compositional Zero-Shot Learning Shaozhe Hao, Kai Han, Kwan-Yee K. Wong
PDF
Learning Attribute and Class-Specific Representation Duet for Fine-Grained Fashion Analysis Yang Jiao, Yan Gao, Jingjing Meng, Jin Shang, Yi Sun
PDF
Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, Nick Barnes
PDF
Learning Bottleneck Concepts in Image Classification Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara
PDF
Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems Yangyang Shu, Anton van den Hengel, Lingqiao Liu
PDF
Learning Compact Representations for LiDAR Completion and Generation Yuwen Xiong, Wei-Chiu Ma, Jingkang Wang, Raquel Urtasun
PDF
Learning Conditional Attributes for Compositional Zero-Shot Learning Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, Peng Wang, Chunhua Shen
PDF
Learning Correspondence Uncertainty via Differentiable Nonlinear Least Squares Dominik Muhle, Lukas Koestler, Krishna Murthy Jatavallabhula, Daniel Cremers
PDF
Learning Customized Visual Models with Retrieval-Augmented Knowledge Haotian Liu, Kilho Son, Jianwei Yang, Ce Liu, Jianfeng Gao, Yong Jae Lee, Chunyuan Li
PDF
Learning Debiased Representations via Conditional Attribute Interpolation Yi-Kai Zhang, Qi-Wei Wang, De-Chuan Zhan, Han-Jia Ye
PDF
Learning Decorrelated Representations Efficiently Using Fast Fourier Transform Yutaro Shigeto, Masashi Shimbo, Yuya Yoshikawa, Akikazu Takeuchi
PDF
Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image Yu Deng, Baoyuan Wang, Heung-Yeung Shum
PDF
Learning Discriminative Representations for Skeleton Based Action Recognition Huanyu Zhou, Qingjie Liu, Yunhong Wang
PDF
Learning Distortion Invariant Representation for Image Restoration from a Causality Perspective Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen
PDF
Learning Dynamic Style Kernels for Artistic Style Transfer Wenju Xu, Chengjiang Long, Yongwei Nie
PDF
Learning Emotion Representations from Verbal and Nonverbal Communication Sitao Zhang, Yimu Pan, James Z. Wang
PDF
Learning Event Guided High Dynamic Range Video Reconstruction Yixin Yang, Jin Han, Jinxiu Liang, Imari Sato, Boxin Shi
PDF
Learning Expressive Prompting with Residuals for Vision Transformers Rajshekhar Das, Yonatan Dukler, Avinash Ravichandran, Ashwin Swaminathan
PDF
Learning Federated Visual Prompt in Null Space for MRI Reconstruction Chun-Mei Feng, Bangjun Li, Xinxing Xu, Yong Liu, Huazhu Fu, Wangmeng Zuo
PDF
Learning from Noisy Labels with Decoupled Meta Label Purifier Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Yabiao Wang, Chengjie Wang, Cai Rong Zhao
PDF
Learning from Unique Perspectives: User-Aware Saliency Modeling Shi Chen, Nachiappan Valliappan, Shaolei Shen, Xinyu Ye, Kai Kohlhoff, Junfeng He
PDF
Learning Generative Structure Prior for Blind Text Image Super-Resolution Xiaoming Li, Wangmeng Zuo, Chen Change Loy
PDF
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs Pattaramanee Arsomngern, Sarana Nutanong, Supasorn Suwajanakorn
PDF
Learning Geometry-Aware Representations by Sketching Hyundo Lee, Inwoo Hwang, Hyunsung Go, Won-Seok Choi, Kibeom Kim, Byoung-Tak Zhang
PDF
Learning Human Mesh Recovery in 3D Scenes Zehong Shen, Zhi Cen, Sida Peng, Qing Shuai, Hujun Bao, Xiaowei Zhou
PDF
Learning Human-to-Robot Handovers from Point Clouds Sammy Christen, Wei Yang, Claudia Pérez-D’Arpino, Otmar Hilliges, Dieter Fox, Yu-Wei Chao
PDF
Learning Imbalanced Data with Vision Transformers Zhengzhuo Xu, Ruikang Liu, Shuo Yang, Zenghao Chai, Chun Yuan
PDF
Learning Instance-Level Representation for Large-Scale Multi-Modal Pretraining in E-Commerce Yang Jin, Yongzhi Li, Zehuan Yuan, Yadong Mu
PDF
Learning Joint Latent Space EBM Prior Model for Multi-Layer Generator Jiali Cui, Ying Nian Wu, Tian Han
PDF
Learning Locally Editable Virtual Humans Hsuan-I Ho, Lixin Xue, Jie Song, Otmar Hilliges
PDF
Learning Multi-Modal Class-Specific Tokens for Weakly Supervised Dense Object Localization Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu
PDF
Learning Neural Duplex Radiance Fields for Real-Time View Synthesis Ziyu Wan, Christian Richardt, Aljaž Božič, Chao Li, Vijay Rengarajan, Seonghyeon Nam, Xiaoyu Xiang, Tuotuo Li, Bo Zhu, Rakesh Ranjan, Jing Liao
PDF
Learning Neural Parametric Head Models Simon Giebenhain, Tobias Kirschstein, Markos Georgopoulos, Martin Rünz, Lourdes Agapito, Matthias Nießner
PDF
Learning Neural Proto-Face Field for Disentangled 3D Face Modeling in the Wild Zhenyu Zhang, Renwang Chen, Weijian Cao, Ying Tai, Chengjie Wang
PDF
Learning Neural Volumetric Representations of Dynamic Humans in Minutes Chen Geng, Sida Peng, Zhen Xu, Hujun Bao, Xiaowei Zhou
PDF
Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection Chuangchuang Tan, Yao Zhao, Shikui Wei, Guanghua Gu, Yunchao Wei
PDF
Learning Open-Vocabulary Semantic Segmentation Models from Natural Language Supervision Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie
PDF
Learning Optical Expansion from Scale Matching Han Ling, Yinghui Sun, Quansen Sun, Zhenwen Ren
PDF
Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation Sun-Ao Liu, Yiheng Zhang, Zhaofan Qiu, Hongtao Xie, Yongdong Zhang, Ting Yao
PDF
Learning Partial Correlation Based Deep Visual Representation for Image Classification Saimunur Rahman, Piotr Koniusz, Lei Wang, Luping Zhou, Peyman Moghadam, Changming Sun
PDF
Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts-Escolano, Rohit Pandey, Ping Tan, Thabo Beeler, Sean Fanello, Yinda Zhang
PDF
Learning Procedure-Aware Video Representation from Instructional Videos and Their Narrations Yiwu Zhong, Licheng Yu, Yang Bai, Shangwen Li, Xueting Yan, Yin Li
PDF
Learning Rotation-Equivariant Features for Visual Correspondence Jongmin Lee, Byungjin Kim, Seungwook Kim, Minsu Cho
PDF
Learning Sample Relationship for Exposure Correction Jie Huang, Feng Zhao, Man Zhou, Jie Xiao, Naishan Zheng, Kaiwen Zheng, Zhiwei Xiong
PDF
Learning Semantic Relationship Among Instances for Image-Text Matching Zheren Fu, Zhendong Mao, Yan Song, Yongdong Zhang
PDF
Learning Semantic-Aware Disentangled Representation for Flexible 3D Human Body Editing Xiaokun Sun, Qiao Feng, Xiongzheng Li, Jinsong Zhang, Yu-Kun Lai, Jingyu Yang, Kun Li
PDF
Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang, Jiwei Wei, Chongyi Li, Heng Tao Shen
PDF
Learning Situation Hyper-Graphs for Video Question Answering Aisha Urooj, Hilde Kuehne, Bo Wu, Kim Chheu, Walid Bousselham, Chuang Gan, Niels Lobo, Mubarak Shah
PDF
Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution Yunfan Lu, Zipeng Wang, Minjie Liu, Hongjian Wang, Lin Wang
PDF
Learning Steerable Function for Efficient Image Resampling Jiacheng Li, Chang Chen, Wei Huang, Zhiqiang Lang, Fenglong Song, Youliang Yan, Zhiwei Xiong
PDF
Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation Liyan Chen, Weihan Wang, Philippos Mordohai
PDF
Learning to Detect and Segment for Open Vocabulary Object Detection Tao Wang
PDF
Learning to Detect Mirrors from Videos via Dual Correspondences Jiaying Lin, Xin Tan, Rynson W.H. Lau
PDF
Learning to Dub Movies via Hierarchical Prosody Models Gaoxiang Cong, Liang Li, Yuankai Qi, Zheng-Jun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang
PDF
Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Maximilian Ilse, Daniel C. Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anja Thieme, Anton Schwaighofer, Maria Wetscherek, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay
PDF
Learning to Exploit the Sequence-Specific Prior Knowledge for Image Processing Pipelines Optimization Haina Qin, Longfei Han, Weihua Xiong, Juan Wang, Wentao Ma, Bing Li, Weiming Hu
PDF
Learning to Fuse Monocular and Multi-View Cues for Multi-Frame Depth Estimation in Dynamic Scenes Rui Li, Dong Gong, Wei Yin, Hao Chen, Yu Zhu, Kaixuan Wang, Xiaozhi Chen, Jinqiu Sun, Yanning Zhang
PDF
Learning to Generate Image Embeddings with User-Level Differential Privacy Zheng Xu, Maxwell Collins, Yuxiao Wang, Liviu Panait, Sewoong Oh, Sean Augenstein, Ting Liu, Florian Schroff, H. Brendan McMahan
PDF
Learning to Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
PDF
Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from Only Image-Text Pairs Junbum Cha, Jonghwan Mun, Byungseok Roh
PDF
Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space Tianxin Huang, Zhonggan Ding, Jiangning Zhang, Ying Tai, Zhenyu Zhang, Mingang Chen, Chengjie Wang, Yong Liu
PDF
Learning to Name Classes for Vision and Language Models Sarah Parisot, Yongxin Yang, Steven McDonagh
PDF
Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data Nilesh Kulkarni, Linyi Jin, Justin Johnson, David F. Fouhey
PDF
Learning to Render Novel Views from Wide-Baseline Stereo Pairs Yilun Du, Cameron Smith, Ayush Tewari, Vincent Sitzmann
PDF
Learning to Retain While Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation Gaurav Patel, Konda Reddy Mopuri, Qiang Qiu
PDF
Learning to Segment Every Referring Object Point by Point Mengxue Qu, Yu Wu, Yunchao Wei, Wu Liu, Xiaodan Liang, Yao Zhao
PDF
Learning to Zoom and Unzoom Chittesh Thavamani, Mengtian Li, Francesco Ferroni, Deva Ramanan
PDF
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge
PDF
Learning Transformation-Predictive Representations for Detection and Description of Local Features Zihao Wang, Chunxu Wu, Yifei Yang, Zhen Li
PDF
Learning Transformations to Reduce the Geometric Shift in Object Detection Vidit Vidit, Martin Engilberge, Mathieu Salzmann
PDF
Learning Video Representations from Large Language Models Yue Zhao, Ishan Misra, Philipp Krähenbühl, Rohit Girdhar
PDF
Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting Ruichen Zheng, Peng Li, Haoqian Wang, Tao Yu
PDF
Learning Visual Representations via Language-Guided Sampling Mohamed El Banani, Karan Desai, Justin Johnson
PDF
Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions Yurui Zhu, Tianyu Wang, Xueyang Fu, Xuanyu Yang, Xin Guo, Jifeng Dai, Yu Qiao, Xiaowei Hu
PDF
Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning Zeyin Song, Yifan Zhao, Yujun Shi, Peixi Peng, Li Yuan, Yonghong Tian
PDF
Learning with Noisy Labels via Self-Supervised Adversarial Noisy Masking Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cai Rong Zhao
PDF
LEGO-Net: Learning Regular Rearrangements of Objects in Rooms Qiuhong Anna Wei, Sijie Ding, Jeong Joon Park, Rahul Sajnani, Adrien Poulenard, Srinath Sridhar, Leonidas Guibas
PDF
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization Sheng Liu, Cong Phuoc Huynh, Cong Chen, Maxim Arap, Raffay Hamid
PDF
Less Is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation Li Li, Hubert P. H. Shum, Toby P. Breckon
PDF
Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces Yuxi Xiao, Nan Xue, Tianfu Wu, Gui-Song Xia
PDF
Leverage Interactive Affinity for Affordance Learning Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao
PDF
Leveraging Hidden Positives for Unsupervised Semantic Segmentation Hyun Seok Seong, WonJun Moon, SuBeen Lee, Jae-Pil Heo
PDF
Leveraging Inter-Rater Agreement for Classification in the Presence of Noisy Labels Maria Sofia Bucarelli, Lucas Cassano, Federico Siciliano, Amin Mantrach, Fabrizio Silvestri
PDF
Leveraging per Image-Token Consistency for Vision-Language Pre-Training Yunhao Gou, Tom Ko, Hansi Yang, James Kwok, Yu Zhang, Mingxuan Wang
PDF
Leveraging Temporal Context in Low Representational Power Regimes Camilo L. Fosco, SouYoung Jin, Emilie Josephs, Aude Oliva
PDF
LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising Zichun Wang, Ying Fu, Ji Liu, Yulun Zhang
PDF
LiDAR-in-the-Loop Hyperparameter Optimization Félix Goudreault, Dominik Scheuble, Mario Bijelic, Nicolas Robidoux, Felix Heide
PDF
LiDAR2Map: In Defense of LiDAR-Based Semantic mAP Construction Using Online Camera Distillation Song Wang, Wentong Li, Wenyu Liu, Xiaolu Liu, Jianke Zhu
PDF
LidarGait: Benchmarking 3D Gait Recognition with Point Clouds Chuanfu Shen, Chao Fan, Wei Wu, Rui Wang, George Q. Huang, Shiqi Yu
PDF
Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field Leheng Li, Qing Lian, Luozhou Wang, Ningning Ma, Ying-Cong Chen
PDF
Light Source Separation and Intrinsic Image Decomposition Under AC Illumination Yusaku Yoshida, Ryo Kawahara, Takahiro Okabe
PDF
LightedDepth: Video Depth Estimation in Light of Limited Inference View Angles Shengjie Zhu, Xiaoming Liu
PDF
LightPainter: Interactive Portrait Relighting with Freehand Scribble Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel
PDF
LINe: Out-of-Distribution Detection by Leveraging Important Neurons Yong Hyun Ahn, Gyeong-Moon Park, Seong Tae Kim
PDF
LinK: Linear Kernel for LiDAR-Based 3D Perception Tao Lu, Xiang Ding, Haisong Liu, Gangshan Wu, Limin Wang
PDF
Linking Garment with Person via Semantically Associated Landmarks for Virtual Try-on Keyu Yan, Tingwei Gao, Hui Zhang, Chengjun Xie
PDF
LipFormer: High-Fidelity and Generalizable Talking Face Generation with a Pre-Learned Facial Codebook Jiayu Wang, Kang Zhao, Shiwei Zhang, Yingya Zhang, Yujun Shen, Deli Zhao, Jingren Zhou
PDF
Listening Human Behavior: 3D Human Pose Estimation with Acoustic Signals Yuto Shibata, Yutaka Kawashima, Mariko Isogawa, Go Irie, Akisato Kimura, Yoshimitsu Aoki
PDF
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR Feng Li, Ailing Zeng, Shilong Liu, Hao Zhang, Hongyang Li, Lei Zhang, Lionel M. Ni
PDF
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation Ning Zhang, Francesco Nex, George Vosselman, Norman Kerle
PDF
Local 3D Editing via 3D Distillation of CLIP Knowledge Junha Hyung, Sungwon Hwang, Daejin Kim, Hyunji Lee, Jaegul Choo
PDF
Local Connectivity-Based Density Estimation for Face Clustering Junho Shin, Hyo-Jun Lee, Hyunseop Kim, Jong-Hyeon Baek, Daehyun Kim, Yeong Jun Koh
PDF
Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution Jie-En Yao, Li-Yuan Tsao, Yi-Chen Lo, Roy Tseng, Chia-Che Chang, Chun-Yi Lee
PDF
Local Implicit Ray Function for Generalizable Radiance Field Representation Xin Huang, Qi Zhang, Ying Feng, Xiaoyu Li, Xuan Wang, Qing Wang
PDF
Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning Hyesong Choi, Hunsang Lee, Wonil Song, Sangryul Jeon, Kwanghoon Sohn, Dongbo Min
PDF
Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields Yue Chen, Xingyu Chen, Xuan Wang, Qi Zhang, Yu Guo, Ying Shan, Fei Wang
PDF
Localized Semantic Feature Mixers for Efficient Pedestrian Detection in Autonomous Driving Abdul Hannan Khan, Mohammed Shariq Nawaz, Andreas Dengel
PDF
LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding Gen Li, Varun Jampani, Deqing Sun, Laura Sevilla-Lara
PDF
Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning Haiyu Wu, Grace Bezold, Aman Bhatta, Kevin W. Bowyer
PDF
Logical Implications for Visual Question Answering Consistency Sergio Tascon-Morales, Pablo Márquez-Neila, Raphael Sznitman
PDF
LOGO: A Long-Form Video Dataset for Group Action Quality Assessment Shiyi Zhang, Wenxun Dai, Sujia Wang, Xiangwei Shen, Jiwen Lu, Jie Zhou, Yansong Tang
PDF
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He
PDF
Long Range Pooling for 3D Large-Scale Scene Understanding Xiang-Li Li, Meng-Hao Guo, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu
PDF
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation Yan Jin, Mengke Li, Yang Lu, Yiu-ming Cheung, Hanzi Wang
PDF
Long-Term Visual Localization with Mobile Sensors Shen Yan, Yu Liu, Long Wang, Zehong Shen, Zhen Peng, Haomin Liu, Maojun Zhang, Guofeng Zhang, Xiaowei Zhou
PDF
Look Around for Anomalies: Weakly-Supervised Anomaly Detection via Context-Motion Relational Learning MyeongAh Cho, Minjung Kim, Sangwon Hwang, Chaewon Park, Kyungjae Lee, Sangyoun Lee
PDF
Look Before You Match: Instance Understanding Matters in Video Object Segmentation Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang
PDF
Look, Radiate, and Learn: Self-Supervised Localisation via Radio-Visual Correspondence Mohammed Alloulah, Maximilian Arnold
PDF
Lookahead Diffusion Probabilistic Models for Refining Mean Estimation Guoqiang Zhang, Kenta Niwa, W. Bastiaan Kleijn
PDF
Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections Jiaxiong Qiu, Peng-Tao Jiang, Yifan Zhu, Ze-Xin Yin, Ming-Ming Cheng, Bo Ren
PDF
Low-Light Image Enhancement via Structure Modeling and Guidance Xiaogang Xu, Ruixing Wang, Jiangbo Lu
PDF
LP-DIF: Learning Local Pattern-Specific Deep Implicit Function for 3D Objects and Scenes Meng Wang, Yu-Shen Liu, Yue Gao, Kanle Shi, Yi Fang, Zhizhong Han
PDF
LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection Jinsheng Xiao, Yuanxu Wu, Yunhua Chen, Shurui Wang, Zhongyuan Wang, Jiayi Ma
PDF
LVQAC: Lattice Vector Quantization Coupled with Spatially Adaptive Companding for Efficient Learned Image Compression Xi Zhang, Xiaolin Wu
PDF
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin
PDF
MACARONS: Mapping and Coverage Anticipation with RGB Online Self-Supervision Antoine Guédon, Tom Monnier, Pascal Monasse, Vincent Lepetit
PDF
MAESTER: Masked Autoencoder Guided Segmentation at Pixel Resolution for Accurate, Self-Supervised Subcellular Structure Recognition Ronald Xie, Kuan Pang, Gary D. Bader, Bo Wang
PDF
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis Tianhong Li, Huiwen Chang, Shlok Mishra, Han Zhang, Dina Katabi, Dilip Krishnan
PDF
Magic3D: High-Resolution Text-to-3D Content Creation Chen-Hsuan Lin, Jun Gao, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, Karsten Kreis, Sanja Fidler, Ming-Yu Liu, Tsung-Yi Lin
PDF
MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery Duowen Chen, Yunhao Bai, Wei Shen, Qingli Li, Lequan Yu, Yan Wang
PDF
MagicPony: Learning Articulated 3D Animals in the Wild Shangzhe Wu, Ruining Li, Tomas Jakab, Christian Rupprecht, Andrea Vedaldi
PDF
MAGVIT: Masked Generative Video Transformer Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang
PDF
MAGVLT: Masked Generative Vision-and-Language Transformer Sungwoong Kim, Daejin Jo, Donghoon Lee, Jongmin Kim
PDF
MAIR: Multi-View Attention Inverse Rendering with 3D Spatially-Varying Lighting Estimation JunYong Choi, SeokYeong Lee, Haesol Park, Seung-Won Jung, Ig-Jae Kim, Junghyun Cho
PDF
Make Landscape Flatter in Differentially Private Federated Learning Yifan Shi, Yingqi Liu, Kang Wei, Li Shen, Xueqian Wang, Dacheng Tao
PDF
Make-a-Story: Visual Memory Conditioned Consistent Story Generation Tanzila Rahman, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Shweta Mahajan, Leonid Sigal
PDF
Making Vision Transformers Efficient from a Token Sparsification View Shuning Chang, Pichao Wang, Ming Lin, Fan Wang, David Junhao Zhang, Rong Jin, Mike Zheng Shou
PDF
MaLP: Manipulation Localization Using a Proactive Scheme Vishal Asnani, Xi Yin, Tal Hassner, Xiaoming Liu
PDF
MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding Jun Chen, Ming Hu, Darren J. Coker, Michael L. Berumen, Blair Costelloe, Sara Beery, Anna Rohrbach, Mohamed Elhoseiny
PDF
Manipulating Transfer Learning for Property Inference Yulong Tian, Fnu Suya, Anshuman Suri, Fengyuan Xu, David Evans
PDF
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-Training Model Yatai Ji, Junjie Wang, Yuan Gong, Lin Zhang, Yanru Zhu, Hongfa Wang, Jiaxing Zhang, Tetsuya Sakai, Yujiu Yang
PDF
MaPLe: Multi-Modal Prompt Learning Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, Salman Khan, Fahad Shahbaz Khan
PDF
Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection with Single Point Supervision Xinyi Ying, Li Liu, Yingqian Wang, Ruojing Li, Nuo Chen, Zaiping Lin, Weidong Sheng, Shilin Zhou
PDF
Marching-Primitives: Shape Abstraction from Signed Distance Function Weixiao Liu, Yuwei Wu, Sipu Ruan, Gregory S. Chirikjian
PDF
MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins Tiberiu Sosea, Cornelia Caragea
PDF
Markerless Camera-to-Robot Pose Estimation via Self-Supervised Sim-to-Real Transfer Jingpei Lu, Florian Richter, Michael C. Yip
PDF
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat
PDF
MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds Jiahui Liu, Chirui Chang, Jianhui Liu, Xiaoyang Wu, Lan Ma, Xiaojuan Qi
PDF
Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation Feng Li, Hao Zhang, Huaizhe Xu, Shilong Liu, Lei Zhang, Lionel M. Ni, Heung-Yeung Shum
PDF
Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations Vibashan Vs, Ning Yu, Chen Xing, Can Qin, Mingfei Gao, Juan Carlos Niebles, Vishal M. Patel, Ran Xu
PDF
Mask-Free Video Instance Segmentation Lei Ke, Martin Danelljan, Henghui Ding, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu
PDF
Mask-Guided Matting in the Wild Kwanyong Park, Sanghyun Woo, Seoung Wug Oh, In So Kweon, Joon-Young Lee
PDF
Mask3D: Pre-Training 2D Vision Transformers by Learning Masked 3D Priors Ji Hou, Xiaoliang Dai, Zijian He, Angela Dai, Matthias Nießner
PDF
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining Xiaoyi Dong, Jianmin Bao, Yinglin Zheng, Ting Zhang, Dongdong Chen, Hao Yang, Ming Zeng, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu
PDF
MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset Chen Feng, Ioannis Patras
PDF
Masked and Adaptive Transformer for Exemplar Based Image Translation Chang Jiang, Fei Gao, Biao Ma, Yuhao Lin, Nannan Wang, Gang Xu
PDF
Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang, Xiaoming Wei, Xiaolin Wei
PDF
Masked Autoencoders Enable Efficient Knowledge Distillers Yutong Bai, Zeyu Wang, Junfei Xiao, Chen Wei, Huiyu Wang, Alan L. Yuille, Yuyin Zhou, Cihang Xie
PDF
Masked Autoencoding Does Not Help Natural Language Supervision at Scale Floris Weers, Vaishaal Shankar, Angelos Katharopoulos, Yinfei Yang, Tom Gunter
PDF
Masked Image Modeling with Local Multi-Scale Reconstruction Haoqing Wang, Yehui Tang, Yunhe Wang, Jianyuan Guo, Zhi-Hong Deng, Kai Han
PDF
Masked Image Training for Generalizable Deep Image Denoising Haoyu Chen, Jinjin Gu, Yihao Liu, Salma Abdel Magid, Chao Dong, Qiong Wang, Hanspeter Pfister, Lei Zhu
PDF
Masked Images Are Counterfactual Samples for Robust Fine-Tuning Yao Xiao, Ziyi Tang, Pengxu Wei, Cong Liu, Liang Lin
PDF
Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers Bin Ren, Yahui Liu, Yue Song, Wei Bi, Rita Cucchiara, Nicu Sebe, Wei Wang
PDF
Masked Motion Encoding for Self-Supervised Video Representation Learning Xinyu Sun, Peihao Chen, Liangwei Chen, Changhao Li, Thomas H. Li, Mingkui Tan, Chuang Gan
PDF
Masked Representation Learning for Domain Generalized Stereo Matching Zhibo Rao, Bangshu Xiong, Mingyi He, Yuchao Dai, Renjie He, Zhelun Shen, Xing Li
PDF
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning Xiaoyang Wu, Xin Wen, Xihui Liu, Hengshuang Zhao
PDF
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang
PDF
Masked Wavelet Representation for Compact Neural Radiance Fields Daniel Rho, Byeonghyeon Lee, Seungtae Nam, Joo Chan Lee, Jong Hwan Ko, Eunbyung Park
PDF
MaskSketch: Unpaired Structure-Guided Masked Image Generation Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa
PDF
Master: Meta Style Transformer for Controllable Zero-Shot and Few-Shot Artistic Style Transfer Hao Tang, Songhua Liu, Tianwei Lin, Shaoli Huang, Fu Li, Dongliang He, Xinchao Wang
PDF
Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation Min Shi, Zihao Huang, Xianzheng Ma, Xiaowei Hu, Zhiguo Cao
PDF
MCF: Mutual Correction Framework for Semi-Supervised Medical Image Segmentation Yongchao Wang, Bin Xiao, Xiuli Bi, Weisheng Li, Xinbo Gao
PDF
MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos Zicheng Zhang, Wei Wu, Wei Sun, Danyang Tu, Wei Lu, Xiongkuo Min, Ying Chen, Guangtao Zhai
PDF
MDL-NAS: A Joint Multi-Domain Learning Framework for Vision Transformer Shiguang Wang, Tao Xie, Jian Cheng, Xingcheng Zhang, Haijun Liu
PDF
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos Minghan Li, Shuai Li, Wangmeng Xiang, Lei Zhang
PDF
MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam
PDF
MEDIC: Remove Model Backdoors via Importance Driven Cloning Qiuling Xu, Guanhong Tao, Jean Honorio, Yingqi Liu, Shengwei An, Guangyu Shen, Siyuan Cheng, Xiangyu Zhang
PDF
Megahertz Light Steering Without Moving Parts Adithya Pediredla, Srinivasa G. Narasimhan, Maysamreza Chamanzar, Ioannis Gkioulekas
PDF
MEGANE: Morphable Eyeglass and Avatar Network Junxuan Li, Shunsuke Saito, Tomas Simon, Stephen Lombardi, Hongdong Li, Jason Saragih
PDF
MELTR: Meta Loss Transformer for Learning to Fine-Tune Video Foundation Models Dohwan Ko, Joonmyung Choi, Hyeong Kyu Choi, Kyoung-Woon On, Byungseok Roh, Hyunwoo J. Kim
PDF
MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction Congyi Wang, Feida Zhu, Shilei Wen
PDF
Memory-Friendly Scalable Super-Resolution via Rewinding Lottery Ticket Hypothesis Jin Lin, Xiaotong Luo, Ming Hong, Yanyun Qu, Yuan Xie, Zongze Wu
PDF
Meta Architecture for Point Cloud Analysis Haojia Lin, Xiawu Zheng, Lijiang Li, Fei Chao, Shanshan Wang, Yan Wang, Yonghong Tian, Rongrong Ji
PDF
Meta Compositional Referring Expression Segmentation Li Xu, Mark He Huang, Xindi Shang, Zehuan Yuan, Ying Sun, Jun Liu
PDF
Meta Omnium: A Benchmark for General-Purpose Learning-to-Learn Ondrej Bohdal, Yinbing Tian, Yongshuo Zong, Ruchika Chavhan, Da Li, Henry Gouk, Li Guo, Timothy Hospedales
PDF
Meta-Causal Learning for Single Domain Generalization Jin Chen, Zhi Gao, Xinxiao Wu, Jiebo Luo
PDF
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding Minyoung Hwang, Jaeyeon Jeong, Minsoo Kim, Yoonseon Oh, Songhwai Oh
PDF
Meta-Learning with a Geometry-Adaptive Preconditioner Suhyun Kang, Duhun Hwang, Moonjung Eo, Taesup Kim, Wonjong Rhee
PDF
Meta-Personalizing Vision-Language Models to Find Named Instances in Video Chun-Hsiao Yeh, Bryan Russell, Josef Sivic, Fabian Caba Heilbron, Simon Jenni
PDF
Meta-Tuning Loss Functions and Data Augmentation for Few-Shot Object Detection Berkan Demirel, Orhun Buğra Baran, Ramazan Gokberk Cinbis
PDF
MetaCLUE: Towards Comprehensive Visual Metaphors Research Arjun R. Akula, Brendan Driscoll, Pradyumna Narayana, Soravit Changpinyo, Zhiwei Jia, Suyash Damle, Garima Pruthi, Sugato Basu, Leonidas Guibas, William T. Freeman, Yuanzhen Li, Varun Jampani
PDF
Metadata-Based RAW Reconstruction via Implicit Neural Functions Leyi Li, Huijie Qiao, Qi Ye, Qinmin Yang
PDF
MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding from Object Detection Wenda Zhao, Shigeng Xie, Fan Zhao, You He, Huchuan Lu
PDF
MetaMix: Towards Corruption-Robust Continual Learning with Temporally Self-Adaptive Data Transformation Zhenyi Wang, Li Shen, Donglin Zhan, Qiuling Suo, Yanjun Zhu, Tiehang Duan, Mingchen Gao
PDF
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation Bowen Zhang, Chenyang Qi, Pan Zhang, Bo Zhang, HsiangTao Wu, Dong Chen, Qifeng Chen, Yong Wang, Fang Wen
PDF
MetaViewer: Towards a Unified Multi-View Representation Ren Wang, Haoliang Sun, Yuling Ma, Xiaoming Xi, Yilong Yin
PDF
MethaneMapper: Spectral Absorption Aware Hyperspectral Transformer for Methane Detection Satish Kumar, Ivan Arevalo, Asm Iftekhar, B S Manjunath
PDF
METransformer: Radiology Report Generation by Transformer with Multiple Learnable Expert Tokens Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou
PDF
MHPL: Minimum Happy Points Learning for Active Source Free Domain Adaptation Fan Wang, Zhongyi Han, Zhiyan Zhang, Rundong He, Yilong Yin
PDF
MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation Yong Yang, Qiong Chen, Yuan Feng, Tianlin Huang
PDF
MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation Lukas Hoyer, Dengxin Dai, Haoran Wang, Luc Van Gool
PDF
Micron-BERT: BERT-Based Facial Micro-Expression Recognition Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li, Susan Gauch, Han-Seok Seo, Khoa Luu
PDF
MIME: Human-Aware 3D Scene Generation Hongwei Yi, Chun-Hao P. Huang, Shashank Tripathi, Lea Hering, Justus Thies, Michael J. Black
PDF
Mind the Label Shift of Augmentation-Based Graph OOD Generalization Junchi Yu, Jian Liang, Ran He
PDF
Minimizing Maximum Model Discrepancy for Transferable Black-Box Targeted Attacks Anqi Zhao, Tong Chu, Yahao Liu, Wen Li, Jingjing Li, Lixin Duan
PDF
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li
PDF
MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence Yixuan Sun, Yiwen Huang, Haijing Guo, Yuzhou Zhao, Runmin Wu, Yizhou Yu, Weifeng Ge, Wenqiang Zhang
PDF
MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering Difei Gao, Luowei Zhou, Lei Ji, Linchao Zhu, Yi Yang, Mike Zheng Shou
PDF
Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing with Non-Learnable Primitives Chuntao Ding, Zhichao Lu, Shangguang Wang, Ran Cheng, Vishnu Naresh Boddeti
PDF
Mixed Autoencoder for Self-Supervised Visual Representation Learning Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung
PDF
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li
PDF
MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs Seunghyeon Seo, Donghoon Han, Yeonjin Chang, Nojun Kwak
PDF
MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering Jingjing Jiang, Nanning Zheng
PDF
MixSim: A Hierarchical Framework for Mixed Reality Traffic Simulation Simon Suo, Kelvin Wong, Justin Xu, James Tu, Alexander Cui, Sergio Casas, Raquel Urtasun
PDF
MixTeacher: Mining Promising Labels with Mixed Scale Teacher for Semi-Supervised Object Detection Liang Liu, Boshen Zhang, Jiangning Zhang, Wuhao Zhang, Zhenye Gan, Guanzhong Tian, Wenbing Zhu, Yabiao Wang, Chengjie Wang
PDF
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency Mingye Xu, Mutian Xu, Tong He, Wanli Ouyang, Yali Wang, Xiaoguang Han, Yu Qiao
PDF
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation Ludan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo
PDF
MMANet: Margin-Aware Distillation and Modality-Aware Regularization for Incomplete Multimodal Learning Shicai Wei, Chunbo Luo, Yang Luo
PDF
MMG-Ego4D: Multimodal Generalization in Egocentric Action Recognition Xinyu Gong, Sreyas Mohan, Naina Dhingra, Jean-Charles Bazin, Yilei Li, Zhangyang Wang, Rakesh Ranjan
PDF
MMVC: Learned Multi-Mode Video Compression with Block-Based Prediction Mode Selection and Density-Adaptive Entropy Coding Bowen Liu, Yu Chen, Rakesh Chowdary Machineni, Shiyu Liu, Hun-Seok Kim
PDF
Mobile User Interface Element Detection via Adaptively Prompt Tuning Zhangxuan Gu, Zhuoer Xu, Haoxing Chen, Jun Lan, Changhua Meng, Weiqiang Wang
PDF
MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices Kejie Li, Jia-Wang Bian, Robert Castle, Philip H.S. Torr, Victor Adrian Prisacariu
PDF
MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures Zhiqin Chen, Thomas Funkhouser, Peter Hedman, Andrea Tagliasacchi
PDF
MobileOne: An Improved One Millisecond Mobile Backbone Pavan Kumar Anasosalu Vasu, James Gabriel, Jeff Zhu, Oncel Tuzel, Anurag Ranjan
PDF
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning Meets Knowledge Distillation Roy Miles, Mehmet Kerim Yucel, Bruno Manganelli, Albert Saà-Garriga
PDF
Mod-SQuAD: Designing Mixtures of Experts as Modular Multi-Task Learners Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan
PDF
Modality-Agnostic Debiasing for Single Domain Generalization Sanqing Qu, Yingwei Pan, Guang Chen, Ting Yao, Changjun Jiang, Tao Mei
PDF
Modality-Invariant Visual Odometry for Embodied Vision Marius Memmel, Roman Bachmann, Amir Zamir
PDF
MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences Yingwei Li, Charles R. Qi, Yin Zhou, Chenxi Liu, Dragomir Anguelov
PDF
Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection Lianyu Wang, Meng Wang, Daoqiang Zhang, Huazhu Fu
PDF
Model-Agnostic Gender Debiased Image Captioning Yusuke Hirota, Yuta Nakashima, Noa Garcia
PDF
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao
PDF
Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery Wenbin Li, Zhichen Fan, Jing Huo, Yang Gao
PDF
Modeling the Distributional Uncertainty for Salient Object Detection Models Xinyu Tian, Jing Zhang, Mochu Xiang, Yuchao Dai
PDF
Modeling Video as Stochastic Processes for Fine-Grained Video Representation Learning Heng Zhang, Daqing Liu, Qi Zheng, Bing Su
PDF
Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer Agus Gunawan, Soo Ye Kim, Hyeonjun Sim, Jae-Ho Lee, Munchurl Kim
PDF
MoDi: Unconditional Motion Synthesis from Diverse Data Sigal Raab, Inbal Leibovitch, Peizhuo Li, Kfir Aberman, Olga Sorkine-Hornung, Daniel Cohen-Or
PDF
Modular Memorability: Tiered Representations for Video Memorability Prediction Théo Dumont, Juan Segundo Hevia, Camilo L. Fosco
PDF
Mofusion: A Framework for Denoising-Diffusion-Based Motion Synthesis Rishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik, Christian Theobalt
PDF
MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition Xiang Wang, Shiwei Zhang, Zhiwu Qing, Changxin Gao, Yingya Zhang, Deli Zhao, Nong Sang
PDF
MonoATT: Online Monocular 3D Object Detection with Adaptive Token Transformer Yunsong Zhou, Hongzi Zhu, Quan Liu, Shan Chang, Minyi Guo
PDF
MonoHuman: Animatable Human Neural Field from Monocular Video Zhengming Yu, Wei Cheng, Xian Liu, Wayne Wu, Kwan-Yee Lin
PDF
MOSO: Decomposing MOtion, Scene and Object for Video Prediction Mingzhen Sun, Weining Wang, Xinxin Zhu, Jing Liu
PDF
MoStGAN-V: Video Generation with Temporal Motion Styles Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
PDF
MOT: Masked Optimal Transport for Partial Domain Adaptation You-Wei Luo, Chuan-Xian Ren
PDF
Motion Information Propagation for Neural Video Compression Linfeng Qi, Jiahao Li, Bin Li, Houqiang Li, Yan Lu
PDF
MotionDiffuser: Controllable Multi-Agent Motion Prediction Using Diffusion Chiyu “Max” Jiang, Andre Cornman, Cheolho Park, Benjamin Sapp, Yin Zhou, Dragomir Anguelov
PDF
MotionTrack: Learning Robust Short-Term and Long-Term Motions for Multi-Object Tracking Zheng Qin, Sanping Zhou, Le Wang, Jinghai Duan, Gang Hua, Wei Tang
PDF
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors Yuang Zhang, Tiancai Wang, Xiangyu Zhang
PDF
MOVES: Manipulated Objects in Video Enable Segmentation Richard E. L. Higgins, David F. Fouhey
PDF
Movies2Scenes: Using Movie Metadata to Learn Scene Representation Shixing Chen, Chun-Hao Liu, Xiang Hao, Xiaohan Nie, Maxim Arap, Raffay Hamid
PDF
MP-Former: Mask-Piloted Transformer for Image Segmentation Hao Zhang, Feng Li, Huaizhe Xu, Shijia Huang, Shilong Liu, Lionel M. Ni, Lei Zhang
PDF
MSeg3D: Multi-Modal 3D Semantic Segmentation for Autonomous Driving Jiale Li, Hang Dai, Hao Han, Yong Ding
PDF
MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection from Point Cloud Sequences Chenhang He, Ruihuang Li, Yabin Zhang, Shuai Li, Lei Zhang
PDF
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao
PDF
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection Yang Jiao, Zequn Jie, Shaoxiang Chen, Jingjing Chen, Lin Ma, Yu-Gang Jiang
PDF
Multi Domain Learning for Motion Magnification Jasdeep Singh, Subrahmanyam Murala, G. Sankara Raju Kosuru
PDF
Multi-Agent Automated Machine Learning Zhaozhi Wang, Kefan Su, Jian Zhang, Huizhu Jia, Qixiang Ye, Xiaodong Xie, Zongqing Lu
PDF
Multi-Centroid Task Descriptor for Dynamic Class Incremental Inference Tenghao Cai, Zhizhong Zhang, Xin Tan, Yanyun Qu, Guannan Jiang, Chengjie Wang, Yuan Xie
PDF
Multi-Concept Customization of Text-to-Image Diffusion Nupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, Jun-Yan Zhu
PDF
Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph Rixin Zhou, Jiafu Wei, Qian Zhang, Ruihua Qi, Xi Yang, Chuntao Li
PDF
Multi-Label Compound Expression Recognition: C-EXPR Database & Network Dimitrios Kollias
PDF
Multi-Level Logit Distillation Ying Jin, Jiaqi Wang, Dahua Lin
PDF
Multi-Modal Gait Recognition via Effective Spatial-Temporal Feature Fusion Yufeng Cui, Yimei Kang
PDF
Multi-Modal Learning with Missing Modality via Shared-Specific Feature Modelling Hu Wang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo Carneiro
PDF
Multi-Modal Representation Learning with Text-Driven Soft Masks Jaeyoo Park, Bohyung Han
PDF
Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning Kaiyou Song, Jin Xie, Shan Zhang, Zimeng Luo
PDF
Multi-Object Manipulation via Object-Centric Neural Scattering Functions Stephen Tian, Yancheng Cai, Hong-Xing Yu, Sergey Zakharov, Katherine Liu, Adrien Gaidon, Yunzhu Li, Jiajun Wu
PDF
Multi-Realism Image Compression with a Conditional Generator Eirikur Agustsson, David Minnen, George Toderici, Fabian Mentzer
PDF
Multi-Sensor Large-Scale Dataset for Multi-View 3D Reconstruction Oleg Voynov, Gleb Bobrovskikh, Pavel Karpyshev, Saveliy Galochkin, Andrei-Timotei Ardelean, Arseniy Bozhenko, Ekaterina Karmanova, Pavel Kopanev, Yaroslav Labutin-Rymsho, Ruslan Rakhimov, Aleksandr Safin, Valerii Serpiva, Alexey Artemov, Evgeny Burnaev, Dzmitry Tsetserukou, Denis Zorin
PDF
Multi-Space Neural Radiance Fields Ze-Xin Yin, Jiaxiong Qiu, Ming-Ming Cheng, Bo Ren
PDF
Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains Mingjun Xu, Lingyun Qin, Weijie Chen, Shiliang Pu, Lei Zhang
PDF
Multi-View Azimuth Stereo via Tangent Space Consistency Xu Cao, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita
PDF
Multi-View Inverse Rendering for Large-Scale Real-World Indoor Scenes Zhen Li, Lingli Wang, Mofang Cheng, Cihui Pan, Jiaqi Yang
PDF
Multi-View Reconstruction Using Signed Ray Distance Functions (SRDF) Pierre Zins, Yuanlu Xu, Edmond Boyer, Stefanie Wuhrer, Tony Tung
PDF
Multi-View Stereo Representation Revist: Region-Aware MVSNet Yisu Zhang, Jianke Zhu, Lixiang Lin
PDF
Multiclass Confidence and Localization Calibration for Object Detection Bimsara Pathiraja, Malitha Gunawardhana, Muhammad Haris Khan
PDF
Multilateral Semantic Relations Modeling for Image Text Retrieval Zheng Wang, Zhenwei Gao, Kangshuai Guo, Yang Yang, Xiaoming Wang, Heng Tao Shen
PDF
Multimodal Industrial Anomaly Detection via Hybrid Fusion Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Yabiao Wang, Chengjie Wang
PDF
Multimodal Prompting with Missing Modalities for Visual Recognition Yi-Lun Lee, Yi-Hsuan Tsai, Wei-Chen Chiu, Chen-Yu Lee
PDF
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models Zhiqiu Lin, Samuel Yu, Zhiyi Kuang, Deepak Pathak, Deva Ramanan
PDF
Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning Kangning Liu, Weicheng Zhu, Yiqiu Shen, Sheng Liu, Narges Razavian, Krzysztof J. Geras, Carlos Fernandez-Granda
PDF
Multiplicative Fourier Level of Detail Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Bingbing Ni
PDF
Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis Kang Han, Wei Xiang
PDF
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline Wei Ji, Jingjing Li, Cheng Bian, Zongwei Zhou, Jiaying Zhao, Alan L. Yuille, Li Cheng
PDF
Multivariate, Multi-Frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation Feiyu Chen, Jie Shao, Shuyuan Zhu, Heng Tao Shen
PDF
Multiview Compressive Coding for 3D Reconstruction Chao-Yuan Wu, Justin Johnson, Jitendra Malik, Christoph Feichtenhofer, Georgia Gkioxari
PDF
Music-Driven Group Choreography Nhat Le, Thang Pham, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen
PDF
Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video Runyang Feng, Yixing Gao, Xueqing Ma, Tze Ho Elden Tse, Hyung Jin Chang
PDF
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training Runsen Xu, Tai Wang, Wenwei Zhang, Runjian Chen, Jinkun Cao, Jiangmiao Pang, Dahua Lin
PDF
MVImgNet: A Large-Scale Dataset of Multi-View Images Xianggang Yu, Mutian Xu, Yidan Zhang, Haolin Liu, Chongjie Ye, Yushuang Wu, Zizheng Yan, Chenming Zhu, Zhangyang Xiong, Tianyou Liang, Guanying Chen, Shuguang Cui, Xiaoguang Han
PDF
N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution Haram Choi, Jeongmin Lee, Jihoon Yang
PDF
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory Santhosh Kumar Ramakrishnan, Ziad Al-Halah, Kristen Grauman
PDF
NAR-Former: Neural Architecture Representation Learning Towards Holistic Attributes Prediction Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang
PDF
Natural Language-Assisted Sign Language Recognition Ronglai Zuo, Fangyun Wei, Brian Mak
PDF
NeAT: Learning Neural Implicit Surfaces with Arbitrary Topologies from Multi-View Images Xiaoxu Meng, Weikai Chen, Bo Yang
PDF
NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-View Images Yunfan Ye, Renjiao Yi, Zhirui Gao, Chenyang Zhu, Zhiping Cai, Kai Xu
PDF
NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination Haoqian Wu, Zhipeng Hu, Lincheng Li, Yongqiang Zhang, Changjie Fan, Xin Yu
PDF
Neighborhood Attention Transformer Ali Hassani, Steven Walton, Jiachen Li, Shen Li, Humphrey Shi
PDF
NeMo: Learning 3D Neural Motion Fields from Multiple Video Instances of the Same Action Kuan-Chieh Wang, Zhenzhen Weng, Maria Xenochristou, João Pedro Araújo, Jeffrey Gu, Karen Liu, Serena Yeung
PDF
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors Congyue Deng, Chiyu “Max” Jiang, Charles R. Qi, Xinchen Yan, Yin Zhou, Leonidas Guibas, Dragomir Anguelov
PDF
NeRF in the PaLM of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis Allan Zhou, Moo Jin Kim, Lirui Wang, Pete Florence, Chelsea Finn
PDF
NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects Zhiwen Yan, Chen Li, Gim Hee Lee
PDF
NeRF-RPN: A General Framework for Object Detection in NeRFs Benran Hu, Junkai Huang, Yichen Liu, Yu-Wing Tai, Chi-Keung Tang
PDF
NeRF-Supervised Deep Stereo Fabio Tosi, Alessio Tonioni, Daniele De Gregorio, Matteo Poggi
PDF
NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-Shot Real Image Animation Yu Yin, Kamran Ghasedi, HsiangTao Wu, Jiaolong Yang, Xin Tong, Yun Fu
PDF
Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision Xiaoshuai Zhang, Abhijit Kundu, Thomas Funkhouser, Leonidas Guibas, Hao Su, Kyle Genova
PDF
NeRFLight: Fast and Light Neural Radiance Fields Using a Shared Feature Grid Fernando Rivas-Manzaneque, Jorge Sierra-Acosta, Adrian Penate-Sanchez, Francesc Moreno-Noguer, Angela Ribeiro
PDF
NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer Kun Zhou, Wenbo Li, Yi Wang, Tao Hu, Nianjuan Jiang, Xiaoguang Han, Jiangbo Lu
PDF
NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds Chen Yang, Peihao Li, Zanwei Zhou, Shanxin Yuan, Bingbing Liu, Xiaokang Yang, Weichao Qiu, Wei Shen
PDF
NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud Xiangyu Zhu, Dong Du, Weikai Chen, Zhiyou Zhao, Yinyu Nie, Xiaoguang Han
PDF
Network Expansion for Practical Training Acceleration Ning Ding, Yehui Tang, Kai Han, Chao Xu, Yunhe Wang
PDF
Network-Free, Unsupervised Semantic Segmentation with Synthetic Images Qianli Feng, Raghudeep Gadde, Wentong Liao, Eduard Ramon, Aleix Martinez
PDF
NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction Bowen Cai, Jinchi Huang, Rongfei Jia, Chengfei Lv, Huan Fu
PDF
NeUDF: Leaning Neural Unsigned Distance Fields with Volume Rendering Yu-Tao Liu, Li Wang, Jie Yang, Weikai Chen, Xiaoxu Meng, Bo Yang, Lin Gao
PDF
NeuFace: Realistic 3D Neural Face Rendering from Multi-View Images Mingwu Zheng, Haiyu Zhang, Hongyu Yang, Di Huang
PDF
Neumann Network with Recursive Kernels for Single Image Defocus Deblurring Yuhui Quan, Zicong Wu, Hui Ji
PDF
NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization Shitao Tang, Sicong Tang, Andrea Tagliasacchi, Ping Tan, Yasutaka Furukawa
PDF
Neural Congealing: Aligning Images to a Joint Semantic Atlas Dolev Ofri-Amar, Michal Geyer, Yoni Kasten, Tali Dekel
PDF
Neural Dependencies Emerging from Learning Massive Categories Ruili Feng, Kecheng Zheng, Kai Zhu, Yujun Shen, Jian Zhao, Yukun Huang, Deli Zhao, Jingren Zhou, Michael Jordan, Zheng-Jun Zha
PDF
Neural Fields Meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes Zian Wang, Tianchang Shen, Jun Gao, Shengyu Huang, Jacob Munkberg, Jon Hasselgren, Zan Gojcic, Wenzheng Chen, Sanja Fidler
PDF
Neural Fourier Filter Bank Zhijie Wu, Yuhe Jin, Kwang Moo Yi
PDF
Neural Intrinsic Embedding for Non-Rigid Point Cloud Matching Puhua Jiang, Mingze Sun, Ruqi Huang
PDF
Neural Kaleidoscopic Space Sculpting Byeongjoo Ahn, Michael De Zeeuw, Ioannis Gkioulekas, Aswin C. Sankaranarayanan
PDF
Neural Kernel Surface Reconstruction Jiahui Huang, Zan Gojcic, Matan Atzmon, Or Litany, Sanja Fidler, Francis Williams
PDF
Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition Xinghan Wang, Xin Xu, Yadong Mu
PDF
Neural Lens Modeling Wenqi Xian, Aljaž Božič, Noah Snavely, Christoph Lassner
PDF
Neural mAP Prior for Autonomous Driving Xuan Xiong, Yicheng Liu, Tianyuan Yuan, Yue Wang, Yilun Wang, Hang Zhao
PDF
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans Aleksei Bokhovkin, Angela Dai
PDF
Neural Pixel Composition for 3D-4D View Synthesis from Multi-Views Aayush Bansal, Michael Zollhöfer
PDF
Neural Preset for Color Style Transfer Zhanghan Ke, Yuhao Liu, Lei Zhu, Nanxuan Zhao, Rynson W.H. Lau
PDF
Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN Models Nilesh Ahuja, Parual Datta, Bhavya Kanzariya, V. Srinivasa Somayazulu, Omesh Tickoo
PDF
Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos Liao Wang, Qiang Hu, Qihan He, Ziyu Wang, Jingyi Yu, Tinne Tuytelaars, Lan Xu, Minye Wu
PDF
Neural Scene Chronology Haotong Lin, Qianqian Wang, Ruojin Cai, Sida Peng, Hadar Averbuch-Elor, Xiaowei Zhou, Noah Snavely
PDF
Neural Texture Synthesis with Guided Correspondence Yang Zhou, Kaijian Chen, Rongjun Xiao, Hui Huang
PDF
Neural Transformation Fields for Arbitrary-Styled Font Generation Bin Fu, Junjun He, Jianjun Wang, Yu Qiao
PDF
Neural Vector Fields: Implicit Representation by Explicit Learning Xianghui Yang, Guosheng Lin, Zhenghao Chen, Luping Zhou
PDF
Neural Video Compression with Diverse Contexts Jiahao Li, Bin Li, Yan Lu
PDF
Neural Volumetric Memory for Visual Locomotion Control Ruihan Yang, Ge Yang, Xiaolong Wang
PDF
Neural Voting Field for Camera-Space 3D Hand Pose Estimation Lin Huang, Chung-Ching Lin, Kevin Lin, Lin Liang, Lijuan Wang, Junsong Yuan, Zicheng Liu
PDF
Neuralangelo: High-Fidelity Neural Surface Reconstruction Zhaoshuo Li, Thomas Müller, Alex Evans, Russell H. Taylor, Mathias Unberath, Ming-Yu Liu, Chen-Hsuan Lin
PDF
NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions Juze Zhang, Haimin Luo, Hongdi Yang, Xinru Xu, Qianyang Wu, Ye Shi, Jingyi Yu, Lan Xu, Jingya Wang
PDF
NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds Jun-Kun Chen, Jipeng Lyu, Yu-Xiong Wang
PDF
NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models Seung Wook Kim, Bradley Brown, Kangxue Yin, Karsten Kreis, Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler
PDF
Neuralizer: General Neuroimage Analysis Without Re-Training Steffen Czolbe, Adrian V. Dalca
PDF
NeuralLift-360: Lifting an In-the-Wild 2D Photo to a 3D Object with 360deg Views Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Yi Wang, Zhangyang Wang
PDF
NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation Zehan Zheng, Danni Wu, Ruisi Lu, Fan Lu, Guang Chen, Changjun Jiang
PDF
NeuralUDF: Learning Unsigned Distance Fields for Multi-View Reconstruction of Surfaces with Arbitrary Topologies Xiaoxiao Long, Cheng Lin, Lingjie Liu, Yuan Liu, Peng Wang, Christian Theobalt, Taku Komura, Wenping Wang
PDF
Neuro-Modulated Hebbian Learning for Fully Test-Time Adaptation Yushun Tang, Ce Zhang, Heng Xu, Shuoshuo Chen, Jie Cheng, Luziwei Leng, Qinghai Guo, Zhihai He
PDF
NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization Zhixiang Min, Bingbing Zhuang, Samuel Schulter, Buyu Liu, Enrique Dunn, Manmohan Chandraker
PDF
Neuron Structure Modeling for Generalizable Remote Physiological Measurement Hao Lu, Zitong Yu, Xuesong Niu, Ying-Cong Chen
PDF
NeuWigs: A Neural Dynamic Model for Volumetric Hair Capture and Animation Ziyan Wang, Giljoo Nam, Tuur Stuyck, Stephen Lombardi, Chen Cao, Jason Saragih, Michael Zollhöfer, Jessica Hodgins, Christoph Lassner
PDF
NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation Haoqian Wu, Keyu Chen, Haozhe Liu, Mingchen Zhuge, Bing Li, Ruizhi Qiao, Xiujun Shu, Bei Gan, Liangsheng Xu, Bo Ren, Mengmeng Xu, Wentian Zhang, Raghavendra Ramachandra, Chia-Wen Lin, Bernard Ghanem
PDF
Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars Jingxiang Sun, Xuan Wang, Lizhen Wang, Xiaoyu Li, Yong Zhang, Hongwen Zhang, Yebin Liu
PDF
NICO++: Towards Better Benchmarking for Domain Generalization Xingxuan Zhang, Yue He, Renzhe Xu, Han Yu, Zheyan Shen, Peng Cui
PDF
NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging Karim Guirguis, Johannes Meier, George Eskandar, Matthias Kayser, Bin Yang, Jürgen Beyerer
PDF
Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior Yuekun Dai, Yihang Luo, Shangchen Zhou, Chongyi Li, Chen Change Loy
PDF
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation Jiefeng Li, Siyuan Bian, Qi Liu, Jiasheng Tang, Fan Wang, Cewu Lu
PDF
NIPQ: Noise Proxy-Based Integrated Pseudo-Quantization Juncheol Shin, Junhyuk So, Sein Park, Seungyeop Kang, Sungjoo Yoo, Eunhyeok Park
PDF
NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-Wise Modeling Shishira R. Maiya, Sharath Girish, Max Ehrlich, Hanyu Wang, Kwot Sin Lee, Patrick Poirson, Pengxiang Wu, Chen Wang, Abhinav Shrivastava
PDF
NLOST: Non-Line-of-Sight Imaging with Transformer Yue Li, Jiayong Peng, Juntian Ye, Yueyi Zhang, Feihu Xu, Zhiwei Xiong
PDF
No One Left Behind: Improving the Worst Categories in Long-Tailed Learning Yingxiao Du, Jianxin Wu
PDF
Noisy Correspondence Learning with Meta Similarity Correction Haochen Han, Kaiyao Miao, Qinghua Zheng, Minnan Luo
PDF
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers Yijiang Liu, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang
PDF
NoisyTwins: Class-Consistent and Diverse Image Generation Through StyleGANs Harsh Rangwani, Lavish Bansal, Kartik Sharma, Tejan Karmali, Varun Jampani, R. Venkatesh Babu
PDF
Non-Contrastive Learning Meets Language-Image Pre-Training Jinghao Zhou, Li Dong, Zhe Gan, Lijuan Wang, Furu Wei
PDF
Non-Contrastive Unsupervised Learning of Physiological Signals from Video Jeremy Speth, Nathan Vance, Patrick Flynn, Adam Czajka
PDF
Non-Line-of-Sight Imaging with Signal Superresolution Network Jianyu Wang, Xintong Liu, Leping Xiao, Zuoqiang Shi, Lingyun Qiu, Xing Fu
PDF
NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior Wenjing Bian, Zirui Wang, Kejie Li, Jia-Wang Bian, Victor Adrian Prisacariu
PDF
Normal-Guided Garment UV Prediction for Human Re-Texturing Yasamin Jafarian, Tuanfeng Y. Wang, Duygu Ceylan, Jimei Yang, Nathan Carr, Yi Zhou, Hyun Soo Park
PDF
Normalizing Flow Based Feature Synthesis for Outlier-Aware Object Detection Nishant Kumar, Siniša Šegvić, Abouzar Eslami, Stefan Gumhold
PDF
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation Mengqi Huang, Zhendong Mao, Quan Wang, Yongdong Zhang
PDF
Novel Class Discovery for 3D Point Cloud Semantic Segmentation Luigi Riz, Cristiano Saltori, Elisa Ricci, Fabio Poiesi
PDF
Novel-View Acoustic Synthesis Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea Vedaldi
PDF
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations Joy Hsu, Jiayuan Mao, Jiajun Wu
PDF
NULL-Text Inversion for Editing Real Images Using Guided Diffusion Models Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or
PDF
NUWA-LIP: Language-Guided Image Inpainting with Defect-Free VQGAN Minheng Ni, Xiaoming Li, Wangmeng Zuo
PDF
NVTC: Nonlinear Vector Transform Coding Runsen Feng, Zongyu Guo, Weiping Li, Zhibo Chen
PDF
Objaverse: A Universe of Annotated 3D Objects Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel, Eli VanderBilt, Ludwig Schmidt, Kiana Ehsani, Aniruddha Kembhavi, Ali Farhadi
PDF
Object Detection with Self-Supervised Scene Adaptation Zekun Zhang, Minh Hoai
PDF
Object Discovery from Motion-Guided Tokens Zhipeng Bao, Pavel Tokmakov, Yu-Xiong Wang, Adrien Gaidon, Martial Hebert
PDF
Object Pop-up: Can We Infer 3D Objects and Their Poses from Human Interactions Alone? Ilya A. Petrov, Riccardo Marin, Julian Chibane, Gerard Pons-Moll
PDF
Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation Heng Yang, Marco Pavone
PDF
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu
PDF
Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States Heming Du, Lincheng Li, Zi Huang, Xin Yu
PDF
ObjectMatch: Robust Registration Using Canonical Object Correspondences Can Gümeli, Angela Dai, Matthias Nießner
PDF
ObjectStitch: Object Compositing with Diffusion Model Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga
PDF
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking Jinkun Cao, Jiangmiao Pang, Xinshuo Weng, Rawal Khirodkar, Kris Kitani
PDF
Occlusion-Free Scene Recovery via Neural Radiance Fields Chengxuan Zhu, Renjie Wan, Yunkai Tang, Boxin Shi
PDF
OCELOT: Overlapped Cell on Tissue Dataset for Histopathology Jeongun Ryu, Aaron Valero Puche, JaeWoong Shin, Seonwook Park, Biagio Brattoli, Jinhee Lee, Wonkyung Jung, Soo Ick Cho, Kyunghyun Paeng, Chan-Young Ock, Donggeun Yoo, Sérgio Pereira
PDF
OCTET: Object-Aware Counterfactual Explanations Mehdi Zemni, Mickaël Chen, Éloi Zablocki, Hédi Ben-Younes, Patrick Pérez, Matthieu Cord
PDF
OcTr: Octree-Based Transformer for 3D Object Detection Chao Zhou, Yanan Zhang, Jiaxin Chen, Di Huang
PDF
Octree Guided Unoriented Surface Reconstruction Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Stephen Gould
PDF
Omni Aggregation Networks for Lightweight Image Super-Resolution Hang Wang, Xuanhong Chen, Bingbing Ni, Yutian Liu, Jinfan Liu
PDF
Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild Garrick Brazil, Abhinav Kumar, Julian Straub, Nikhila Ravi, Justin Johnson, Georgia Gkioxari
PDF
OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization Ying Zhao
PDF
OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis Hongyi Xu, Guoxian Song, Zihang Jiang, Jianfeng Zhang, Yichun Shi, Jing Liu, Wanchun Ma, Jiashi Feng, Linjie Luo
PDF
OmniCity: Omnipotent City Understanding with Multi-Level and Multi-View Images Weijia Li, Yawen Lai, Linning Xu, Yuanbo Xiangli, Jinhua Yu, Conghui He, Gui-Song Xia, Dahua Lin
PDF
OmniMAE: Single Model Masked Pretraining on Images and Videos Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra
PDF
Omnimatte3D: Associating Objects and Their Effects in Unconstrained Monocular Video Mohammed Suhail, Erika Lu, Zhengqi Li, Noah Snavely, Leonid Sigal, Forrester Cole
PDF
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu
PDF
OmniVidar: Omnidirectional Depth Estimation from Multi-Fisheye Images Sheng Xie, Daochuan Wang, Yun-Hui Liu
PDF
On Calibrating Semantic Segmentation Models: Analyses and an Algorithm Dongdong Wang, Boqing Gong, Liqiang Wang
PDF
On Data Scaling in Masked Image Modeling Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu
PDF
On Distillation of Guided Diffusion Models Chenlin Meng, Robin Rombach, Ruiqi Gao, Diederik Kingma, Stefano Ermon, Jonathan Ho, Tim Salimans
PDF
On the Benefits of 3D Pose and Tracking for Human Action Recognition Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Christoph Feichtenhofer, Jitendra Malik
PDF
On the Convergence of IRLS and Its Variants in Outlier-Robust Estimation Liangzu Peng, Christian Kümmerle, René Vidal
PDF
On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer Zhenjie Yu, Shuang Li, Yirui Shen, Chi Harold Liu, Shuigen Wang
PDF
On the Effectiveness of Partial Variance Reduction in Federated Learning with Heterogeneous Data Bo Li, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich
PDF
On the Effects of Self-Supervision and Contrastive Alignment in Deep Multi-View Clustering Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael C. Kampffmeyer
PDF
On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks HyunJun Jung, Patrick Ruhkamp, Guangyao Zhai, Nikolas Brasch, Yitong Li, Yannick Verdie, Jifei Song, Yiren Zhou, Anil Armagan, Slobodan Ilic, Aleš Leonardis, Nassir Navab, Benjamin Busam
PDF
On the Pitfall of Mixup for Uncertainty Calibration Deng-Bao Wang, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Min-Ling Zhang
PDF
On the Stability-Plasticity Dilemma of Class-Incremental Learning Dongwan Kim, Bohyung Han
PDF
On-the-Fly Category Discovery Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy Hospedales, Yi-Zhe Song, Zhanyu Ma
PDF
One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field Weichuang Li, Longhao Zhang, Dong Wang, Bin Zhao, Zhigang Wang, Mulin Chen, Bang Zhang, Zhongjian Wang, Liefeng Bo, Xuelong Li
PDF
One-Shot Model for Mixed-Precision Quantization Ivan Koryakovskiy, Alexandra Yakovleva, Valentin Buchnev, Temur Isaev, Gleb Odinokikh
PDF
One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer Jing Lin, Ailing Zeng, Haoqian Wang, Lei Zhang, Yu Li
PDF
One-to-Few Label Assignment for End-to-End Dense Detection Shuai Li, Minghan Li, Ruihuang Li, Chenhang He, Lei Zhang
PDF
OneFormer: One Transformer to Rule Universal Image Segmentation Jitesh Jain, Jiachen Li, Mang Tik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi
PDF
OPE-SR: Orthogonal Position Encoding for Designing a Parameter-Free Upsampling Module in Arbitrary-Scale Image Super-Resolution Gaochao Song, Qian Sun, Luo Zhang, Ran Su, Jianfeng Shi, Ying He
PDF
Open Set Action Recognition via Multi-Label Evidential Learning Chen Zhao, Dawei Du, Anthony Hoogs, Christopher Funk
PDF
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H.S. Torr, Ser-Nam Lim
PDF
Open-Category Human-Object Interaction Pre-Training via Language Modeling Framework Sipeng Zheng, Boshen Xu, Qin Jin
PDF
Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator Shijie Wang, Jianlong Chang, Haojie Li, Zhihui Wang, Wanli Ouyang, Qi Tian
PDF
Open-Set Likelihood Maximization for Few-Shot Learning Malik Boudiaf, Etienne Bennequin, Myriam Tami, Antoine Toubhans, Pablo Piantanida, Celine Hudelot, Ismail Ben Ayed
PDF
Open-Set Representation Learning Through Combinatorial Embedding Geeho Kim, Junoh Kang, Bohyung Han
PDF
Open-Set Semantic Segmentation for Point Clouds via Adversarial Prototype Framework Jianan Li, Qiulei Dong
PDF
Open-Vocabulary Attribute Detection María A. Bravo, Sudhanshu Mittal, Simon Ging, Thomas Brox
PDF
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Jiarui Xu, Sifei Liu, Arash Vahdat, Wonmin Byeon, Xiaolong Wang, Shalini De Mello
PDF
Open-Vocabulary Point-Cloud Object Detection Without 3D Annotation Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang
PDF
Open-Vocabulary Semantic Segmentation with Mask-Adapted CLIP Feng Liang, Bichen Wu, Xiaoliang Dai, Kunpeng Li, Yinan Zhao, Hang Zhang, Peizhao Zhang, Peter Vajda, Diana Marculescu
PDF
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction Shaofei Cai, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
PDF
OpenGait: Revisiting Gait Recognition Towards Better Practicality Chao Fan, Junhao Liang, Chuanfu Shen, Saihui Hou, Yongzhen Huang, Shiqi Yu
PDF
OpenMix: Exploring Outlier Samples for Misclassification Detection Fei Zhu, Zhen Cheng, Xu-Yao Zhang, Cheng-Lin Liu
PDF
OpenScene: 3D Scene Understanding with Open Vocabularies Songyou Peng, Kyle Genova, Chiyu “Max” Jiang, Andrea Tagliasacchi, Marc Pollefeys, Thomas Funkhouser
PDF
Optimal Proposal Learning for Deployable End-to-End Pedestrian Detection Xiaolin Song, Binghui Chen, Pengyu Li, Jun-Yan He, Biao Wang, Yifeng Geng, Xuansong Xie, Honggang Zhang
PDF
Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting Wei Lin, Antoni B. Chan
PDF
Optimization-Inspired Cross-Attention Transformer for Compressive Sensing Jiechong Song, Chong Mou, Shiqi Wang, Siwei Ma, Jian Zhang
PDF
ORCa: Glossy Objects as Radiance-Field Cameras Kushagra Tiwary, Akshat Dave, Nikhil Behari, Tzofi Klinghoffer, Ashok Veeraraghavan, Ramesh Raskar
PDF
OReX: Object Reconstruction from Planar Cross-Sections Using Neural Fields Haim Sawdayee, Amir Vaxman, Amit H. Bermano
PDF
OrienterNet: Visual Localization in 2D Public Maps with Neural Matching Paul-Edouard Sarlin, Daniel DeTone, Tsun-Yi Yang, Armen Avetisyan, Julian Straub, Tomasz Malisiewicz, Samuel Rota Bulò, Richard Newcombe, Peter Kontschieder, Vasileios Balntas
PDF
Orthogonal Annotation Benefits Barely-Supervised Medical Image Segmentation Heng Cai, Shumeng Li, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao
PDF
OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain Adaptation Ye Liu, Lingfeng Qiao, Changchong Lu, Di Yin, Chen Lin, Haoyuan Peng, Bo Ren
PDF
OSRT: Omnidirectional Image Super-Resolution with Distortion-Aware Transformer Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong
PDF
OT-Filter: An Optimal Transport Filter for Learning with Noisy Labels Chuanwen Feng, Yilong Ren, Xike Xie
PDF
OTAvatar: One-Shot Talking Face Avatar with Controllable Tri-Plane Rendering Zhiyuan Ma, Xiangyu Zhu, Guo-Jun Qi, Zhen Lei, Lei Zhang
PDF
Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation Zesen Cheng, Pengchong Qiao, Kehan Li, Siheng Li, Pengxu Wei, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen
PDF
Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning Yu Wang, Pengchong Qiao, Chang Liu, Guoli Song, Xiawu Zheng, Jie Chen
PDF
OvarNet: Towards Open-Vocabulary Object Attribute Recognition Keyan Chen, Xiaolong Jiang, Yao Hu, Xu Tang, Yan Gao, Jianqi Chen, Weidi Xie
PDF
Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction Ziwei Yu, Chen Li, Linlin Yang, Xiaoxu Zheng, Michael Bi Mi, Gim Hee Lee, Angela Yao
PDF
Overlooked Factors in Concept-Based Explanations: Dataset Choice, Concept Learnability, and Human Capability Vikram V. Ramaswamy, Sunnie S. Y. Kim, Ruth Fong, Olga Russakovsky
PDF
OVTrack: Open-Vocabulary Multiple Object Tracking Siyuan Li, Tobias Fischer, Lei Ke, Henghui Ding, Martin Danelljan, Fisher Yu
PDF
PA&DA: Jointly Sampling Path and Data for Consistent NAS Shun Lu, Yu Hu, Longxing Yang, Zihao Sun, Jilin Mei, Jianchao Tan, Chengru Song
PDF
PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers Ryan Grainger, Thomas Paniagua, Xi Song, Naresh Cuntoor, Mun Wai Lee, Tianfu Wu
PDF
PACO: Parts and Attributes of Common Objects Vignesh Ramanathan, Anmol Kalia, Vladan Petrovic, Yi Wen, Baixue Zheng, Baishan Guo, Rui Wang, Aaron Marquez, Rama Kovvuri, Abhishek Kadian, Amir Mousavi, Yiwen Song, Abhimanyu Dubey, Dhruv Mahajan
PDF
Paint by Example: Exemplar-Based Image Editing with Diffusion Models Binxin Yang, Shuyang Gu, Bo Zhang, Ting Zhang, Xuejin Chen, Xiaoyan Sun, Dong Chen, Fang Wen
PDF
Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask Shangzhan Zhang, Sida Peng, Tianrun Chen, Linzhan Mou, Haotong Lin, Kaicheng Yu, Yiyi Liao, Xiaowei Zhou
PDF
Paired-Point Lifting for Enhanced Privacy-Preserving Visual Localization Chunghwan Lee, Jaihoon Kim, Chanhyuk Yun, Je Hyeong Hong
PDF
PaletteNeRF: Palette-Based Appearance Editing of Neural Radiance Fields Zhengfei Kuang, Fujun Luan, Sai Bi, Zhixin Shu, Gordon Wetzstein, Kalyan Sunkavalli
PDF
PanelNet: Understanding 360 Indoor Environment via Panel Representation Haozheng Yu, Lu He, Bing Jian, Weiwei Feng, Shan Liu
PDF
PAniC-3D: Stylized Single-View 3D Reconstruction from Portraits of Anime Characters Shuhong Chen, Kevin Zhang, Yichun Shi, Heng Wang, Yiheng Zhu, Guoxian Song, Sizhe An, Janus Kristjansson, Xiao Yang, Matthias Zwicker
PDF
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360deg Sizhe An, Hongyi Xu, Yichun Shi, Guoxian Song, Umit Y. Ogras, Linjie Luo
PDF
Panoptic Compositional Feature Field for Editable Scene Rendering with Network-Inferred Labels via Metric Learning Xinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang
PDF
Panoptic Lifting for 3D Scene Understanding with Neural Fields Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Bulò, Norman Müller, Matthias Nießner, Angela Dai, Peter Kontschieder
PDF
Panoptic Video Scene Graph Generation Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang, Chen Change Loy, Ziwei Liu
PDF
PanoSwin: A Pano-Style Swin Transformer for Panorama Understanding Zhixin Ling, Zhen Xing, Xiangdong Zhou, Manliang Cao, Guichun Zhou
PDF
Parallel Diffusion Models of Operator and Image for Blind Inverse Problems Hyungjin Chung, Jeongsol Kim, Sehui Kim, Jong Chul Ye
PDF
Parameter Efficient Local Implicit Image Function Network for Face Segmentation Mausoom Sarkar, Nikitha Sr, Mayur Hemani, Rishabh Jain, Balaji Krishnamurthy
PDF
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment Ricong Huang, Peiwen Lai, Yipeng Qin, Guanbin Li
PDF
PartDistillation: Learning Parts from Instance Segmentation Jang Hyun Cho, Philipp Krähenbühl, Vignesh Ramanathan
PDF
Partial Network Cloning Jingwen Ye, Songhua Liu, Xinchao Wang
PDF
PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations Haoran Geng, Ziming Li, Yiran Geng, Jiayi Chen, Hao Dong, He Wang
PDF
PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-Identification Minsu Kim, Seungryong Kim, Jungin Park, Seongheon Park, Kwanghoon Sohn
PDF
Parts2Words: Learning Joint Embedding of Point Clouds and Texts by Bidirectional Matching Between Parts and Words Chuan Tang, Xi Yang, Bojian Wu, Zhizhong Han, Yi Chang
PDF
PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models Minghua Liu, Yinhao Zhu, Hong Cai, Shizhong Han, Zhan Ling, Fatih Porikli, Hao Su
PDF
Passive Micron-Scale Time-of-Flight with Sunlight Interferometry Alankar Kotwal, Anat Levin, Ioannis Gkioulekas
PDF
Patch-Based 3D Natural Scene Generation from a Single Example Weiyu Li, Xuelin Chen, Jue Wang, Baoquan Chen
PDF
Patch-Craft Self-Supervised Training for Correlated Image Denoising Gregory Vaksman, Michael Elad
PDF
Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective Jinjing Zhu, Haotian Bai, Lin Wang
PDF
PATS: Patch Area Transportation with Subdivision for Local Feature Matching Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang
PDF
PC2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction Luke Melas-Kyriazi, Christian Rupprecht, Andrea Vedaldi
PDF
pCON: Polarimetric Coordinate Networks for Neural Scene Representations Henry Peters, Yunhao Ba, Achuta Kadambi
PDF
PCR: Proxy-Based Contrastive Replay for Online Class-Incremental Continual Learning Huiwei Lin, Baoquan Zhang, Shanshan Feng, Xutao Li, Yunming Ye
PDF
PCT-Net: Full Resolution Image Harmonization Using Pixel-Wise Color Transformations Julian Jorge Andrade Guerreiro, Mitsuru Nakazawa, Björn Stenger
PDF
PD-Quant: Post-Training Quantization Based on Prediction Difference Metric Jiawei Liu, Lin Niu, Zhihang Yuan, Dawei Yang, Xinggang Wang, Wenyu Liu
PDF
PDPP:Projected Diffusion for Procedure Planning in Instructional Videos Hanlin Wang, Yilu Wu, Sheng Guo, Limin Wang
PDF
PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation Liwen Zhang, Xinyan Zhang, Youcheng Zhang, Yufei Guo, Yuanpei Chen, Xuhui Huang, Zhe Ma
PDF
PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration Junle Yu, Luwei Ren, Yu Zhang, Wenhui Zhou, Lili Lin, Guojun Dai
PDF
PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training Qingjie Zeng, Yutong Xie, Zilin Lu, Yong Xia
PDF
Perception and Semantic Aware Regularization for Sequential Confidence Calibration Zhenghua Peng, Yu Luo, Tianshui Chen, Keke Xu, Shuangping Huang
PDF
Perception-Oriented Single Image Super-Resolution Using Optimal Objective Estimation Seung Ho Park, Young Su Moon, Nam Ik Cho
PDF
PermutoSDF: Fast Multi-View Reconstruction with Implicit Surfaces Using Permutohedral Lattices Radu Alexandru Rosu, Sven Behnke
PDF
Persistent Nature: A Generative Model of Unbounded 3D Worlds Lucy Chai, Richard Tucker, Zhengqi Li, Phillip Isola, Noah Snavely
PDF
Person Image Synthesis via Denoising Diffusion Model Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan
PDF
PersonNeRF: Personalized Reconstruction from Photo Collections Chung-Yi Weng, Pratul P. Srinivasan, Brian Curless, Ira Kemelmacher-Shlizerman
PDF
Perspective Fields for Single Image Camera Calibration Linyi Jin, Jianming Zhang, Yannick Hold-Geoffroy, Oliver Wang, Kevin Blackburn-Matzen, Matthew Sticha, David F. Fouhey
PDF
PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces Yiqun Wang, Ivan Skorokhodov, Peter Wonka
PDF
PHA: Patch-Wise High-Frequency Augmentation for Transformer-Based Person Re-Identification Guiwei Zhang, Yongfei Zhang, Tianyu Zhang, Bo Li, Shiliang Pu
PDF
Phase-Shifting Coder: Predicting Accurate Orientation in Oriented Object Detection Yi Yu, Feipeng Da
PDF
Phone2Proc: Bringing Robust Robots into Our Chaotic World Matt Deitke, Rose Hendrix, Ali Farhadi, Kiana Ehsani, Aniruddha Kembhavi
PDF
Photo Pre-Training, but for Sketch Ke Li, Kaiyue Pang, Yi-Zhe Song
PDF
Physical-World Optical Adversarial Attacks on 3D Face Recognition Yanjie Li, Yiquan Li, Xuelong Dai, Songtao Guo, Bin Xiao
PDF
Physically Adversarial Infrared Patches with Learnable Shapes and Locations Xingxing Wei, Jie Yu, Yao Huang
PDF
Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling Zhanhao Hu, Wenda Chu, Xiaopei Zhu, Hui Zhang, Bo Zhang, Xiaolin Hu
PDF
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos Kun Su, Kaizhi Qian, Eli Shlizerman, Antonio Torralba, Chuang Gan
PDF
Physics-Guided ISO-Dependent Sensor Noise Modeling for Extreme Low-Light Photography Yue Cao, Ming Liu, Shuai Liu, Xiaotao Wang, Lei Lei, Wangmeng Zuo
PDF
Pic2Word: Mapping Pictures to Words for Zero-Shot Composed Image Retrieval Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister
PDF
Picture That Sketch: Photorealistic Image Generation from Abstract Sketches Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song
PDF
PIDNet: A Real-Time Semantic Segmentation Network Inspired by PID Controllers Jiacong Xu, Zixiang Xiong, Shankar P. Bhattacharyya
PDF
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds Jinyu Li, Chenxu Luo, Xiaodong Yang
PDF
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang
PDF
PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification Meike Nauta, Jörg Schlötterer, Maurice van Keulen, Christin Seifert
PDF
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav Ram Ramrakhya, Dhruv Batra, Erik Wijmans, Abhishek Das
PDF
PIVOT: Prompting for Video Continual Learning Andrés Villa, Juan León Alcázar, Motasem Alfarra, Kumail Alhamoud, Julio Hurtado, Fabian Caba Heilbron, Alvaro Soto, Bernard Ghanem
PDF
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization Mamshad Nayeem Rizve, Gaurav Mittal, Ye Yu, Matthew Hall, Sandra Sajeev, Mubarak Shah, Mei Chen
PDF
Pix2map: Cross-Modal Retrieval for Inferring Street Maps from Images Xindi Wu, KwunFung Lau, Francesco Ferroni, Aljoša Ošep, Deva Ramanan
PDF
Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection Yi Wang, Ruili Wang, Xin Fan, Tianzhu Wang, Xiangjian He
PDF
PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing Yichen Sheng, Jianming Zhang, Julien Philip, Yannick Hold-Geoffroy, Xin Sun, He Zhang, Lu Ling, Bedrich Benes
PDF
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi
PDF
PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes Ruoyu Wang, Zehao Yu, Shenghua Gao
PDF
Planning-Oriented Autonomous Driving Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li
PDF
Plateau-Reduced Differentiable Path Tracing Michael Fischer, Tobias Ritschel
PDF
PlenVDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering Han Yan, Celong Liu, Chao Ma, Xing Mei
PDF
PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation Karthik Shetty, Annette Birkhold, Srikrishna Jaganathan, Norbert Strobel, Markus Kowarschik, Andreas Maier, Bernhard Egger
PDF
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation Narek Tumanyan, Michal Geyer, Shai Bagon, Tali Dekel
PDF
PMatch: Paired Masked Image Modeling for Dense Geometric Matching Shengjie Zhu, Xiaoming Liu
PDF
PMR: Prototypical Modal Rebalance for Multimodal Learning Yunfeng Fan, Wenchao Xu, Haozhao Wang, Junxiao Wang, Song Guo
PDF
POEM: Reconstructing Hand in a Point Embedded Multi-View Stereo Lixin Yang, Jian Xu, Licheng Zhong, Xinyu Zhan, Zhicheng Wang, Kejian Wu, Cewu Lu
PDF
Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting Tarasha Khurana, Peiyun Hu, David Held, Deva Ramanan
PDF
Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields Tao Hu, Xiaogang Xu, Shu Liu, Jiaya Jia
PDF
PointAvatar: Deformable Point-Based Head Avatars from Videos Yufeng Zheng, Wang Yifan, Gordon Wetzstein, Michael J. Black, Otmar Hilliges
PDF
PointCert: Point Cloud Classification with Deterministic Certified Robustness Guarantees Jinghuai Zhang, Jinyuan Jia, Hongbin Liu, Neil Zhenqiang Gong
PDF
PointClustering: Unsupervised Point Cloud Pre-Training Using Transformation Invariance in Clustering Fuchen Long, Ting Yao, Zhaofan Qiu, Lusong Li, Tao Mei
PDF
PointCMP: Contrastive Mask Prediction for Self-Supervised Learning on Point Cloud Videos Zhiqiang Shen, Xiaoxiao Sheng, Longguang Wang, Yulan Guo, Qiong Liu, Xi Zhou
PDF
PointConvFormer: Revenge of the Point-Based Convolution Wenxuan Wu, Li Fuxin, Qi Shan
PDF
PointDistiller: Structured Knowledge Distillation Towards Efficient and Compact 3D Detection Linfeng Zhang, Runpei Dong, Hung-Shuo Tai, Kaisheng Ma
PDF
Pointersect: Neural Rendering with Cloud-Ray Intersection Jen-Hao Rick Chang, Wei-Yu Chen, Anurag Ranjan, Kwang Moo Yi, Oncel Tuzel
PDF
PointListNet: Deep Learning on 3D Point Lists Hehe Fan, Linchao Zhu, Yi Yang, Mohan Kankanhalli
PDF
PointVector: A Vector Representation in Point Cloud Analysis Xin Deng, WenYu Zhang, Qing Ding, XinMing Zhang
PDF
Polarimetric iToF: Measuring High-Fidelity Depth Through Scattering Media Daniel S. Jeon, Andréas Meuleman, Seung-Hwan Baek, Min H. Kim
PDF
Polarized Color Image Denoising Zhuoxiao Li, Haiyang Jiang, Mingdeng Cao, Yinqiang Zheng
PDF
Policy Adaptation from Foundation Model Feedback Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo, Xiaolong Wang
PDF
Poly-PC: A Polyhedral Network for Multiple Point Cloud Tasks at Once Tao Xie, Shiguang Wang, Ke Wang, Linqi Yang, Zhiqiang Jiang, Xingcheng Zhang, Kun Dai, Ruifeng Li, Jian Cheng
PDF
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha
PDF
Polynomial Implicit Neural Representations for Large Diverse Datasets Rajhans Singh, Ankita Shukla, Pavan Turaga
PDF
Pose Synchronization Under Multiple Pair-Wise Relative Poses Yifan Sun, Qixing Huang
PDF
Pose-Disentangled Contrastive Learning for Self-Supervised Facial Representation Yuanyuan Liu, Wenbin Wang, Yibing Zhan, Shaoze Feng, Kejun Liu, Zhe Chen
PDF
PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation Qihao Liu, Adam Kortylewski, Alan L. Yuille
PDF
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation Qitao Zhao, Ce Zheng, Mengyuan Liu, Pichao Wang, Chen Chen
PDF
Position-Guided Text Prompt for Vision-Language Pre-Training Jinpeng Wang, Pan Zhou, Mike Zheng Shou, Shuicheng Yan
PDF
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation Sara Sarto, Manuele Barraco, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
PDF
Post-Processing Temporal Action Detection Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang
PDF
Post-Training Quantization on Diffusion Models Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan
PDF
PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout Hsiao Yuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang
PDF
POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery Ce Zheng, Xianpeng Liu, Guo-Jun Qi, Chen Chen
PDF
Power Bundle Adjustment for Large-Scale 3D Reconstruction Simon Weber, Nikolaus Demmel, Tin Chon Chan, Daniel Cremers
PDF
Practical Network Acceleration with Tiny Sets Guo-Hua Wang, Jianxin Wu
PDF
Prefix Conditioning Unifies Language and Label Supervision Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister
PDF
PREIM3D: 3D Consistent Precise Image Attribute Editing from a Single Image Jianhui Li, Jianmin Li, Haoji Zhang, Shilong Liu, Zhengyi Wang, Zihao Xiao, Kaiwen Zheng, Jun Zhu
PDF
Preserving Linear Separability in Continual Learning by Backward Feature Projection Qiao Gu, Dongsub Shim, Florian Shkurti
PDF
Primitive Generation and Semantic-Related Alignment for Universal Zero-Shot Segmentation Shuting He, Henghui Ding, Wei Jiang
PDF
Principles of Forgetting in Domain-Incremental Semantic Segmentation in Adverse Weather Conditions Tobias Kalb, Jürgen Beyerer
PDF
PRISE: Demystifying Deep Lucas-Kanade with Strongly Star-Convex Constraints for Multimodel Image Alignment Yiqing Zhang, Xinming Huang, Ziming Zhang
PDF
Privacy-Preserving Adversarial Facial Features Zhibo Wang, He Wang, Shuaifan Jin, Wenwen Zhang, Jiahui Hu, Yan Wang, Peng Sun, Wei Yuan, Kaixin Liu, Kui Ren
PDF
Privacy-Preserving Representations Are Not Enough: Recovering Scene Content from Camera Poses Kunal Chelani, Torsten Sattler, Fredrik Kahl, Zuzana Kukelova
PDF
Private Image Generation with Dual-Purpose Auxiliary Classifier Chen Chen, Daochang Liu, Siqi Ma, Surya Nepal, Chang Xu
PDF
PROB: Probabilistic Objectness for Open World Object Detection Orr Zohar, Kuan-Chieh Wang, Serena Yeung
PDF
Probabilistic Debiasing of Scene Graphs Bashirul Azam Biswas, Qiang Ji
PDF
Probabilistic Knowledge Distillation of Face Ensembles Jianqing Xu, Shen Li, Ailin Deng, Miao Xiong, Jiaying Wu, Jiaxiang Wu, Shouhong Ding, Bryan Hooi
PDF
Probabilistic Prompt Learning for Dense Prediction Hyeongjun Kwon, Taeyong Song, Somi Jeong, Jin Kim, Jinhyun Jang, Kwanghoon Sohn
PDF
Probability-Based Global Cross-Modal Upsampling for Pansharpening Zeyu Zhu, Xiangyong Cao, Man Zhou, Junhao Huang, Deyu Meng
PDF
Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks Markus Frey, Christian F. Doeller, Caswell Barry
PDF
Probing Sentiment-Oriented Pre-Training Inspired by Human Sentiment Perception Mechanism Tinglei Feng, Jiaxuan Liu, Jufeng Yang
PDF
Procedure-Aware Pretraining for Instructional Video Understanding Honglu Zhou, Roberto Martín-Martín, Mubbasir Kapadia, Silvio Savarese, Juan Carlos Niebles
PDF
ProD: Prompting-to-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification Tianyi Ma, Yifan Sun, Zongxin Yang, Yi Yang
PDF
Progressive Backdoor Erasing via Connecting Backdoor and Adversarial Attacks Bingxu Mu, Zhenxing Niu, Le Wang, Xue Wang, Qiguang Miao, Rong Jin, Gang Hua
PDF
Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis Duomin Wang, Yu Deng, Zixin Yin, Heung-Yeung Shum, Baoyuan Wang
PDF
Progressive Neighbor Consistency Mining for Correspondence Pruning Xin Liu, Jufeng Yang
PDF
Progressive Open Space Expansion for Open-Set Model Attribution Tianyun Yang, Danding Wang, Fan Tang, Xinying Zhao, Juan Cao, Sheng Tang
PDF
Progressive Random Convolutions for Single Domain Generalization Seokeon Choi, Debasmit Das, Sungha Choi, Seunghan Yang, Hyunsin Park, Sungrack Yun
PDF
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning Man Liu, Feng Li, Chunjie Zhang, Yunchao Wei, Huihui Bai, Yao Zhao
PDF
Progressive Spatio-Temporal Alignment for Efficient Event-Based Motion Estimation Xueyan Huang, Yueyi Zhang, Zhiwei Xiong
PDF
Progressive Transformation Learning for Leveraging Virtual Images in Training Yi-Ting Shen, Hyungtae Lee, Heesung Kwon, Shuvra S. Bhattacharyya
PDF
Progressively Optimized Local Radiance Fields for Robust View Synthesis Andréas Meuleman, Yu-Lun Liu, Chen Gao, Jia-Bin Huang, Changil Kim, Min H. Kim, Johannes Kopf
PDF
Promoting Semantic Connectivity: Dual Nearest Neighbors Contrastive Learning for Unsupervised Domain Generalization Yuchen Liu, Yaoming Wang, Yabo Chen, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong
PDF
Prompt-Guided Zero-Shot Anomaly Action Recognition Using Pretrained Deep Skeleton Features Fumiaki Sato, Ryo Hachiuma, Taiki Sekii
PDF
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li
PDF
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery Sheng Zhang, Salman Khan, Zhiqiang Shen, Muzammal Naseer, Guangyi Chen, Fahad Shahbaz Khan
PDF
Prompting Large Language Models with Answer Heuristics for Knowledge-Based Visual Question Answering Zhenwei Shao, Zhou Yu, Meng Wang, Jun Yu
PDF
Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking Yihao Wang, Zhigang Wang, Bin Zhao, Dong Wang, Mulin Chen, Xuelong Li
PDF
ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals Xishun Wang, Tong Su, Fang Da, Xiaodong Yang
PDF
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization Huan Ren, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang
PDF
ProTeGe: Untrimmed Pretraining for Video Temporal Grounding by Video Temporal Grounding Lan Wang, Gaurav Mittal, Sandra Sajeev, Ye Yu, Matthew Hall, Vishnu Naresh Boddeti, Mei Chen
PDF
ProtoCon: Pseudo-Label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-Supervised Learning Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Gholamreza Haffari
PDF
Prototype-Based Embedding Network for Scene Graph Generation Chaofan Zheng, Xinyu Lyu, Lianli Gao, Bo Dai, Jingkuan Song
PDF
Prototypical Residual Networks for Anomaly Detection and Localization Hui Zhang, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang
PDF
Proximal Splitting Adversarial Attack for Semantic Segmentation Jérôme Rony, Jean-Christophe Pesquet, Ismail Ben Ayed
PDF
ProxyFormer: Proxy Alignment Assisted Point Cloud Completion with Missing Part Sensitive Transformer Shanshan Li, Pan Gao, Xiaoyang Tan, Mingqiang Wei
PDF
Pruning Parameterization with Bi-Level Optimization for Efficient Semantic Segmentation on the Edge Changdi Yang, Pu Zhao, Yanyu Li, Wei Niu, Jiexiong Guan, Hao Tang, Minghai Qin, Bin Ren, Xue Lin, Yanzhi Wang
PDF
Pseudo-Label Guided Contrastive Learning for Semi-Supervised Medical Image Segmentation Hritam Basak, Zhaozheng Yin
PDF
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation with Progressive Video Transformers Zhongwei Qiu, Qiansheng Yang, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang
PDF
Putting People in Their Place: Affordance-Aware Human Insertion into Scenes Sumith Kulal, Tim Brooks, Alex Aiken, Jiajun Wu, Jimei Yang, Jingwan Lu, Alexei A. Efros, Krishna Kumar Singh
PDF
PVO: Panoptic Visual Odometry Weicai Ye, Xinyue Lan, Shuo Chen, Yuhang Ming, Xingyuan Yu, Hujun Bao, Zhaopeng Cui, Guofeng Zhang
PDF
PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer Honghui Yang, Wenxiao Wang, Minghao Chen, Binbin Lin, Tong He, Hua Chen, Xiaofei He, Wanli Ouyang
PDF
PyPose: A Library for Robot Learning with Physics-Based Optimization Chen Wang, Dasong Gao, Kuan Xu, Junyi Geng, Yaoyu Hu, Yuheng Qiu, Bowen Li, Fan Yang, Brady Moon, Abhinav Pandey, Aryan, Jiahe Xu, Tianhao Wu, Haonan He, Daning Huang, Zhongqiang Ren, Shibo Zhao, Taimeng Fu, Pranay Reddy, Xiao Lin, Wenshan Wang, Jingnan Shi, Rajat Talak, Kun Cao, Yi Du, Han Wang, Huai Yu, Shanzhao Wang, Siyu Chen, Ananth Kashyap, Rohan Bandaru, Karthik Dantu, Jiajun Wu, Lihua Xie, Luca Carlone, Marco Hutter, Sebastian Scherer
PDF
PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow Jiarui Lei, Xiaobo Hu, Yue Wang, Dong Liu
PDF
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lü, Baochang Zhang
PDF
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? a: Self-Train on Unlabeled Images! Zaid Khan, Vijay Kumar Bg, Samuel Schulter, Xiang Yu, Yun Fu, Manmohan Chandraker
PDF
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation Sicheng Yang, Zhiyong Wu, Minglei Li, Zhensong Zhang, Lei Hao, Weihong Bao, Haolin Zhuang
PDF
Quality-Aware Pre-Trained Models for Blind Image Quality Assessment Kai Zhao, Kun Yuan, Ming Sun, Mading Li, Xing Wen
PDF
QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister
PDF
Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis Hoseok Do, EunKyung Yoo, Taehyeong Kim, Chul Lee, Jin Young Choi
PDF
Quantum Multi-Model Fitting Matteo Farina, Luca Magri, Willi Menapace, Elisa Ricci, Vladislav Golyanik, Federica Arrigoni
PDF
Quantum-Inspired Spectral-Spatial Pyramid Network for Hyperspectral Image Classification Jie Zhang, Yongshan Zhang, Yicong Zhou
PDF
Query-Centric Trajectory Prediction Zikang Zhou, Jianping Wang, Yung-Hui Li, Yu-Kai Huang
PDF
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection WonJun Moon, Sangeek Hyun, SangUk Park, Dongchan Park, Jae-Pil Heo
PDF
R2Former: Unified Retrieval and Reranking Transformer for Place Recognition Sijie Zhu, Linjie Yang, Chen Chen, Mubarak Shah, Xiaohui Shen, Heng Wang
PDF
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training Chen-Wei Xie, Siyang Sun, Xiong Xiong, Yun Zheng, Deli Zhao, Jingren Zhou
PDF
RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-Consistent Dataset Zhongjin Luo, Shengcai Cai, Jinguo Dong, Ruibo Ming, Liangdong Qiu, Xiaohang Zhan, Xiaoguang Han
PDF
Randomized Adversarial Training via Taylor Expansion Gaojie Jin, Xinping Yi, Dengyu Wu, Ronghui Mu, Xiaowei Huang
PDF
Range-Nullspace Video Frame Interpolation with Focalized Motion Estimation Zhiyang Yu, Yu Zhang, Dongqing Zou, Xijun Chen, Jimmy S. Ren, Shunqing Ren
PDF
RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving Angelika Ando, Spyros Gidaris, Andrei Bursuc, Gilles Puy, Alexandre Boulch, Renaud Marlet
PDF
Ranking Regularization for Critical Rare Classes: Minimizing False Positives at a High True Positive Rate Kiarash Mohammadi, He Zhao, Mengyao Zhai, Frederick Tung
PDF
RankMix: Data Augmentation for Weakly Supervised Learning of Classifying Whole Slide Images with Diverse Sizes and Imbalanced Categories Yuan-Chih Chen, Chun-Shien Lu
PDF
Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks Tong Bu, Jianhao Ding, Zecheng Hao, Zhaofei Yu
PDF
Raw Image Reconstruction with Learned Compact Metadata Yufei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
PDF
Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments Masakazu Yoshimura, Junji Otsuka, Atsushi Irie, Takeshi Ohashi
PDF
Re-Basin via Implicit Sinkhorn Differentiation Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli
PDF
Re-GAN: Data-Efficient GANs Training via Architectural Reconfiguration Divya Saxena, Jiannong Cao, Jiahao Xu, Tarun Kulshrestha
PDF
Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild Avinab Saha, Sandeep Mishra, Alan C. Bovik
PDF
Re-Thinking Federated Active Learning Based on Inter-Class Diversity SangMook Kim, Sangmin Bae, Hwanjun Song, Se-Young Yun
PDF
Re-Thinking Model Inversion Attacks Against Deep Neural Networks Ngoc-Bao Nguyen, Keshigeyan Chandrasegaran, Milad Abdollahzadeh, Ngai-Man Cheung
PDF
Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization Chen Zhao, Shuming Liu, Karttikeya Mangalam, Bernard Ghanem
PDF
Real-Time 6k Image Rescaling with Rate-Distortion Optimization Chenyang Qi, Xin Yang, Ka Leong Cheng, Ying-Cong Chen, Qifeng Chen
PDF
Real-Time Controllable Denoising for Image and Video Zhaoyang Zhang, Yitong Jiang, Wenqi Shao, Xiaogang Wang, Ping Luo, Kaimo Lin, Jinwei Gu
PDF
Real-Time Evaluation in Online Continual Learning: A New Hope Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip H.S. Torr, Bernard Ghanem
PDF
Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video Wenzheng Zeng, Yang Xiao, Sicheng Wei, Jinfang Gan, Xintao Zhang, Zhiguo Cao, Zhiwen Fang, Joey Tianyi Zhou
PDF
Real-Time Neural Light Field on Mobile Devices Junli Cao, Huan Wang, Pavlo Chemerys, Vladislav Shakhrai, Ju Hu, Yun Fu, Denys Makoviichuk, Sergey Tulyakov, Jian Ren
PDF
RealFusion: 360deg Reconstruction of Any Object from a Single Image Luke Melas-Kyriazi, Iro Laina, Christian Rupprecht, Andrea Vedaldi
PDF
RealImpact: A Dataset of Impact Sound Fields for Real Objects Samuel Clarke, Ruohan Gao, Mason Wang, Mark Rau, Julia Xu, Jui-Hsien Wang, Doug L. James, Jiajun Wu
PDF
Realistic Saliency Guided Image Enhancement S. Mahdi H. Miangoleh, Zoya Bylinskii, Eric Kee, Eli Shechtman, Yağiz Aksoy
PDF
ReasonNet: End-to-End Driving with Temporal and Global Reasoning Hao Shao, Letian Wang, Ruobing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu
PDF
Rebalancing Batch Normalization for Exemplar-Based Class-Incremental Learning Sungmin Cha, Sungjun Cho, Dasol Hwang, Sunwon Hong, Moontae Lee, Taesup Moon
PDF
REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos Lingteng Qiu, Guanying Chen, Jiapeng Zhou, Mutian Xu, Junle Wang, Xiaoguang Han
PDF
ReCo: Region-Controlled Text-to-Image Generation Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang
PDF
Recognizability Embedding Enhancement for Very Low-Resolution Face Recognition and Quality Estimation Jacky Chen Long Chai, Tiong-Sik Ng, Cheng-Yaw Low, Jaewoo Park, Andrew Beng Jin Teoh
PDF
Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants with No False Negatives and No False Positives Daniel Widdowson, Vitaliy Kurlin
PDF
Reconstructing Animatable Categories from Videos Gengshan Yang, Chaoyang Wang, N. Dinesh Reddy, Deva Ramanan
PDF
Reconstructing Signing Avatars from Video Using Linguistic Priors Maria-Paola Forte, Peter Kulits, Chun-Hao P. Huang, Vasileios Choutas, Dimitrios Tzionas, Katherine J. Kuchenbecker, Michael J. Black
PDF
Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding Yeonguk Oh, JoonKyu Park, Jaeha Kim, Gyeongsik Moon, Kyoung Mu Lee
PDF
Recurrence Without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models Paul Micaelli, Arash Vahdat, Hongxu Yin, Jan Kautz, Pavlo Molchanov
PDF
Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer Si-Yuan Cao, Runmin Zhang, Lun Luo, Beinan Yu, Zehua Sheng, Junwei Li, Hui-Liang Shen
PDF
Recurrent Vision Transformers for Object Detection with Event Cameras Mathias Gehrig, Davide Scaramuzza
PDF
ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection Shiwei Jin, Zhen Wang, Lei Wang, Ning Bi, Truong Nguyen
PDF
Reducing the Label Bias for Timestamp Supervised Temporal Action Segmentation Kaiyuan Liu, Yunheng Li, Shenglan Liu, Chenwei Tan, Zihang Shao
PDF
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization Yuechen Zhang, Zexin He, Jinbo Xing, Xufeng Yao, Jiaya Jia
PDF
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension Lei Jin, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji
PDF
Referring Image Matting Jizhizi Li, Jing Zhang, Dacheng Tao
PDF
Referring Multi-Object Tracking Dongming Wu, Wencheng Han, Tiancai Wang, Xingping Dong, Xiangyu Zhang, Jianbing Shen
PDF
RefSR-NeRF: Towards High Fidelity and Super Resolution View Synthesis Xudong Huang, Wei Li, Jie Hu, Hanting Chen, Yunhe Wang
PDF
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension Jiamu Sun, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji
PDF
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers Dahun Kim, Anelia Angelova, Weicheng Kuo
PDF
Regularization of Polynomial Networks for Image Recognition Grigorios G. Chrysos, Bohan Wang, Jiankang Deng, Volkan Cevher
PDF
Regularize Implicit Neural Representation by Itself Zhemin Li, Hongxia Wang, Deyu Meng
PDF
Regularized Vector Quantization for Tokenized Image Synthesis Jiahui Zhang, Fangneng Zhan, Christian Theobalt, Shijian Lu
PDF
Regularizing Second-Order Influences for Continual Learning Zhicheng Sun, Yadong Mu, Gang Hua
PDF
Reinforcement Learning-Based Black-Box Model Inversion Attacks Gyojin Han, Jaehyun Choi, Haeil Lee, Junmo Kim
PDF
Relational Context Learning for Human-Object Interaction Detection Sanghyun Kim, Deunsol Jung, Minsu Cho
PDF
Relational Space-Time Query in Long-Form Videos Xitong Yang, Fu-Jen Chu, Matt Feiszli, Raghav Goyal, Lorenzo Torresani, Du Tran
PDF
Reliability in Semantic Segmentation: Are We on the Right Track? Pau de Jorge, Riccardo Volpi, Philip H.S. Torr, Grégory Rogez
PDF
Reliable and Interpretable Personalized Federated Learning Zixuan Qin, Liu Yang, Qilong Wang, Yahong Han, Qinghua Hu
PDF
ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects Marco Toschi, Riccardo De Matteo, Riccardo Spezialetti, Daniele De Gregorio, Luigi Di Stefano, Samuele Salti
PDF
Relightable Neural Human Assets from Multi-View Gradient Illuminations Taotao Zhou, Kai He, Di Wu, Teng Xu, Qixuan Zhang, Kuixiang Shao, Wenzheng Chen, Lan Xu, Jingyi Yu
PDF
RelightableHands: Efficient Neural Relighting of Articulated Hand Models Shun Iwase, Shunsuke Saito, Tomas Simon, Stephen Lombardi, Timur Bagautdinov, Rohan Joshi, Fabian Prada, Takaaki Shiratori, Yaser Sheikh, Jason Saragih
PDF
Removing Objects from Neural Radiance Fields Silvan Weder, Guillermo Garcia-Hernando, Áron Monszpart, Marc Pollefeys, Gabriel J. Brostow, Michael Firman, Sara Vicente
PDF
Renderable Neural Radiance mAP for Visual Navigation Obin Kwon, Jeongho Park, Songhwai Oh
PDF
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation Titas Anciukevičius, Zexiang Xu, Matthew Fisher, Paul Henderson, Hakan Bilen, Niloy J. Mitra, Paul Guerrero
PDF
RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure Prediction Donghao Zhou, Chunbin Gu, Junde Xu, Furui Liu, Qiong Wang, Guangyong Chen, Pheng-Ann Heng
PDF
Representation Learning for Visual Object Tracking by Masked Appearance Transfer Haojie Zhao, Dong Wang, Huchuan Lu
PDF
Representing Volumetric Videos as Dynamic MLP Maps Sida Peng, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou
PDF
Reproducible Scaling Laws for Contrastive Language-Image Learning Mehdi Cherti, Romain Beaumont, Ross Wightman, Mitchell Wortsman, Gabriel Ilharco, Cade Gordon, Christoph Schuhmann, Ludwig Schmidt, Jenia Jitsev
PDF
ResFormer: Scaling ViTs with Multi-Resolution Training Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang
PDF
Residual Degradation Learning Unfolding Framework with Mixing Priors Across Spectral and Spatial for Compressive Spectral Imaging Yubo Dong, Dahua Gao, Tian Qiu, Yuyan Li, Minxi Yang, Guangming Shi
PDF
Resource-Efficient RGBD Aerial Tracking Jinyu Yang, Shang Gao, Zhe Li, Feng Zheng, Aleš Leonardis
PDF
Restoration of Hand-Drawn Architectural Drawings Using Latent Space Mapping with Degradation Generator Nakkwan Choi, Seungjae Lee, Yongsik Lee, Seungjoon Yang
PDF
Rethinking Domain Generalization for Face Anti-Spoofing: Separability and Alignment Yiyou Sun, Yaojie Liu, Xiaoming Liu, Yixuan Li, Wen-Sheng Chu
PDF
Rethinking Feature-Based Knowledge Distillation for Face Recognition Jingzhi Li, Zidong Guo, Hui Li, Seungju Han, Ji-won Baek, Min Yang, Ran Yang, Sungjoo Suh
PDF
Rethinking Federated Learning with Domain Shift: A Prototype View Wenke Huang, Mang Ye, Zekun Shi, He Li, Bo Du
PDF
Rethinking Few-Shot Medical Segmentation: A Vector Quantization View Shiqi Huang, Tingfa Xu, Ning Shen, Feng Mu, Jianan Li
PDF
Rethinking Gradient Projection Continual Learning: Stability / Plasticity Feature Space Decoupling Zhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie, Lizhuang Ma
PDF
Rethinking Image Super Resolution from Long-Tailed Distribution Learning Perspective Yuanbiao Gou, Peng Hu, Jiancheng Lv, Hongyuan Zhu, Xi Peng
PDF
Rethinking Optical Flow from Geometric Matching Consistent Perspective Qiaole Dong, Chenjie Cao, Yanwei Fu
PDF
Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need Jingyao Li, Pengguang Chen, Zexin He, Shaozuo Yu, Shu Liu, Jiaya Jia
PDF
Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, Shiliang Pu
PDF
Rethinking the Correlation in Few-Shot Segmentation: A Buoys View Yuan Wang, Rui Sun, Tianzhu Zhang
PDF
Rethinking the Learning Paradigm for Dynamic Facial Expression Recognition Hanyang Wang, Bo Li, Shuang Wu, Siyuan Shen, Feng Liu, Shouhong Ding, Aimin Zhou
PDF
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning Aj Piergiovanni, Weicheng Kuo, Anelia Angelova
PDF
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi
PDF
Revealing the Dark Secrets of Masked Image Modeling Zhenda Xie, Zigang Geng, Jingcheng Hu, Zheng Zhang, Han Hu, Yue Cao
PDF
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi
PDF
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens Yuxiao Chen, Jianbo Yuan, Yu Tian, Shijie Geng, Xinyu Li, Ding Zhou, Dimitris N. Metaxas, Hongxia Yang
PDF
Revisiting Prototypical Network for Cross Domain Few-Shot Learning Fei Zhou, Peng Wang, Lei Zhang, Wei Wei, Yanning Zhang
PDF
Revisiting Residual Networks for Adversarial Robustness Shihua Huang, Zhichao Lu, Kalyanmoy Deb, Vishnu Naresh Boddeti
PDF
Revisiting Reverse Distillation for Anomaly Detection Tran Dinh Tien, Anh Tuan Nguyen, Nguyen Hoang Tran, Ta Duc Huy, Soan T.M. Duong, Chanh D. Tr. Nguyen, Steven Q. H. Truong
PDF
Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast Solution Bangyan Liao, Delin Qu, Yifei Xue, Huiqing Zhang, Yizhen Lao
PDF
Revisiting Rotation Averaging: Uncertainties and Robust Losses Ganlin Zhang, Viktor Larsson, Daniel Barath
PDF
Revisiting Self-Similarity: Structural Embedding for Image Retrieval Seongwon Lee, Suhyeon Lee, Hongje Seong, Euntai Kim
PDF
Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring Ruyang Liu, Jingjia Huang, Ge Li, Jiashi Feng, Xinglong Wu, Thomas H. Li
PDF
Revisiting the P3P Problem Yaqing Ding, Jian Yang, Viktor Larsson, Carl Olsson, Kalle Åström
PDF
Revisiting the Stack-Based Inverse Tone Mapping Ning Zhang, Yuyao Ye, Yang Zhao, Ronggang Wang
PDF
Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation Lihe Yang, Lei Qi, Litong Feng, Wayne Zhang, Yinghuan Shi
PDF
RGB No More: Minimally-Decoded JPEG Vision Transformers Jeongsoo Park, Justin Johnson
PDF
RGBD2: Generative Scene Synthesis via Incremental View Inpainting Using RGBD Diffusion Models Jiabao Lei, Jiapeng Tang, Kui Jia
PDF
RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation with Natural Prompts Han Liu, Yuhao Wu, Shixuan Zhai, Bo Yuan, Ning Zhang
PDF
RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo Changjiang Cai, Pan Ji, Qingan Yan, Yi Xu
PDF
RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors Rui-Qi Wu, Zheng-Peng Duan, Chun-Le Guo, Zhi Chai, Chongyi Li
PDF
RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin
PDF
Rigidity-Aware Detection for 6d Object Pose Estimation Yang Hai, Rui Song, Jiaojiao Li, Mathieu Salzmann, Yinlin Hu
PDF
RILS: Masked Visual Reconstruction in Language Semantic Space Shusheng Yang, Yixiao Ge, Kun Yi, Dian Li, Ying Shan, Xiaohu Qie, Xinggang Wang
PDF
RMLVQA: A Margin Loss Approach for Visual Question Answering with Language Biases Abhipsa Basu, Sravanti Addepalli, R. Venkatesh Babu
PDF
Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence Yang Tian, Jiyao Zhang, Zekai Yin, Hao Dong
PDF
Robust 3D Shape Classification via Non-Local Graph Attention Network Shengwei Qin, Zhong Li, Ligang Liu
PDF
Robust and Scalable Gaussian Process Regression and Its Applications Yifan Lu, Jiayi Ma, Leyuan Fang, Xin Tian, Junjun Jiang
PDF
Robust Dynamic Radiance Fields Yu-Lun Liu, Chen Gao, Andréas Meuleman, Hung-Yu Tseng, Ayush Saraf, Changil Kim, Yung-Yu Chuang, Johannes Kopf, Jia-Bin Huang
PDF
Robust Generalization Against Photon-Limited Corruptions via Worst-Case Sharpness Minimization Zhuo Huang, Miaoxi Zhu, Xiaobo Xia, Li Shen, Jun Yu, Chen Gong, Bo Han, Bo Du, Tongliang Liu
PDF
Robust Mean Teacher for Continual and Gradual Test-Time Adaptation Mario Döbler, Robert A. Marsden, Bin Yang
PDF
Robust Model-Based Face Reconstruction Through Weakly-Supervised Outlier Segmentation Chunlu Li, Andreas Morel-Forster, Thomas Vetter, Bernhard Egger, Adam Kortylewski
PDF
Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting Haiping Wang, Yuan Liu, Zhen Dong, Yulan Guo, Yu-Shen Liu, Wenping Wang, Bisheng Yang
PDF
Robust Outlier Rejection for 3D Registration with Variational Bayes Haobo Jiang, Zheng Dang, Zhen Wei, Jin Xie, Jian Yang, Mathieu Salzmann
PDF
Robust Single Image Reflection Removal Against Adversarial Attacks Zhenbo Song, Zhenyuan Zhang, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Wenqi Ren, Jianfeng Lu
PDF
Robust Test-Time Adaptation in Dynamic Scenarios Longhui Yuan, Binhui Xie, Shuang Li
PDF
Robust Unsupervised StyleGAN Image Restoration Yohan Poirier-Ginter, Jean-François Lalonde
PDF
RobustNeRF: Ignoring Distractors with Robust Losses Sara Sabour, Suhani Vora, Daniel Duckworth, Ivan Krasin, David J. Fleet, Andrea Tagliasacchi
PDF
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion Tengfei Wang, Bo Zhang, Ting Zhang, Shuyang Gu, Jianmin Bao, Tadas Baltrusaitis, Jingjing Shen, Dong Chen, Fang Wen, Qifeng Chen, Baining Guo
PDF
Role of Transients in Two-Bounce Non-Line-of-Sight Imaging Siddharth Somasundaram, Akshat Dave, Connor Henley, Ashok Veeraraghavan, Ramesh Raskar
PDF
RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval Yanglin Feng, Hongyuan Zhu, Dezhong Peng, Xi Peng, Peng Hu
PDF
Rotation-Invariant Transformer for Point Cloud Matching Hao Yu, Zheng Qin, Ji Hou, Mahdi Saleh, Dongsheng Li, Benjamin Busam, Slobodan Ilic
PDF
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks Jierun Chen, Shiu-hong Kao, Hao He, Weipeng Zhuo, Song Wen, Chul-Ho Lee, S.-H. Gary Chan
PDF
RUST: Latent Neural Scene Representations from Unposed Imagery Mehdi S. M. Sajjadi, Aravindh Mahendran, Thomas Kipf, Etienne Pot, Daniel Duckworth, Mario Lučić, Klaus Greff
PDF
RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-Ray Security Image Synthesis Luwen Duan, Min Wu, Lijian Mao, Jun Yin, Jianping Xiong, Xi Li
PDF
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning Wei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu
PDF
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation Wenxuan Zhang, Xiaodong Cun, Xuan Wang, Yong Zhang, Xi Shen, Yu Guo, Ying Shan, Fei Wang
PDF
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models Patrick Schramowski, Manuel Brack, Björn Deiseroth, Kristian Kersting
PDF
Sample-Level Multi-View Graph Clustering Yuze Tan, Yixi Liu, Shudong Huang, Wentao Feng, Jiancheng Lv
PDF
Samples with Low Loss Curvature Improve Data Efficiency Isha Garg, Kaushik Roy
PDF
Sampling Is Matter: Point-Guided 3D Human Mesh Reconstruction Jeonghwan Kim, Mi-Gyeong Gwon, Hyunwoo Park, Hyukmin Kwon, Gi-Mun Um, Wonjun Kim
PDF
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency Yang Liu, Yao Zhang, Yixin Wang, Yang Zhang, Jiang Tian, Zhongchao Shi, Jianping Fan, Zhiqiang He
PDF
SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates Mikaela Angelina Uy, Ricardo Martin-Brualla, Leonidas Guibas, Ke Li
PDF
Scalable, Detailed and Mask-Free Universal Photometric Stereo Satoshi Ikehata
PDF
ScaleDet: A Scalable Multi-Dataset Object Detector Yanbei Chen, Manchen Wang, Abhay Mittal, Zhenlin Xu, Paolo Favaro, Joseph Tighe, Davide Modolo
PDF
ScaleFL: Resource-Adaptive Federated Learning with Heterogeneous Clients Fatih Ilhan, Gong Su, Ling Liu
PDF
ScaleKD: Distilling Scale-Aware Knowledge in Small Object Detector Yichen Zhu, Qiqi Zhou, Ning Liu, Zhiyuan Xu, Zhicai Ou, Xiaofeng Mou, Jian Tang
PDF
Scaling Language-Image Pre-Training via Masking Yanghao Li, Haoqi Fan, Ronghang Hu, Christoph Feichtenhofer, Kaiming He
PDF
Scaling up GANs for Text-to-Image Synthesis Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park
PDF
ScanDMM: A Deep Markov Model of Scanpath Prediction for 360deg Images Xiangjie Sui, Yuming Fang, Hanwei Zhu, Shiqi Wang, Zhou Wang
PDF
ScarceNet: Animal Pose Estimation with Scarce Annotations Chen Li, Gim Hee Lee
PDF
SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy Jiafeng Li, Ying Wen, Lianghua He
PDF
Scene-Aware Egocentric 3D Human Pose Estimation Jian Wang, Diogo Luvizon, Weipeng Xu, Lingjie Liu, Kripasindhu Sarkar, Christian Theobalt
PDF
SceneComposer: Any-Level Semantic Image Synthesis Yu Zeng, Zhe Lin, Jianming Zhang, Qing Liu, John Collomosse, Jason Kuen, Vishal M. Patel
PDF
SceneTrilogy: On Human Scene-Sketch and Its Complementarity with Photo and Text Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song
PDF
SCoDA: Domain Adaptive Shape Completion for Real Scans Yushuang Wu, Zizheng Yan, Ce Chen, Lai Wei, Xiao Li, Guanbin Li, Yihao Li, Shuguang Cui, Xiaoguang Han
PDF
SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow Itai Lang, Dror Aiger, Forrester Cole, Shai Avidan, Michael Rubinstein
PDF
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation Haochen Wang, Xiaodan Du, Jiahao Li, Raymond A. Yeh, Greg Shakhnarovich
PDF
SCOTCH and SODA: A Transformer Video Shadow Detection Framework Lihao Liu, Jean Prost, Lei Zhu, Nicolas Papadakis, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero
PDF
SCPNet: Semantic Scene Completion on Point Cloud Zhaoyang Xia, Youquan Liu, Xin Li, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao
PDF
SDC-UDA: Volumetric Unsupervised Domain Adaptation Framework for Slice-Direction Continuous Cross-Modality Medical Image Segmentation Hyungseob Shin, Hyeongyu Kim, Sewon Kim, Yohan Jun, Taejoon Eo, Dosik Hwang
PDF
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation Yen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander G. Schwing, Liang-Yan Gui
PDF
SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence Jiacheng Deng, Chuxin Wang, Jiahao Lu, Jianfeng He, Tianzhu Zhang, Jiyang Yu, Zhe Zhang
PDF
Search-mAP-Search: A Frame Selection Paradigm for Action Recognition Mingjun Zhao, Yakun Yu, Xiaoli Wang, Lei Yang, Di Niu
PDF
Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts Francesco Croce, Sylvestre-Alvise Rebuffi, Evan Shelhamer, Sven Gowal
PDF
SeaThru-NeRF: Neural Radiance Fields in Scattering Media Deborah Levy, Amit Peleg, Naama Pearl, Dan Rosenbaum, Derya Akkaynak, Simon Korman, Tali Treibitz
PDF
SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations Pu Li, Jianwei Guo, Xiaopeng Zhang, Dong-Ming Yan
PDF
Seeing a Rose in Five Thousand Ways Yunzhi Zhang, Shangzhe Wu, Noah Snavely, Jiajun Wu
PDF
Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding Zijiao Chen, Jiaxin Qing, Tiange Xiang, Wan Lin Yue, Juan Helen Zhou
PDF
Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong, Sundaram Muthu, Fahira Afzal Maken, Chuong Nguyen, Hongdong Li
PDF
Seeing What You Miss: Vision-Language Pre-Training with Semantic Completion Learning Yatai Ji, Rongcheng Tu, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu
PDF
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li
PDF
Seeing with Sound: Long-Range Acoustic Beamforming for Multimodal Scene Understanding Praneeth Chakravarthula, Jim Aldon D’Souza, Ethan Tseng, Joe Bartusek, Felix Heide
PDF
SegLoc: Learning Segmentation-Based Representations for Privacy-Preserving Visual Localization Maxime Pietrantoni, Martin Humenberger, Torsten Sattler, Gabriela Csurka
PDF
Selective Structured State-Spaces for Long-Form Video Understanding Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid
PDF
Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation Zhehan Kan, Shuoshuo Chen, Ce Zhang, Yushun Tang, Zhihai He
PDF
Self-Guided Diffusion Models Vincent Tao Hu, David W. Zhang, Yuki M. Asano, Gertjan J. Burghouts, Cees G. M. Snoek
PDF
Self-Positioning Point-Based Transformer for Point Cloud Understanding Jinyoung Park, Sanghyeok Lee, Sihyeon Kim, Yunyang Xiong, Hyunwoo J. Kim
PDF
Self-Supervised 3D Scene Flow Estimation Guided by Superpoints Yaqi Shen, Le Hui, Jin Xie, Jian Yang
PDF
Self-Supervised AutoFlow Hsin-Ping Huang, Charles Herrmann, Junhwa Hur, Erika Lu, Kyle Sargent, Austin Stone, Ming-Hsuan Yang, Deqing Sun
PDF
Self-Supervised Blind Motion Deblurring with Deep Expectation Maximization Ji Li, Weixi Wang, Yuesong Nan, Hui Ji
PDF
Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion Yushi Lan, Xuyi Meng, Shuai Yang, Chen Change Loy, Bo Dai
PDF
Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss Anas Mahmoud, Jordan S. K. Hu, Tianshu Kuai, Ali Harakeh, Liam Paull, Steven L. Waslander
PDF
Self-Supervised Implicit Glyph Attention for Text Recognition Tongkun Guan, Chaochen Gu, Jingzheng Tu, Xue Yang, Qi Feng, Yudi Zhao, Wei Shen
PDF
Self-Supervised Learning for Multimodal Non-Rigid 3D Shape Matching Dongliang Cao, Florian Bernard
PDF
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Mahmoud Assran, Quentin Duval, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Yann LeCun, Nicolas Ballas
PDF
Self-Supervised Non-Uniform Kernel Estimation with Flow-Based Motion Prior for Blind Image Deblurring Zhenxuan Fang, Fangfang Wu, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi
PDF
Self-Supervised Pre-Training with Masked Shape Prediction for 3D Scene Understanding Li Jiang, Zetong Yang, Shaoshuai Shi, Vladislav Golyanik, Dengxin Dai, Bernt Schiele
PDF
Self-Supervised Representation Learning for CAD Benjamin T. Jones, Michael Hu, Milin Kodnongbua, Vladimir G. Kim, Adriana Schulz
PDF
Self-Supervised Super-Plane for Neural 3D Reconstruction Botao Ye, Sifei Liu, Xueting Li, Ming-Hsuan Yang
PDF
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection Chao Feng, Ziyang Chen, Andrew Owens
PDF
SelfME: Self-Supervised Motion Learning for Micro-Expression Recognition Xinqi Fan, Xueli Chen, Mingjie Jiang, Ali Raza Shahid, Hong Yan
PDF
Semantic Human Parsing via Scalable Semantic Transfer over Multiple Label Domains Jie Yang, Chaoqun Wang, Zhen Li, Junle Wang, Ruimao Zhang
PDF
Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention Fangfu Liu, Chubin Zhang, Yu Zheng, Yueqi Duan
PDF
Semantic Scene Completion with Cleaner Self Fengyun Wang, Dong Zhang, Hanwang Zhang, Jinhui Tang, Qianru Sun
PDF
Semantic-Conditional Diffusion Networks for Image Captioning Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei
PDF
Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation Shuting He, Henghui Ding, Wei Jiang
PDF
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers Jiacheng Zhang, Xiangru Lin, Wei Zhang, Kuo Wang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li
PDF
Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module Linzhi Huang, Yulong Li, Hongbo Tian, Yue Yang, Xiangang Li, Weihong Deng, Jieping Ye
PDF
Semi-Supervised Domain Adaptation with Source Label Adaptation Yu-Chu Yu, Hsuan-Tien Lin
PDF
Semi-Supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination Zimeng Zhao, Binghui Zuo, Zhiyu Long, Yangang Wang
PDF
Semi-Supervised Learning Made Simple with Self-Supervised Clustering Enrico Fini, Pietro Astolfi, Karteek Alahari, Xavier Alameda-Pineda, Julien Mairal, Moin Nabi, Elisa Ricci
PDF
Semi-Supervised Parametric Real-World Image Harmonization Ke Wang, Michaël Gharbi, He Zhang, Zhihao Xia, Eli Shechtman
PDF
Semi-Supervised Stereo-Based 3D Object Detection via Cross-View Consensus Wenhao Wu, Hau San Wong, Si Wu
PDF
Semi-Supervised Video Inpainting with Cycle Consistency Constraints Zhiliang Wu, Hanyu Xuan, Changchang Sun, Weili Guan, Kang Zhang, Yan Yan
PDF
Semi-Weakly Supervised Object Kinematic Motion Prediction Gengxin Liu, Qian Sun, Haibin Huang, Chongyang Ma, Yulan Guo, Li Yi, Hui Huang, Ruizhen Hu
PDF
SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation Huimin Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen, Yuexiang Li, Hong Wang, Yawen Huang, Yefeng Zheng
PDF
Semidefinite Relaxations for Robust Multiview Triangulation Linus Härenstam-Nielsen, Niclas Zeller, Daniel Cremers
PDF
SeqTrack: Sequence to Sequence Learning for Visual Object Tracking Xin Chen, Houwen Peng, Dong Wang, Huchuan Lu, Han Hu
PDF
Sequential Training of GANs Against GAN-Classifiers Reveals Correlated "Knowledge Gaps" Present Among Independently Trained GAN Instances Arkanath Pathak, Nicholas Dufour
PDF
SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction Yukang Cao, Kai Han, Kwan-Yee K. Wong
PDF
SFD2: Semantic-Guided Feature Detection and Description Fei Xue, Ignas Budvytis, Roberto Cipolla
PDF
SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks Sergio Izquierdo, Javier Civera
PDF
SGLoc: Scene Geometry Encoding for Outdoor LiDAR Localization Wen Li, Shangshu Yu, Cheng Wang, Guosheng Hu, Siqi Shen, Chenglu Wen
PDF
ShadowDiffusion: When Degradation Prior Meets Diffusion Model for Shadow Removal Lanqing Guo, Chong Wang, Wenhan Yang, Siyu Huang, Yufei Wang, Hanspeter Pfister, Bihan Wen
PDF
ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision Jingwang Ling, Zhibo Wang, Feng Xu
PDF
Shakes on a Plane: Unsupervised Depth Estimation from Unstabilized Photography Ilya Chugunov, Yuxuan Zhang, Felix Heide
PDF
Shape-Aware Text-Driven Layered Video Editing Yao-Chih Lee, Ji-Ze Genevieve Jang, Yi-Ting Chen, Elizabeth Qiu, Jia-Bin Huang
PDF
Shape-Constraint Recurrent Flow for 6d Object Pose Estimation Yang Hai, Rui Song, Jiaojiao Li, Yinlin Hu
PDF
Shape-Erased Feature Learning for Visible-Infrared Person Re-Identification Jiawei Feng, Ancong Wu, Wei-Shi Zheng
PDF
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion Dario Pavllo, David Joseph Tan, Marie-Julie Rakotosaona, Federico Tombari
PDF
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-Based Consistency Zixuan Huang, Varun Jampani, Anh Thai, Yuanzhen Li, Stefan Stojanov, James M. Rehg
PDF
ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations Panos Achlioptas, Ian Huang, Minhyuk Sung, Sergey Tulyakov, Leonidas Guibas
PDF
Sharpness-Aware Gradient Matching for Domain Generalization Pengfei Wang, Zhaoxiang Zhang, Zhen Lei, Lei Zhang
PDF
Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning Jinwoo Kim, Janghyuk Choi, Ho-Jin Choi, Seon Joo Kim
PDF
Shifted Diffusion for Text-to-Image Generation Yufan Zhou, Bingchen Liu, Yizhe Zhu, Xiao Yang, Changyou Chen, Jinhui Xu
PDF
Shortcomings of Top-Down Randomization-Based Sanity Checks for Evaluations of Deep Neural Network Explanations Alexander Binder, Leander Weber, Sebastian Lapuschkin, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek
PDF
SHS-Net: Learning Signed Hyper Surfaces for Oriented Normal Estimation of Point Clouds Qing Li, Huifang Feng, Kanle Shi, Yue Gao, Yi Fang, Yu-Shen Liu, Zhizhong Han
PDF
Siamese DETR Zeren Chen, Gengshi Huang, Wei Li, Jianing Teng, Kun Wang, Jing Shao, Chen Change Loy, Lu Sheng
PDF
Siamese Image Modeling for Self-Supervised Vision Representation Learning Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai
PDF
Sibling-Attack: Rethinking Transferable Adversarial Attacks Against Face Recognition Zexin Li, Bangjie Yin, Taiping Yao, Junfeng Guo, Shouhong Ding, Simin Chen, Cong Liu
PDF
Side Adapter Network for Open-Vocabulary Semantic Segmentation Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai
PDF
SIEDOB: Semantic Image Editing by Disentangling Object and Background Wuyang Luo, Su Yang, Xinjian Zhang, Weishan Zhang
PDF
SIM: Semantic-Aware Instance Mask Generation for Box-Supervised Instance Segmentation Ruihuang Li, Chenhang He, Yabin Zhang, Shuai Li, Liyi Chen, Lei Zhang
PDF
Similarity Maps for Self-Training Weakly-Supervised Phrase Grounding Tal Shaharabany, Lior Wolf
PDF
Similarity Metric Learning for RGB-Infrared Group Re-Identification Jianghao Xiong, Jianhuang Lai
PDF
Simple Cues Lead to a Strong Multi-Object Tracker Jenny Seidenschwarz, Guillem Brasó, Víctor Castro Serrano, Ismail Elezi, Laura Leal-Taixé
PDF
SimpleNet: A Simple Network for Image Anomaly Detection and Localization Zhikang Liu, Yiming Zhou, Yuansheng Xu, Zilei Wang
PDF
SimpSON: Simplifying Photo Cleanup with Single-Click Distracting Object Segmentation Network Chuong Huynh, Yuqian Zhou, Zhe Lin, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi, Abhinav Shrivastava
PDF
Simulated Annealing in Early Layers Leads to Better Generalization Amir M. Sarfi, Zahra Karimpour, Muawiz Chaudhary, Nasir M. Khalid, Mirco Ravanelli, Sudhir Mudur, Eugene Belilovsky
PDF
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation Jiangwei Lao, Weixiang Hong, Xin Guo, Yingying Zhang, Jian Wang, Jingdong Chen, Wei Chu
PDF
SINE: Semantic-Driven Image-Based NeRF Editing with Prior-Guided Editing Field Chong Bao, Yinda Zhang, Bangbang Yang, Tianxing Fan, Zesong Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui
PDF
SINE: SINgle Image Editing with Text-to-Image Diffusion Models Zhixing Zhang, Ligong Han, Arnab Ghosh, Dimitris N. Metaxas, Jian Ren
PDF
Single Domain Generalization for LiDAR Semantic Segmentation Hyeonseong Kim, Yoonsu Kang, Changgyoon Oh, Kuk-Jin Yoon
PDF
Single Image Backdoor Inversion via Robust Smoothed Classifiers Mingjie Sun, Zico Kolter
PDF
Single Image Depth Prediction Made Better: A Multivariate Gaussian Take Ce Liu, Suryansh Kumar, Shuhang Gu, Radu Timofte, Luc Van Gool
PDF
Single View Scene Scale Estimation Using Scale Field Byeong-Uk Lee, Jianming Zhang, Yannick Hold-Geoffroy, In So Kweon
PDF
SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene Minjung Son, Jeong Joon Park, Leonidas Guibas, Gordon Wetzstein
PDF
Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings Ayan Kumar Bhunia, Subhadeep Koley, Amandeep Kumar, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song
PDF
SketchXAI: A First Look at Explainability for Human Sketches Zhiyu Qu, Yulia Gryaditskaya, Ke Li, Kaiyue Pang, Tao Xiang, Yi-Zhe Song
PDF
Skinned Motion Retargeting with Residual Perception of Motion Semantics & Geometry Jiaxu Zhang, Junwu Weng, Di Kang, Fang Zhao, Shaoli Huang, Xuefei Zhe, Linchao Bao, Ying Shan, Jue Wang, Zhigang Tu
PDF
SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images Nikhil Gosala, Kürsat Petek, Paulo L. J. Drews-Jr, Wolfram Burgard, Abhinav Valada
PDF
SLACK: Stable Learning of Augmentations with Cold-Start and KL Regularization Juliette Marrie, Michael Arbel, Diane Larlus, Julien Mairal
PDF
Sliced Optimal Partial Transport Yikun Bai, Bernhard Schmitzer, Matthew Thorpe, Soheil Kolouri
PDF
SliceMatch: Geometry-Guided Aggregation for Cross-View Pose Estimation Ted Lentsch, Zimin Xia, Holger Caesar, Julian F. P. Kooij
PDF
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention Xuran Pan, Tianzhu Ye, Zhuofan Xia, Shiji Song, Gao Huang
PDF
Slimmable Dataset Condensation Songhua Liu, Jingwen Ye, Runpeng Yu, Xinchao Wang
PDF
SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments Yudi Dai, Yitai Lin, Xiping Lin, Chenglu Wen, Lan Xu, Hongwei Yi, Siqi Shen, Yuexin Ma, Cheng Wang
PDF
SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples Han Liu, Yuhao Wu, Zhiyuan Yu, Yevgeniy Vorobeychik, Ning Zhang
PDF
SMAE: Few-Shot Learning for HDR Deghosting with Saturation-Aware Masked Autoencoders Qingsen Yan, Song Zhang, Weiye Chen, Hao Tang, Yu Zhu, Jinqiu Sun, Luc Van Gool, Yanning Zhang
PDF
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva
PDF
SmartAssign: Learning a Smart Knowledge Assignment Strategy for Deraining and Desnowing Yinglong Wang, Chao Ma, Jianzhuang Liu
PDF
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model Shaoan Xie, Zhifei Zhang, Zhe Lin, Tobias Hinz, Kun Zhang
PDF
SMOC-Net: Leveraging Camera Pose for Self-Supervised Monocular Object Pose Estimation Tao Tan, Qiulei Dong
PDF
SMPConv: Self-Moving Point Representations for Continuous Convolution Sanghyeon Kim, Eunbyung Park
PDF
Soft Augmentation for Image Classification Yang Liu, Shen Yan, Laura Leal-Taixé, James Hays, Deva Ramanan
PDF
Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks Hyolim Kang, Hanjung Kim, Joungbin An, Minsu Cho, Seon Joo Kim
PDF
Solving 3D Inverse Problems Using Pre-Trained 2D Diffusion Models Hyungjin Chung, Dohoon Ryu, Michael T. McCann, Marc L. Klasky, Jong Chul Ye
PDF
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective Yuexiao Ma, Huixia Li, Xiawu Zheng, Xuefeng Xiao, Rui Wang, Shilei Wen, Xin Pan, Fei Chao, Rongrong Ji
PDF
Solving Relaxations of MAP-MRF Problems: Combinatorial In-Face Frank-Wolfe Directions Vladimir Kolmogorov
PDF
SOOD: Towards Semi-Supervised Oriented Object Detection Wei Hua, Dingkang Liang, Jingyu Li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, Xiang Bai
PDF
Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment Kim Sung-Bin, Arda Senocak, Hyunwoo Ha, Andrew Owens, Tae-Hyun Oh
PDF
Source-Free Adaptive Gaze Estimation by Uncertainty Reduction Xin Cai, Jiabei Zeng, Shiguang Shan, Xilin Chen
PDF
Source-Free Video Domain Adaptation with Spatial-Temporal-Historical Consistency Learning Kai Li, Deep Patel, Erik Kruus, Martin Renqiang Min
PDF
SPARF: Neural Radiance Fields from Sparse and Noisy Poses Prune Truong, Marie-Julie Rakotosaona, Fabian Manhardt, Federico Tombari
PDF
Sparse Multi-Modal Graph Transformer with Shared-Context Processing for Representation Learning of Giga-Pixel Images Ramin Nakhli, Puria Azadi Moghadam, Haoyang Mi, Hossein Farahani, Alexander Baras, Blake Gilks, Ali Bashashati
PDF
SparseFusion: Distilling View-Conditioned Diffusion for 3D Reconstruction Zhizhuo Zhou, Shubham Tulsiani
PDF
Sparsely Annotated Semantic Segmentation with Adaptive Gaussian Mixtures Linshan Wu, Zhun Zhong, Leyuan Fang, Xingxin He, Qiang Liu, Jiayi Ma, Hao Chen
PDF
SparsePose: Sparse-View Camera Pose Regression and Refinement Samarth Sinha, Jason Y. Zhang, Andrea Tagliasacchi, Igor Gilitschenski, David B. Lindell
PDF
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer Xuanyao Chen, Zhijian Liu, Haotian Tang, Li Yi, Hang Zhao, Song Han
PDF
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers Cong Wei, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor, Florian Shkurti
PDF
SpaText: Spatio-Textual Representation for Controllable Image Generation Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin
PDF
Spatial-Frequency Mutual Learning for Face Super-Resolution Chenyang Wang, Junjun Jiang, Zhiwei Zhong, Xianming Liu
PDF
Spatial-Temporal Concept Based Explanation of 3D ConvNets Ying Ji, Yu Wang, Jien Kato
PDF
Spatial-Then-Temporal Self-Supervised Learning for Video Correspondence Rui Li, Dong Liu
PDF
Spatially Adaptive Self-Supervised Learning for Real-World Image Denoising Junyi Li, Zhilu Zhang, Xiaoyu Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Wangmeng Zuo
PDF
Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel Image Donggun Kim, Hyeonjoong Jang, Inchul Kim, Min H. Kim
PDF
Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation Shao-Yuan Lo, Poojan Oza, Sumanth Chennupati, Alejandro Galindo, Vishal M. Patel
PDF
Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild Yanhao Wu, Tong Zhang, Wei Ke, Sabine Süsstrunk, Mathieu Salzmann
PDF
Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any Unseen Style Haoming Lu, Hazarapet Tunanyan, Kai Wang, Shant Navasardyan, Zhangyang Wang, Humphrey Shi
PDF
Spectral Bayesian Uncertainty for Image Super-Resolution Tao Liu, Jun Cheng, Shan Tan
PDF
Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising Miaoyu Li, Ji Liu, Ying Fu, Yulun Zhang, Dejing Dou
PDF
Sphere-Guided Training of Neural Implicit Surfaces Andreea Dogaru, Andrei-Timotei Ardelean, Savva Ignatyev, Egor Zakharov, Evgeny Burnaev
PDF
Spherical Transformer for LiDAR-Based 3D Recognition Xin Lai, Yukang Chen, Fanbin Lu, Jianhui Liu, Jiaya Jia
PDF
Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training Siddarth Asokan, Chandra Sekhar Seelamantula
PDF
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields Ashkan Mirzaei, Tristan Aumentado-Armstrong, Konstantinos G. Derpanis, Jonathan Kelly, Marcus A. Brubaker, Igor Gilitschenski, Alex Levinshtein
PDF
SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries Ahmed Imtiaz Humayun, Randall Balestriero, Guha Balakrishnan, Richard G. Baraniuk
PDF
Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo Lukas Mehl, Jenny Schmalfuss, Azin Jahedi, Yaroslava Nalivayko, Andrés Bruhn
PDF
SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection Tiange Xiang, Yixiao Zhang, Yongyi Lu, Alan L. Yuille, Chaoyi Zhang, Weidong Cai, Zongwei Zhou
PDF
sRGB Real Noise Synthesizing with Neighboring Correlation-Aware Noise Model Zixuan Fu, Lanqing Guo, Bihan Wen
PDF
Standing Between past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking Ziqi Pang, Jie Li, Pavel Tokmakov, Dian Chen, Sergey Zagoruyko, Yu-Xiong Wang
PDF
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection Zhenglin Zhou, Huaxia Li, Hong Liu, Nanyang Wang, Gang Yu, Rongrong Ji
PDF
StarCraftImage: A Dataset for Prototyping Spatial Reasoning Methods for Multi-Agent Environments Sean Kulinski, Nicholas R. Waytowich, James Z. Hare, David I. Inouye
PDF
Stare at What You See: Masked Image Modeling Without Reconstruction Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo
PDF
Starting from Non-Parametric Networks for 3D Point Cloud Analysis Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi
PDF
STDLens: Model Hijacking-Resilient Federated Learning for Object Detection Ka-Ho Chow, Ling Liu, Wenqi Wei, Fatih Ilhan, Yanzhao Wu
PDF
SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory Sicheng Li, Hao Li, Yue Wang, Yiyi Liao, Lu Yu
PDF
StepFormer: Self-Supervised Step Discovery and Localization in Instructional Videos Nikita Dvornik, Isma Hadji, Ran Zhang, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson
PDF
Stimulus Verification Is a Universal and Effective Sampler in Multi-Modal Human Trajectory Prediction Jianhua Sun, Yuxuan Li, Liang Chai, Cewu Lu
PDF
Stitchable Neural Networks Zizheng Pan, Jianfei Cai, Bohan Zhuang
PDF
STMixer: A One-Stage Sparse Action Detector Tao Wu, Mengqi Cao, Ziteng Gao, Gangshan Wu, Limin Wang
PDF
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition Xiaoyu Zhu, Po-Yao Huang, Junwei Liang, Celso M. de Melo, Alexander G. Hauptmann
PDF
Streaming Video Model Yucheng Zhao, Chong Luo, Chuanxin Tang, Dongdong Chen, Noel Codella, Zheng-Jun Zha
PDF
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato, Yan Lu
PDF
Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising Zehua Sheng, Zhu Yu, Xiongwei Liu, Si-Yuan Cao, Yuqi Liu, Hui-Liang Shen, Huaqi Zhang
PDF
Structured 3D Features for Reconstructing Controllable Avatars Enric Corona, Mihai Zanfir, Thiemo Alldieck, Eduard Gabriel Bazavan, Andrei Zanfir, Cristian Sminchisescu
PDF
Structured Kernel Estimation for Photon-Limited Deconvolution Yash Sanghvi, Zhiyuan Mao, Stanley H. Chan
PDF
Structured Sparsity Learning for Efficient Video Super-Resolution Bin Xia, Jingwen He, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Luc Van Gool
PDF
StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen, Sanping Zhou, Jingwen Fu, Ruotong Wang, Shitao Chen, Nanning Zheng
PDF
Style Projected Clustering for Domain Generalized Semantic Segmentation Wei Huang, Chang Chen, Yong Li, Jiacheng Li, Cheng Li, Fenglong Song, Youliang Yan, Zhiwei Xiong
PDF
StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning Yuqian Fu, Yu Xie, Yanwei Fu, Yu-Gang Jiang
PDF
StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer Sasikarn Khwanmuang, Pakkapon Phongthawee, Patsorn Sangkloy, Supasorn Suwajanakorn
PDF
StyleGene: Crossover and Mutation of Region-Level Facial Genes for Kinship Face Synthesis Hao Li, Xianxu Hou, Zepeng Huang, Linlin Shen
PDF
StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping Diqiong Jiang, Dan Song, Ruofeng Tong, Min Tang
PDF
StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN Hamza Pehlivan, Yusuf Dalva, Aysegul Dundar
PDF
StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields Kunhao Liu, Fangneng Zhan, Yiwen Chen, Jiahui Zhang, Yingchen Yu, Abdulmotaleb El Saddik, Shijian Lu, Eric P. Xing
PDF
StyLess: Boosting the Transferability of Adversarial Examples Kaisheng Liang, Bin Xiao
PDF
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang
PDF
SUDS: Scalable Urban Dynamic Scenes Haithem Turki, Jason Y. Zhang, Francesco Ferroni, Deva Ramanan
PDF
SunStage: Portrait Reconstruction and Relighting Using the Sun as a Light Stage Yifan Wang, Aleksander Holynski, Xiuming Zhang, Xuaner Zhang
PDF
Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning Zhuowan Li, Xingrui Wang, Elias Stengel-Eskin, Adam Kortylewski, Wufei Ma, Benjamin Van Durme, Alan L. Yuille
PDF
Super-Resolution Neural Operator Min Wei, Xuesong Zhang
PDF
Superclass Learning with Representation Enhancement Zeyu Gan, Suyun Zhao, Jinlong Kang, Liyuan Shang, Hong Chen, Cuiping Li
PDF
SuperDisco: Super-Class Discovery Improves Visual Recognition for the Long-Tail Yingjun Du, Jiayi Shen, Xiantong Zhen, Cees G. M. Snoek
PDF
Supervised Masked Knowledge Distillation for Few-Shot Transformers Han Lin, Guangxing Han, Jiawei Ma, Shiyuan Huang, Xudong Lin, Shih-Fu Chang
PDF
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes Yiming Gao, Yan-Pei Cao, Ying Shan
PDF
SVFormer: Semi-Supervised Video Transformer for Action Recognition Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
PDF
SVGformer: Representation Learning for Continuous Vector Graphics Using Transformers Defu Cao, Zhaowen Wang, Jose Echevarria, Yan Liu
PDF
SViTT: Temporal Learning of Sparse Video-Text Transformers Yi Li, Kyle Min, Subarna Tripathi, Nuno Vasconcelos
PDF
Swept-Angle Synthetic Wavelength Interferometry Alankar Kotwal, Anat Levin, Ioannis Gkioulekas
PDF
Switchable Representation Learning Framework with Self-Compatibility Shengsen Wu, Yan Bai, Yihang Lou, Xiongkun Linghu, Jianzhong He, Ling-Yu Duan
PDF
Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion Changfeng Ma, Yinuo Chen, Pengxiao Guo, Jie Guo, Chongjun Wang, Yanwen Guo
PDF
Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement Siddarth Ravichandran, Ondřej Texler, Dimitar Dinev, Hyun Jae Kang
PDF
SynthVSR: Scaling up Visual Speech Recognition with Synthetic Supervision Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jachym Kolar, Stavros Petridis, Maja Pantic, Christian Fuegen
PDF
System-Status-Aware Adaptive Network for Online Streaming Video Understanding Lin Geng Foo, Jia Gong, Zhipeng Fan, Jun Liu
PDF
T-SEA: Transfer-Based Self-Ensemble Attack on Object Detection Hao Huang, Ziyan Chen, Huanran Chen, Yongtao Wang, Kevin Zhang
PDF
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation Lingting Zhu, Xian Liu, Xuanyu Liu, Rui Qian, Ziwei Liu, Lequan Yu
PDF
Tangentially Elongated Gaussian Belief Propagation for Event-Based Incremental Optical Flow Estimation Jun Nagata, Yusuke Sekikawa
PDF
TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap
PDF
Target-Referenced Reactive Grasping for Dynamic Objects Jirong Liu, Ruo Zhang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Chenxi Wang, Sheng Xu, Hengxu Yan, Cewu Lu
PDF
TarViS: A Unified Approach for Target-Based Video Segmentation Ali Athar, Alexander Hermans, Jonathon Luiten, Deva Ramanan, Bastian Leibe
PDF
Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning Wenjin Wang, Yunqing Hu, Qianglong Chen, Yin Zhang
PDF
Task Residual for Tuning Vision-Language Models Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang
PDF
Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification Honglin Li, Chenglu Zhu, Yunlong Zhang, Yuxuan Sun, Zhongyi Shui, Wenwei Kuang, Sunyi Zheng, Lin Yang
PDF
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving Shaoheng Fang, Zi Wang, Yiqi Zhong, Junhao Ge, Siheng Chen
PDF
Teacher-Generated Spatial-Attention Labels Boost Robustness and Accuracy of Contrastive Models Yushi Yao, Chang Ye, Junfeng He, Gamaleldin F. Elsayed
PDF
Teaching Matters: Investigating the Role of Supervision in Vision Transformers Matthew Walmer, Saksham Suri, Kamal Gupta, Abhinav Shrivastava
PDF
Teaching Structured Vision & Language Concepts to Vision & Language Models Sivan Doveh, Assaf Arbelle, Sivan Harary, Eli Schwartz, Roei Herzig, Raja Giryes, Rogerio Feris, Rameswar Panda, Shimon Ullman, Leonid Karlinsky
PDF
Teleidoscopic Imaging System for Microscale 3D Shape Reconstruction Ryo Kawahara, Meng-Yu Jennifer Kuo, Shohei Nobuhara
PDF
Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation Tsu-Jui Fu, Licheng Yu, Ning Zhang, Cheng-Yang Fu, Jong-Chyi Su, William Yang Wang, Sean Bell
PDF
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning Cheng Tan, Zhangyang Gao, Lirong Wu, Yongjie Xu, Jun Xia, Siyuan Li, Stan Z. Li
PDF
Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving Lucas Nunes, Louis Wiesmann, Rodrigo Marcuzzi, Xieyuanli Chen, Jens Behley, Cyrill Stachniss
PDF
Temporal Interpolation Is All You Need for Dynamic Neural Radiance Fields Sungheon Park, Minjung Son, Seokhwan Jang, Young Chun Ahn, Ji-Yeon Kim, Nahyup Kang
PDF
Temporally Consistent Online Depth Estimation Using Point-Based Fusion Numair Khan, Eric Penner, Douglas Lanman, Lei Xiao
PDF
TempSAL - Uncovering Temporal Information for Deep Saliency Prediction Bahar Aydemir, Ludo Hoffstetter, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk
PDF
TensoIR: Tensorial Inverse Rendering Haian Jin, Isabella Liu, Peijia Xu, Xiaoshuai Zhang, Songfang Han, Sai Bi, Xiaowei Zhou, Zexiang Xu, Hao Su
PDF
Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering Ruizhi Shao, Zerong Zheng, Hanzhang Tu, Boning Liu, Hongwen Zhang, Yebin Liu
PDF
TeSLA: Test-Time Self-Learning with Automatic Adversarial Augmentation Devavrat Tomar, Guillaume Vray, Behzad Bozorgtabar, Jean-Philippe Thiran
PDF
Test of Time: Instilling Video-Language Models with a Sense of Time Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
PDF
Test Time Adaptation with Regularized Loss for Weakly Supervised Salient Object Detection Olga Veksler
PDF
TexPose: Neural Texture Learning for Self-Supervised 6d Object Pose Estimation Hanzhi Chen, Fabian Manhardt, Nassir Navab, Benjamin Busam
PDF
Text with Knowledge Graph Augmented Transformer for Video Captioning Xin Gu, Guang Chen, Yufei Wang, Libo Zhang, Tiejian Luo, Longyin Wen
PDF
Text-Guided Unsupervised Latent Transformation for Multi-Attribute Image Manipulation Xiwen Wei, Zhen Xu, Cheng Liu, Si Wu, Zhiwen Yu, Hau San Wong
PDF
Text-Visual Prompting for Efficient 2D Temporal Video Grounding Yimeng Zhang, Xin Chen, Jinghan Jia, Sijia Liu, Ke Ding
PDF
Text2Scene: Text-Driven Indoor Scene Stylization with Part-Aware Details Inwoo Hwang, Hyeonwoo Kim, Young Min Kim
PDF
Texts as Images in Prompt Tuning for Multi-Label Image Recognition Zixian Guo, Bowen Dong, Zhilong Ji, Jinfeng Bai, Yiwen Guo, Wangmeng Zuo
PDF
Texture-Guided Saliency Distilling for Unsupervised Salient Object Detection Huajun Zhou, Bo Qiao, Lingxiao Yang, Jianhuang Lai, Xiaohua Xie
PDF
The Best Defense Is a Good Offense: Adversarial Augmentation Against Adversarial Attacks Iuri Frosio, Jan Kautz
PDF
The Dark Side of Dynamic Routing Neural Networks: Towards Efficiency Backdoor Injection Simin Chen, Hanlin Chen, Mirazul Haque, Cong Liu, Wei Yang
PDF
The Devil Is in the Points: Weakly Semi-Supervised Instance Segmentation via Point-Guided Mask Representation Beomyoung Kim, Joonhyun Jeong, Dongyoon Han, Sung Ju Hwang
PDF
The Dialog Must Go on: Improving Visual Dialog via Generative Self-Training Gi-Cheon Kang, Sungdong Kim, Jin-Hwa Kim, Donghyun Kwak, Byoung-Tak Zhang
PDF
The Differentiable Lens: Compound Lens Search over Glass Surfaces and Materials for Object Detection Geoffroi Côté, Fahim Mannan, Simon Thibault, Jean-François Lalonde, Felix Heide
PDF
The Enemy of My Enemy Is My Friend: Exploring Inverse Adversaries for Improving Adversarial Training Junhao Dong, Seyed-Mohsen Moosavi-Dezfooli, Jianhuang Lai, Xiaohua Xie
PDF
The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects Ruohan Gao, Yiming Dou, Hao Li, Tanmay Agarwal, Jeannette Bohg, Yunzhu Li, Li Fei-Fei, Jiajun Wu
PDF
The Resource Problem of Using Linear Layer Leakage Attack in Federated Learning Joshua C. Zhao, Ahmed Roushdy Elkordy, Atul Sharma, Yahya H. Ezzeldin, Salman Avestimehr, Saurabh Bagchi
PDF
The Treasure Beneath Multiple Annotations: An Uncertainty-Aware Edge Detector Caixia Zhou, Yaping Huang, Mengyang Pu, Qingji Guan, Li Huang, Haibin Ling
PDF
The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction Alexandros Stergiou, Dima Damen
PDF
Therbligs in Action: Video Understanding Through Motion Primitives Eadom Dessalene, Michael Maynord, Cornelia Fermüller, Yiannis Aloimonos
PDF
Thermal Spread Functions (TSF): Physics-Guided Material Classification Aniket Dashpute, Vishwanath Saragadam, Emma Alexander, Florian Willomitzer, Aggelos Katsaggelos, Ashok Veeraraghavan, Oliver Cossairt
PDF
Think Twice Before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving Xiaosong Jia, Penghao Wu, Li Chen, Jiangwei Xie, Conghui He, Junchi Yan, Hongyang Li
PDF
Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning Yun-Hao Cao, Peiqin Sun, Shuchang Zhou
PDF
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Chen Chen, Mubarak Shah
PDF
TINC: Tree-Structured Implicit Neural Compression Runzhao Yang
PDF
TinyMIM: An Empirical Study of Distilling MIM Pre-Trained Models Sucheng Ren, Fangyun Wei, Zheng Zhang, Han Hu
PDF
TIPI: Test Time Adaptation with Transformation Invariance A. Tuan Nguyen, Thanh Nguyen-Tang, Ser-Nam Lim, Philip H.S. Torr
PDF
TMO: Textured Mesh Acquisition of Objects with a Mobile Device by Using Differentiable Rendering Jaehoon Choi, Dongki Jung, Taejae Lee, Sangwook Kim, Youngdong Jung, Dinesh Manocha, Donghwan Lee
PDF
Token Boosting for Robust Self-Supervised Visual Transformer Pre-Training Tianjiao Li, Lin Geng Foo, Ping Hu, Xindi Shang, Hossein Rahmani, Zehuan Yuan, Jun Liu
PDF
Token Contrast for Weakly-Supervised Semantic Segmentation Lixiang Ru, Heliang Zheng, Yibing Zhan, Bo Du
PDF
Token Turing Machines Michael S. Ryoo, Keerthana Gopalakrishnan, Kumara Kahatapitiya, Ted Xiao, Kanishka Rao, Austin Stone, Yao Lu, Julian Ibarz, Anurag Arnab
PDF
TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers Cheng Zhang, Hai Liu, Yongjian Deng, Bochen Xie, Youfu Li
PDF
Top-Down Visual Attention from Analysis by Synthesis Baifeng Shi, Trevor Darrell, Xin Wang
PDF
TopDiG: Class-Agnostic Topological Directional Graph Extraction from Remote Sensing Images Bingnan Yang, Mi Zhang, Zhan Zhang, Zhili Zhang, Xiangyun Hu
PDF
TOPLight: Lightweight Neural Networks with Task-Oriented Pretraining for Visible-Infrared Recognition Hao Yu, Xu Cheng, Wei Peng
PDF
TopNet: Transformer-Based Object Placement Network for Image Compositing Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen
PDF
Topology-Guided Multi-Class Cell Context Generation for Digital Pathology Shahira Abousamra, Rajarsi Gupta, Tahsin Kurc, Dimitris Samaras, Joel Saltz, Chao Chen
PDF
ToThePoint: Efficient Contrastive Learning of 3D Point Clouds via Recycling Xinglin Li, Jiajing Chen, Jinhui Ouyang, Hanhui Deng, Senem Velipasalar, Di Wu
PDF
Toward Accurate Post-Training Quantization for Image Super Resolution Zhijun Tu, Jie Hu, Hanting Chen, Yunhe Wang
PDF
Toward RAW Object Detection: A New Benchmark and a New Model Ruikang Xu, Chang Chen, Jingyang Peng, Cheng Li, Yibin Huang, Fenglong Song, Youliang Yan, Zhiwei Xiong
PDF
Toward Stable, Interpretable, and Lightweight Hyperspectral Super-Resolution Wen-jin Guo, Weiying Xie, Kai Jiang, Yunsong Li, Jie Lei, Leyuan Fang
PDF
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation Mayu Otani, Riku Togashi, Yu Sawai, Ryosuke Ishigami, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin’ichi Satoh
PDF
Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval Yi Xie, Huaidong Zhang, Xuemiao Xu, Jianqing Zhu, Shengfeng He
PDF
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization Mengqi Huang, Zhendong Mao, Zhuowei Chen, Yongdong Zhang
PDF
Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai
PDF
Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method Ran Yi, Haoyuan Tian, Zhihao Gu, Yu-Kun Lai, Paul L. Rosin
PDF
Towards Benchmarking and Assessing Visual Naturalness of Physical World Adversarial Attacks Simin Li, Shuning Zhang, Gujun Chen, Dong Wang, Pu Feng, Jiakai Wang, Aishan Liu, Xin Yi, Xianglong Liu
PDF
Towards Better Decision Forests: Forest Alternating Optimization Miguel Á. Carreira-Perpiñán, Magzhan Gabidolla, Arman Zharmagambetov
PDF
Towards Better Gradient Consistency for Neural Signed Distance Functions via Level Set Alignment Baorui Ma, Junsheng Zhou, Yu-Shen Liu, Zhizhong Han
PDF
Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation Dong Zhao, Shuang Wang, Qi Zang, Dou Quan, Xiutiao Ye, Licheng Jiao
PDF
Towards Bridging the Performance Gaps of Joint Energy-Based Models Xiulong Yang, Qing Su, Shihao Ji
PDF
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration Kemal Oksuz, Tom Joy, Puneet K. Dokania
PDF
Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations Lei Hsiung, Yun-Yun Tsai, Pin-Yu Chen, Tsung-Yi Ho
PDF
Towards Domain Generalization for Multi-View 3D Object Detection in Bird-Eye-View Shuo Wang, Xinhai Zhao, Hai-Ming Xu, Zehui Chen, Dameng Yu, Jiahao Chang, Zhen Yang, Feng Zhao
PDF
Towards Effective Adversarial Textured 3D Meshes on Physical Face Recognition Xiao Yang, Chang Liu, Longlong Xu, Yikai Wang, Yinpeng Dong, Ning Chen, Hang Su, Jun Zhu
PDF
Towards Effective Visual Representations for Partial-Label Learning Shiyu Xia, Jiaqi Lv, Ning Xu, Gang Niu, Xin Geng
PDF
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors Gongjie Zhang, Zhipeng Luo, Zichen Tian, Jingyi Zhang, Xiaoqin Zhang, Shijian Lu
PDF
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers Jaehoon Yoo, Semin Kim, Doyup Lee, Chiheon Kim, Seunghoon Hong
PDF
Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu Chang
PDF
Towards Flexible Multi-Modal Document Models Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi
PDF
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu
PDF
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting Gen Li, Jie Ji, Minghai Qin, Wei Niu, Bin Ren, Fatemeh Afghah, Linke Guo, Xiaolong Ma
PDF
Towards Modality-Agnostic Person Re-Identification with Descriptive Query Cuiqun Chen, Mang Ye, Ding Jiang
PDF
Towards Open-World Segmentation of Parts Tai-Yu Pan, Qing Liu, Wei-Lun Chao, Brian Price
PDF
Towards Practical Plug-and-Play Diffusion Models Hyojun Go, Yunsung Lee, Jin-Young Kim, Seunghyun Lee, Myeongho Jeong, Hyun Seung Lee, Seungtaek Choi
PDF
Towards Professional Level Crowd Annotation of Expert Domain Data Pei Wang, Nuno Vasconcelos
PDF
Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency Is All You Need Tong Wei, Kai Gan
PDF
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution Chenfan Qu, Chongyu Liu, Yuliang Liu, Xinhong Chen, Dezhi Peng, Fengjun Guo, Lianwen Jin
PDF
Towards Scalable Neural Representation for Diverse Videos Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
PDF
Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization Li’an Zhuo, Jian Cao, Qi Wang, Bang Zhang, Liefeng Bo
PDF
Towards Transferable Targeted Adversarial Examples Zhibo Wang, Hongshan Yang, Yunhe Feng, Peng Sun, Hengchang Guo, Zhifei Zhang, Kui Ren
PDF
Towards Trustable Skin Cancer Diagnosis via Rewriting Model's Decision Siyuan Yan, Zhen Yu, Xuelin Zhang, Dwarikanath Mahapatra, Shekhar S. Chandra, Monika Janda, Peter Soyer, Zongyuan Ge
PDF
Towards Unbiased Volume Rendering of Neural Implicit Surfaces with Geometry Priors Yongqiang Zhang, Zhipeng Hu, Haoqian Wu, Minda Zhao, Lincheng Li, Zhengxia Zou, Changjie Fan
PDF
Towards Unified Scene Text Spotting Based on Sequence Generation Taeho Kil, Seonghyeon Kim, Sukmin Seo, Yoonsik Kim, Daehee Kim
PDF
Towards Universal Fake Image Detectors That Generalize Across Generative Models Utkarsh Ojha, Yuheng Li, Yong Jae Lee
PDF
Towards Unsupervised Object Detection from LiDAR Point Clouds Lunjun Zhang, Anqi Joyce Yang, Yuwen Xiong, Sergio Casas, Bin Yang, Mengye Ren, Raquel Urtasun
PDF
Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion Davis Rempe, Zhengyi Luo, Xue Bin Peng, Ye Yuan, Kris Kitani, Karsten Kreis, Sanja Fidler, Or Litany
PDF
TRACE: 5d Temporal Regression of Avatars with Dynamic Cameras in 3D Environments Yu Sun, Qian Bao, Wu Liu, Tao Mei, Michael J. Black
PDF
Tracking Multiple Deformable Objects in Egocentric Videos Mingzhen Huang, Xiaoxing Li, Jun Hu, Honghong Peng, Siwei Lyu
PDF
Tracking Through Containers and Occluders in the Wild Basile Van Hoorick, Pavel Tokmakov, Simon Stent, Jie Li, Carl Vondrick
PDF
Trade-Off Between Robustness and Accuracy of Vision Transformers Yanxi Li, Chang Xu
PDF
Train-Once-for-All Personalization Hong-You Chen, Yandong Li, Yin Cui, Mingda Zhang, Wei-Lun Chao, Li Zhang
PDF
Train/Test-Time Adaptation with Retrieval Luca Zancato, Alessandro Achille, Tian Yu Liu, Matthew Trager, Pramuditha Perera, Stefano Soatto
PDF
Trainable Projected Gradient Method for Robust Fine-Tuning Junjiao Tian, Zecheng He, Xiaoliang Dai, Chih-Yao Ma, Yen-Cheng Liu, Zsolt Kira
PDF
Training Debiased Subnetworks with Contrastive Weight Pruning Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye
PDF
Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting Xiaogang Peng, Siyuan Mao, Zizhao Wu
PDF
Transductive Few-Shot Learning with Prototype-Based Label Propagation by Iterative Graph Refinement Hao Zhu, Piotr Koniusz
PDF
Transfer Knowledge from Head to Tail: Uncertainty Calibration Under Long-Tailed Distribution Jiahao Chen, Bing Su
PDF
Transfer4D: A Framework for Frugal Motion Capture and Deformation Transfer Shubh Maheshwari, Rahul Narain, Ramya Hebbalaguppe
PDF
Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization Jianping Zhang, Yizhan Huang, Weibin Wu, Michael R. Lyu
PDF
TransFlow: Transformer as Flow Learner Yawen Lu, Qifan Wang, Siqi Ma, Tong Geng, Yingjie Victor Chen, Huaijin Chen, Dongfang Liu
PDF
Transformer Scale Gate for Semantic Segmentation Hengcan Shi, Munawar Hayat, Jianfei Cai
PDF
Transformer-Based Learned Optimization Erik Gärtner, Luke Metz, Mykhaylo Andriluka, C. Daniel Freeman, Cristian Sminchisescu
PDF
Transformer-Based Unified Recognition of Two Hands Manipulating Objects Hoseong Cho, Chanwoo Kim, Jihyeon Kim, Seongyeong Lee, Elkhan Ismayilzada, Seungryul Baek
PDF
Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization Zicheng Zhang, Yinglu Liu, Congying Han, Yingwei Pan, Tiande Guo, Ting Yao
PDF
TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning with Structure-Trajectory Prompted Reconstruction for Person Re-Identification Haocong Rao, Chunyan Miao
PDF
Trap Attention: Monocular Depth Estimation with Manual Traps Chao Ning, Hongping Gan
PDF
Tree Instance Segmentation with Temporal Contour Graph Adnan Firoze, Cameron Wingren, Raymond A. Yeh, Bedrich Benes, Daniel Aliaga
PDF
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction Yuanhui Huang, Wenzhao Zheng, Yunpeng Zhang, Jie Zhou, Jiwen Lu
PDF
TriDet: Temporal Action Detection with Relative Boundary Modeling Dingfeng Shi, Yujie Zhong, Qiong Cao, Lin Ma, Jia Li, Dacheng Tao
PDF
TriVol: Point Cloud Rendering via Triple Volumes Tao Hu, Xiaogang Xu, Ruihang Chu, Jiaya Jia
PDF
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets Weixin Chen, Dawn Song, Bo Li
PDF
TrojViT: Trojan Insertion in Vision Transformers Mengxin Zheng, Qian Lou, Lei Jiang
PDF
TruFor: Leveraging All-Round Clues for Trustworthy Image Forgery Detection and Localization Fabrizio Guillaro, Davide Cozzolino, Avneesh Sud, Nicholas Dufour, Luisa Verdoliva
PDF
TryOnDiffusion: A Tale of Two UNets Luyang Zhu, Dawei Yang, Tyler Zhu, Fitsum Reda, William Chan, Chitwan Saharia, Mohammad Norouzi, Ira Kemelmacher-Shlizerman
PDF
TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation Taeyeop Lee, Jonathan Tremblay, Valts Blukis, Bowen Wen, Byeong-Uk Lee, Inkyu Shin, Stan Birchfield, In So Kweon, Kuk-Jin Yoon
PDF
Tunable Convolutions with Parametric Multi-Loss Optimization Matteo Maggioni, Thomas Tanay, Francesca Babiloni, Steven McDonagh, Aleš Leonardis
PDF
Turning a CLIP Model into a Scene Text Detector Wenwen Yu, Yuliang Liu, Wei Hua, Deqiang Jiang, Bo Ren, Xiang Bai
PDF
Turning Strengths into Weaknesses: A Certified Robustness Inspired Attack Framework Against Graph Neural Networks Binghui Wang, Meng Pang, Yun Dong
PDF
Twin Contrastive Learning with Noisy Labels Zhizhong Huang, Junping Zhang, Hongming Shan
PDF
TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization Ziquan Liu, Yi Xu, Xiangyang Ji, Antoni B. Chan
PDF
Two-Shot Video Object Segmentation Kun Yan, Xiao Li, Fangyun Wei, Jinglu Wang, Chenbin Zhang, Ping Wang, Yan Lu
PDF
Two-Stage Co-Segmentation Network Based on Discriminative Representation for Recovering Human Mesh from Videos Boyang Zhang, Kehua Ma, Suping Wu, Zhixiang Yuan
PDF
Two-Stream Networks for Weakly-Supervised Temporal Action Localization with Semantic-Aware Mechanisms Yu Wang, Yadong Li, Hongbin Wang
PDF
Two-View Geometry Scoring Without Correspondences Axel Barroso-Laguna, Eric Brachmann, Victor Adrian Prisacariu, Gabriel J. Brostow, Daniyar Turmukhambetov
PDF
Two-Way Multi-Label Loss Takumi Kobayashi
PDF
UDE: A Unified Driving Engine for Human Motion Generation Zixiang Zhou, Baoyuan Wang
PDF
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese
PDF
Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark Deyi Ji, Feng Zhao, Hongtao Lu, Mingyuan Tao, Jieping Ye
PDF
Ultrahigh Resolution Image/Video Matting with Spatio-Temporal Sparsity Yanan Sun, Chi-Keung Tang, Yu-Wing Tai
PDF
UMat: Uncertainty-Aware Single Image High Resolution Material Capture Carlos Rodriguez-Pardo, Henar Domínguez-Elvira, David Pascual-Hernández, Elena Garces
PDF
Unbalanced Optimal Transport: A Unified Framework for Object Detection Henri De Plaen, Pierre-François De Plaen, Johan A. K. Suykens, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool
PDF
Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection Hui Lv, Zhongqi Yue, Qianru Sun, Bin Luo, Zhen Cui, Hanwang Zhang
PDF
Unbiased Scene Graph Generation in Videos Sayak Nag, Kyle Min, Subarna Tripathi, Amit K. Roy-Chowdhury
PDF
Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection Fan Lu, Kai Zhu, Wei Zhai, Kecheng Zheng, Yang Cao
PDF
Uncertainty-Aware Unsupervised Image Deblurring with Deep Residual Prior Xiaole Tang, Xile Zhao, Jun Liu, Jianli Wang, Yuchun Miao, Tieyong Zeng
PDF
Uncertainty-Aware Vision-Based Metric Cross-View Geolocalization Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner, Michael Arens, Rainer Stiefelhagen
PDF
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models Qiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui, Tong Yu, Zhe Lin, Yang Zhang, Shiyu Chang
PDF
Uncovering the Missing Pattern: Unified Framework Towards Trajectory Imputation and Prediction Yi Xu, Armin Bazarjani, Hyung-gun Chi, Chiho Choi, Yun Fu
PDF
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias Noa Garcia, Yusuke Hirota, Yankun Wu, Yuta Nakashima
PDF
Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning Qian Jiang, Changyou Chen, Han Zhao, Liqun Chen, Qing Ping, Son Dinh Tran, Yi Xu, Belinda Zeng, Trishul Chilimbi
PDF
Understanding and Improving Features Learned in Deep Functional Maps Souhaib Attaiki, Maks Ovsjanikov
PDF
Understanding and Improving Visual Prompting: A Label-Mapping Perspective Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, Sijia Liu
PDF
Understanding Deep Generative Models with Generalized Empirical Likelihoods Suman Ravuri, Mélanie Rey, Shakir Mohamed, Marc Peter Deisenroth
PDF
Understanding Imbalanced Semantic Segmentation Through Neural Collapse Zhisheng Zhong, Jiequan Cui, Yibo Yang, Xiaoyang Wu, Xiaojuan Qi, Xiangyu Zhang, Jiaya Jia
PDF
Understanding Masked Autoencoders via Hierarchical Latent Variable Models Lingjing Kong, Martin Q. Ma, Guangyi Chen, Eric P. Xing, Yuejie Chi, Louis-Philippe Morency, Kun Zhang
PDF
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature Xiangwen Kong, Xiangyu Zhang
PDF
Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving Zijian Zhu, Yichi Zhang, Hai Chen, Yinpeng Dong, Shu Zhao, Wenbo Ding, Jiachen Zhong, Shibao Zheng
PDF
Uni-Perceiver V2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai
PDF
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao
PDF
Unicode Analogies: An Anti-Objectivist Visual Reasoning Challenge Steven Spratley, Krista A. Ehinger, Tim Miller
PDF
UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration Jingyi Zhang, Jiaxing Huang, Xiaoqin Zhang, Shijian Lu
PDF
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy Yinzhen Xu, Weikang Wan, Jialiang Zhang, Haoran Liu, Zikang Shan, Hao Shen, Ruicheng Wang, Haoran Geng, Yijia Weng, Jiayi Chen, Tengyu Liu, Li Yi, He Wang
PDF
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View Shengchao Zhou, Weizhou Liu, Chen Hu, Shuchang Zhou, Chao Ma
PDF
Unified Keypoint-Based Action Recognition Framework via Structured Keypoint Pooling Ryo Hachiuma, Fumiaki Sato, Taiki Sekii
PDF
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation Liulei Li, Wenguan Wang, Tianfei Zhou, Jianwu Li, Yi Yang
PDF
Unified Pose Sequence Modeling Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu
PDF
Unifying Layout Generation with a Decoupled Diffusion Model Mude Hui, Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu
PDF
Unifying Short and Long-Term Tracking with Graph Hierarchies Orcun Cetintas, Guillem Brasó, Laura Leal-Taixé
PDF
Unifying Vision, Text, and Layout for Universal Document Processing Zineng Tang, Ziyi Yang, Guoxin Wang, Yuwei Fang, Yang Liu, Chenguang Zhu, Michael Zeng, Cha Zhang, Mohit Bansal
PDF
UniHCP: A Unified Model for Human-Centric Perceptions Yuanzheng Ci, Yizhou Wang, Meilin Chen, Shixiang Tang, Lei Bai, Feng Zhu, Rui Zhao, Fengwei Yu, Donglian Qi, Wanli Ouyang
PDF
UniSim: A Neural Closed-Loop Sensor Simulator Ze Yang, Yun Chen, Jingkang Wang, Sivabalan Manivasagam, Wei-Chiu Ma, Anqi Joyce Yang, Raquel Urtasun
PDF
Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel
PDF
Universal Instance Perception as Object Discovery and Retrieval Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu
PDF
Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown Objects Wenteng Liang, Feng Xue, Yihao Liu, Guofeng Zhong, Anlong Ming
PDF
Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples Jiaming Zhang, Xingjun Ma, Qi Yi, Jitao Sang, Yu-Gang Jiang, Yaowei Wang, Changsheng Xu
PDF
Unpaired Image-to-Image Translation with Shortest Path Regularization Shaoan Xie, Yanwu Xu, Mingming Gong, Kun Zhang
PDF
Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous Driving Bo Pang, Hongchi Xia, Cewu Lu
PDF
Unsupervised 3D Shape Reconstruction by Part Retrieval and Assembly Xianghao Xu, Paul Guerrero, Matthew Fisher, Siddhartha Chaudhuri, Daniel Ritchie
PDF
Unsupervised Continual Semantic Adaptation Through Neural Rendering Zhizheng Liu, Francesco Milano, Jonas Frey, Roland Siegwart, Hermann Blum, Cesar Cadena
PDF
Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses Junbong Jang, Kwonmoo Lee, Tae-Kyun Kim
PDF
Unsupervised Cumulative Domain Adaptation for Foggy Scene Optical Flow Hanyu Zhou, Yi Chang, Wending Yan, Luxin Yan
PDF
Unsupervised Deep Asymmetric Stereo Matching with Spatially-Adaptive Self-Similarity Taeyong Song, Sunok Kim, Kwanghoon Sohn
PDF
Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration Guofeng Mei, Hao Tang, Xiaoshui Huang, Weijie Wang, Juan Liu, Jian Zhang, Luc Van Gool, Qiang Wu
PDF
Unsupervised Domain Adaption with Pixel-Level Discriminator for Image-Aware Layout Generation Chenchen Xu, Min Zhou, Tiezheng Ge, Yuning Jiang, Weiwei Xu
PDF
Unsupervised Inference of Signed Distance Functions from Single Sparse Point Clouds Without Learning Priors Chao Chen, Yu-Shen Liu, Zhizhong Han
PDF
Unsupervised Intrinsic Image Decomposition with LiDAR Intensity Shogo Sato, Yasuhiro Yao, Taiga Yoshida, Takuhiro Kaneko, Shingo Ando, Jun Shimamura
PDF
Unsupervised Object Localization: Observing the Background to Discover Objects Oriane Siméoni, Chloé Sekkat, Gilles Puy, Antonín Vobecký, Éloi Zablocki, Patrick Pérez
PDF
Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction Guangyi Chen, Zhenhao Chen, Shunxing Fan, Kun Zhang
PDF
Unsupervised Space-Time Network for Temporally-Consistent Segmentation of Multiple Motions Etienne Meunier, Patrick Bouthemy
PDF
Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning Zesen Wu, Mang Ye
PDF
Unsupervised Volumetric Animation Aliaksandr Siarohin, Willi Menapace, Ivan Skorokhodov, Kyle Olszewski, Jian Ren, Hsin-Ying Lee, Menglei Chai, Sergey Tulyakov
PDF
Upcycling Models Under Domain and Category Shift Sanqing Qu, Tianpei Zou, Florian Röhrbein, Cewu Lu, Guang Chen, Dacheng Tao, Changjun Jiang
PDF
Use Your Head: Improving Long-Tail Video Recognition Toby Perrett, Saptarshi Sinha, Tilo Burghardt, Majid Mirmehdi, Dima Damen
PDF
UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement Sisi You, Hantao Yao, Bing-Kun Bao, Changsheng Xu
PDF
UV Volumes for Real-Time Rendering of Editable Free-View Human Performance Yue Chen, Xuan Wang, Xingyu Chen, Qi Zhang, Xiaoyu Li, Yu Guo, Jue Wang, Fei Wang
PDF
V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception Runsheng Xu, Xin Xia, Jinlong Li, Hanzhao Li, Shuo Zhang, Zhengzhong Tu, Zonglin Meng, Hao Xiang, Xiaoyu Dong, Rui Song, Hongkai Yu, Bolei Zhou, Jiaqi Ma
PDF
V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting Haibao Yu, Wenxian Yang, Hongzhi Ruan, Zhenwei Yang, Yingjuan Tang, Xu Gao, Xin Hao, Yifeng Shi, Yifeng Pan, Ning Sun, Juan Song, Jirui Yuan, Ping Luo, Zaiqing Nie
PDF
Variational Distribution Learning for Unsupervised Text-to-Image Generation Minsoo Kang, Doyup Lee, Jiseob Kim, Saehoon Kim, Bohyung Han
PDF
VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization Bingfan Zhu, Yanchao Yang, Xulong Wang, Youyi Zheng, Leonidas Guibas
PDF
VecFontSDF: Learning to Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions Zeqing Xia, Bojun Xiong, Zhouhui Lian
PDF
Vector Quantization with Self-Attention for Quality-Independent Representation Learning Zhou Yang, Weisheng Dong, Xin Li, Mengluan Huang, Yulin Sun, Guangming Shi
PDF
VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation Bingchen Yang, Haiyong Jiang, Hao Pan, Jun Xiao
PDF
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models Ajay Jain, Amber Xie, Pieter Abbeel
PDF
VGFlow: Visibility Guided Flow Network for Human Reposing Rishabh Jain, Krishna Kumar Singh, Mayur Hemani, Jingwan Lu, Mausoom Sarkar, Duygu Ceylan, Balaji Krishnamurthy
PDF
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-Supervised Scene Decomposition Chen Guo, Tianjian Jiang, Xu Chen, Jie Song, Otmar Hilliges
PDF
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo, Antoine Miech, Jordi Pont-Tuset, Ivan Laptev, Josef Sivic, Cordelia Schmid
PDF
Video Compression with Entropy-Constrained Neural Representations Carlos Gomes, Roberto Azevedo, Christopher Schroers
PDF
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng
PDF
Video Event Restoration Based on Keyframes for Video Anomaly Detection Zhiwei Yang, Jing Liu, Zhaoyang Wu, Peng Wu, Xiaotao Liu
PDF
Video Probabilistic Diffusion Models in Projected Latent Space Sihyun Yu, Kihyuk Sohn, Subin Kim, Jinwoo Shin
PDF
Video Test-Time Adaptation for Action Recognition Wei Lin, Muhammad Jehanzeb Mirza, Mateusz Kozinski, Horst Possegger, Hilde Kuehne, Horst Bischof
PDF
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen
PDF
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao
PDF
VideoTrack: Learning to Track Objects via Video Transformer Fei Xie, Lei Chu, Jiahao Li, Yan Lu, Chao Ma
PDF
ViewNet: A Novel Projection-Based Backbone with View Pooling for Few-Shot Point Cloud Classification Jiajing Chen, Minmin Yang, Senem Velipasalar
PDF
Viewpoint Equivariance for Multi-View 3D Object Detection Dian Chen, Jie Li, Vitor Guizilini, Rares Andrei Ambrus, Adrien Gaidon
PDF
VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining Junjie Ke, Keren Ye, Jiahui Yu, Yonghui Wu, Peyman Milanfar, Feng Yang
PDF
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval Yuxin Chen, Zongyang Ma, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Weiming Hu, Xiaohu Qie, Jianping Wu
PDF
VindLU: A Recipe for Effective Video-and-Language Pretraining Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius
PDF
ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries Junru Gu, Chenxu Hu, Tianyuan Zhang, Xuanyao Chen, Yilun Wang, Yue Wang, Hang Zhao
PDF
ViPLO: Vision Transformer Based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection Jeeseung Park, Jin-Woo Park, Jong-Seok Lee
PDF
Virtual Occlusions Through Implicit Depth Jamie Watson, Mohamed Sayed, Zawar Qureshi, Gabriel J. Brostow, Sara Vicente, Oisin Mac Aodha, Michael Firman
PDF
Virtual Sparse Convolution for Multimodal 3D Object Detection Hai Wu, Chenglu Wen, Shaoshuai Shi, Xin Li, Cheng Wang
PDF
VisFusion: Visibility-Aware Online 3D Scene Reconstruction from Videos Huiyu Gao, Wei Mao, Miaomiao Liu
PDF
Visibility Aware Human-Object Interaction Tracking from Single RGB Camera Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll
PDF
Visibility Constrained Wide-Band Illumination Spectrum Design for Seeing-in-the-Dark Muyao Niu, Zhuoxiao Li, Zhihang Zhong, Yinqiang Zheng
PDF
Vision Transformers Are Good Mask Auto-Labelers Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, Jose M. Alvarez, Anima Anandkumar
PDF
Vision Transformers Are Parameter-Efficient Audio-Visual Learners Yan-Bo Lin, Yi-Lin Sung, Jie Lei, Mohit Bansal, Gedas Bertasius
PDF
Visual Atoms: Pre-Training Vision Transformers with Sinusoidal Waves Sora Takashima, Ryo Hayamizu, Nakamasa Inoue, Hirokatsu Kataoka, Rio Yokota
PDF
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
PDF
Visual DNA: Representing and Comparing Images Using Distributions of Neuron Activations Benjamin Ramtoula, Matthew Gadd, Paul Newman, Daniele De Martini
PDF
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang
PDF
Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images Ming Y. Lu, Bowen Chen, Andrew Zhang, Drew F. K. Williamson, Richard J. Chen, Tong Ding, Long Phi Le, Yung-Sung Chuang, Faisal Mahmood
PDF
Visual Localization Using Imperfect 3D Models from the Internet Vojtech Panek, Zuzana Kukelova, Torsten Sattler
PDF
Visual Programming: Compositional Visual Reasoning Without Training Tanmay Gupta, Aniruddha Kembhavi
PDF
Visual Prompt Multi-Modal Tracking Jiawen Zhu, Simiao Lai, Xin Chen, Dong Wang, Huchuan Lu
PDF
Visual Prompt Tuning for Generative Transfer Learning Kihyuk Sohn, Huiwen Chang, José Lezama, Luisa Polania, Han Zhang, Yuan Hao, Irfan Essa, Lu Jiang
PDF
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning Cheng-Hao Tu, Zheda Mai, Wei-Lun Chao
PDF
Visual Recognition by Request Chufeng Tang, Lingxi Xie, Xiaopeng Zhang, Xiaolin Hu, Qi Tian
PDF
Visual Recognition-Driven Image Restoration for Multiple Degradation with Intrinsic Semantics Recovery Zizheng Yang, Jie Huang, Jiahao Chang, Man Zhou, Hu Yu, Jinghao Zhang, Feng Zhao
PDF
Visual-Language Prompt Tuning with Knowledge-Guided Context Optimization Hantao Yao, Rui Zhang, Changsheng Xu
PDF
Visual-Tactile Sensing for In-Hand Object Reconstruction Wenqiang Xu, Zhenjun Yu, Han Xue, Ruolin Ye, Siqiong Yao, Cewu Lu
PDF
Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting Syed Talal Wasim, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah
PDF
ViTs for SITS: Vision Transformers for Satellite Image Time Series Michail Tarasiou, Erik Chavez, Stefanos Zafeiriou
PDF
VIVE3D: Viewpoint-Independent Video Editing Using 3D-Aware GANs Anna Frühstück, Nikolaos Sarafianos, Yuanlu Xu, Peter Wonka, Tony Tung
PDF
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud Ziqin Wang, Bowen Cheng, Lichen Zhao, Dong Xu, Yang Tang, Lu Sheng
PDF
VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision Mengyin Liu, Jie Jiang, Chao Zhu, Xu-Cheng Yin
PDF
vMAP: Vectorised Object Mapping for Neural Field SLAM Xin Kong, Shikun Liu, Marwan Taher, Andrew J. Davison
PDF
VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution Jaeill Kim, Suhyun Kang, Duhun Hwang, Jungwook Shin, Wonjong Rhee
PDF
VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction Yufan Ren, Fangjinhua Wang, Tong Zhang, Marc Pollefeys, Sabine Süsstrunk
PDF
VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval Siteng Huang, Biao Gong, Yulin Pan, Jianwen Jiang, Yiliang Lv, Yuyuan Li, Donglin Wang
PDF
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia
PDF
VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion Yiming Li, Zhiding Yu, Christopher Choy, Chaowei Xiao, Jose M. Alvarez, Sanja Fidler, Chen Feng, Anima Anandkumar
PDF
VQACL: A Novel Visual Question Answering Continual Learning Setting Xi Zhang, Feifei Zhang, Changsheng Xu
PDF
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro
PDF
Wavelet Diffusion Models Are Fast and Scalable Image Generators Hao Phung, Quan Dao, Anh Tran
PDF
Weak-Shot Object Detection Through Mutual Knowledge Transfer Xuanyi Du, Weitao Wan, Chong Sun, Chen Li
PDF
Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving Ruibo Li, Hanyu Shi, Ziang Fu, Zhe Wang, Guosheng Lin
PDF
Weakly Supervised Monocular 3D Object Detection Using Multi-View Projection and Direction Consistency Runzhou Tao, Wencheng Han, Zhongying Qiu, Cheng-Zhong Xu, Jianbing Shen
PDF
Weakly Supervised Posture Mining for Fine-Grained Classification Zhenchao Tang, Hualin Yang, Calvin Yu-Chian Chen
PDF
Weakly Supervised Segmentation with Point Annotations for Histopathology Images via Contrast-Based Variational Model Hongrun Zhang, Liam Burrows, Yanda Meng, Declan Sculthorpe, Abhik Mukherjee, Sarah E. Coupland, Ke Chen, Yalin Zheng
PDF
Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor Hyeokjun Kweon, Sung-Hoon Yoon, Kuk-Jin Yoon
PDF
Weakly Supervised Temporal Sentence Grounding with Uncertainty-Guided Self-Training Yifei Huang, Lijin Yang, Yoichi Sato
PDF
Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Erasing Network Zhicheng Zhang, Lijuan Wang, Jufeng Yang
PDF
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos Sixun Dong, Huazhang Hu, Dongze Lian, Weixin Luo, Yicheng Qian, Shenghua Gao
PDF
Weakly-Supervised Domain Adaptive Semantic Segmentation with Prototypical Contrastive Learning Anurag Das, Yongqin Xian, Dengxin Dai, Bernt Schiele
PDF
Weakly-Supervised Single-View Image Relighting Renjiao Yi, Chenyang Zhu, Kai Xu
PDF
WeatherStream: Light Transport Automation of Single Image Deweathering Howard Zhang, Yunhao Ba, Ethan Yang, Varan Mehra, Blake Gella, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Alex Wong, Achuta Kadambi
PDF
What Can Human Sketches Do for Object Detection? Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song
PDF
What Happened 3 Seconds Ago? Inferring the past with Thermal Imaging Zitian Tang, Wenjie Ye, Wei-Chiu Ma, Hang Zhao
PDF
What You Can Reconstruct from a Shadow Ruoshi Liu, Sachit Menon, Chengzhi Mao, Dennis Park, Simon Stent, Carl Vondrick
PDF
Where Is My Spot? Few-Shot Image Generation via Latent Subspace Optimization Chenxi Zheng, Bangzhen Liu, Huaidong Zhang, Xuemiao Xu, Shengfeng He
PDF
Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization Mengmeng Xu, Yanghao Li, Cheng-Yang Fu, Bernard Ghanem, Tao Xiang, Juan-Manuel Pérez-Rúa
PDF
Where We Are and What We're Looking at: Query Based Worldwide Image Geo-Localization Using Hierarchies and Scenes Brandon Clark, Alec Kerrigan, Parth Parag Kulkarni, Vicente Vivanco Cepeda, Mubarak Shah
PDF
Why Is the Winner the Best? Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu D. Tizabi, Fabian Isensee, Tim J. Adler, Sharib Ali, Vincent Andrearczyk, Marc Aubreville, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Veronika Cheplygina, Marie Daum, Marleen de Bruijne, Adrien Depeursinge, Reuben Dorent, Jan Egger, David G. Ellis, Sandy Engelhardt, Melanie Ganz, Noha Ghatwary, Gabriel Girard, Patrick Godau, Anubha Gupta, Lasse Hansen, Kanako Harada, Mattias P. Heinrich, Nicholas Heller, Alessa Hering, Arnaud Huaulmé, Pierre Jannin, Ali Emre Kavur, Oldřich Kodym, Michal Kozubek, Jianning Li, Hongwei Li, Jun Ma, Carlos Martín-Isla, Bjoern Menze, Alison Noble, Valentin Oreiller, Nicolas Padoy, Sarthak Pati, Kelly Payette, Tim Rädsch, Jonathan Rafael-Patiño, Vivek Singh Bawa, Stefanie Speidel, Carole H. Sudre, Kimberlin van Wijnen, Martin Wagner, Donglai Wei, Amine Yamlahi, Moi Hoon Yap, Chun Yuan, Maximilian Zenk, Aneeq Zia, David Zimmerer, Dogu Baran Aydogan, Binod Bhattarai, Louise Bloch, Raphael Brüngel, Jihoon Cho, Chanyeol Choi, Qi Dou, Ivan Ezhov, Christoph M. Friedrich, Clifton D. Fuller, Rebati Raman Gaire, Adrian Galdran, Álvaro García Faura, Maria Grammatikopoulou, SeulGi Hong, Mostafa Jahanifar, Ikbeom Jang, Abdolrahim Kadkhodamohammadi, Inha Kang, Florian Kofler, Satoshi Kondo, Hugo Kuijf, Mingxing Li, Minh Luu, Tomaž Martinčič, Pedro Morais, Mohamed A. Naser, Bruno Oliveira, David Owen, Subeen Pang, Jinah Park, Sung-Hong Park, Szymon Plotka, Elodie Puybareau, Nasir Rajpoot, Kanghyun Ryu, Numan Saeed, Adam Shephard, Pengcheng Shi, Dejan Štepec, Ronast Subedi, Guillaume Tochon, Helena R. Torres, Helene Urien, João L. Vilaça, Kareem A. Wahid, Haojie Wang, Jiacheng Wang, Liansheng Wang, Xiyue Wang, Benedikt Wiestler, Marek Wodzinski, Fangfang Xia, Juanying Xie, Zhiwei Xiong, Sen Yang, Yanwu Yang, Zixuan Zhao, Klaus Maier-Hein, Paul F. Jäger, Annette Kopp-Schneider, Lena Maier-Hein
PDF
Wide-Angle Rectification via Content-Aware Conformal Mapping Qi Zhang, Hongdong Li, Qing Wang
PDF
WildLight: In-the-Wild Inverse Rendering with a Flashlight Ziang Cheng, Junxuan Li, Hongdong Li
PDF
WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation Jongheon Jeong, Yang Zou, Taewan Kim, Dongqing Zhang, Avinash Ravichandran, Onkar Dabeer
PDF
WINNER: Weakly-Supervised hIerarchical decompositioN and aligNment for Spatio-tEmporal Video gRounding Mengze Li, Han Wang, Wenqiao Zhang, Jiaxu Miao, Zhou Zhao, Shengyu Zhang, Wei Ji, Fei Wu
PDF
WIRE: Wavelet Implicit Neural Representations Vishwanath Saragadam, Daniel LeJeune, Jasper Tan, Guha Balakrishnan, Ashok Veeraraghavan, Richard G. Baraniuk
PDF
X-Avatar: Expressive Human Avatars Kaiyue Shen, Chen Guo, Manuel Kaufmann, Juan Jose Zarate, Julien Valentin, Jie Song, Otmar Hilliges
PDF
X-Pruner: eXplainable Pruning for Vision Transformers Lu Yu, Wei Xiang
PDF
X3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection Marvin Klingner, Shubhankar Borse, Varun Ravi Kumar, Behnaz Rezaei, Venkatraman Narayanan, Senthil Yogamani, Fatih Porikli
PDF
YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao
PDF
You Are Catching My Attention: Are Vision Transformers Bad Learners Under Backdoor Attacks? Zenghui Yuan, Pan Zhou, Kai Zou, Yu Cheng
PDF
You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos Xiang Fang, Daizong Liu, Pan Zhou, Guoshun Nan
PDF
You Do Not Need Additional Priors or Regularizers in Retinex-Based Low-Light Image Enhancement Huiyuan Fu, Wenkai Zheng, Xiangyu Meng, Xin Wang, Chuanming Wang, Huadong Ma
PDF
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model Shengkun Tang, Yaqing Wang, Zhenglun Kong, Tianchi Zhang, Yao Li, Caiwen Ding, Yanzhi Wang, Yi Liang, Dongkuan Xu
PDF
You Only Segment Once: Towards Real-Time Panoptic Segmentation Jie Hu, Linyan Huang, Tianhe Ren, Shengchuan Zhang, Rongrong Ji, Liujuan Cao
PDF
ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection Yongqi An, Xu Zhao, Tao Yu, Haiyun Guo, Chaoyang Zhao, Ming Tang, Jinqiao Wang
PDF
ZegCLIP: Towards Adapting CLIP for Zero-Shot Semantic Segmentation Ziqin Zhou, Yinjie Lei, Bowen Zhang, Lingqiao Liu, Yifan Liu
PDF
Zero-Shot Dual-Lens Super-Resolution Ruikang Xu, Mingde Yao, Zhiwei Xiong
PDF
Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style Fengyin Lin, Mingkang Li, Da Li, Timothy Hospedales, Yi-Zhe Song, Yonggang Qi
PDF
Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning Jiayi Guo, Chaofei Wang, You Wu, Eric Zhang, Kai Wang, Xingqian Xu, Shiji Song, Humphrey Shi, Gao Huang
PDF
Zero-Shot Model Diagnosis Jinqi Luo, Zhaoning Wang, Chen Henry Wu, Dong Huang, Fernando De la Torre
PDF
Zero-Shot Noise2Noise: Efficient Image Denoising Without Any Data Youssef Mansour, Reinhard Heckel
PDF
Zero-Shot Object Counting Jingyi Xu, Hieu Le, Vu Nguyen, Viresh Ranjan, Dimitris Samaras
PDF
Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters Jiashun Wang, Xueting Li, Sifei Liu, Shalini De Mello, Orazio Gallo, Xiaolong Wang, Jan Kautz
PDF
Zero-Shot Referring Image Segmentation with Global-Local Context Features Seonghoon Yu, Paul Hongsuck Seo, Jeany Son
PDF
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation Rui Zhao, Wei Li, Zhipeng Hu, Lincheng Li, Zhengxia Zou, Zhenwei Shi, Changjie Fan
PDF