ECCVW 2024
552 papers
✉ AirLetters 💨: An Open Video 🎞 Dataset of Characters Drawn in the Air
Rishit Dagli, Guillaume Berger, Joanna Materzynska, Ingo Bax, Roland Memisevic
3D Object Detection and Tracking Refinement with Ensemble Methods and Spatiotemporal Filtering
Sandesh Rajendra Jain, Surendrabikram Thapa, Sanjana Bharadwaj, Abhijit Sarkar, A. Lynn Abbott, Jianhua Xuan
7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition
Dimitrios Kollias, Stefanos Zafeiriou, Irene Kotsia, Abhinav Dhall, Shreya Ghosh, Chunchang Shao, Guanyu Hu
A Bottom-up Approach to Class-Agnostic Image Segmentation
Sebastian Dille, Ari Blondal, Sylvain Paris, Yagiz Aksoy
A CycleGAN Model to Synthesize Missing and Unpaired MRI Sequences for Under-Represented Multiple Sclerosis Lesions
Flavio D'Amato, Alessia Cipriani, Alessandro Di Matteo, Daniele Lozzi, Enrico Mattei, Matteo Polsinelli, Giuseppe Placidi
A Data-Centric Module for Neural Rendering
Emanuele Balloni, Lorenzo Stacchio, Lucrezia Gorgoglione, Marina Paolanti, Roberto Pierdicca, Adriano Mancini, Emanuele Frontoni, Primo Zingaretti
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
Carlo Sgaravatti, Roberto Basla, Riccardo Pieroni, Matteo Corno, Sergio M. Savaresi, Luca Magri, Giacomo Boracchi
A Simple Approach to Pavement Cell Segmentation
Rostislav Shepel, Andrew Romanowski, Mario Valerio Giuffrida
A Spitting Image: Modular Superpixel Tokenization in Vision Transformers
Marius Aasan, Odd Kolbjørnsen, Anne H. Schistad Solberg, Adín Ramírez Rivera
A Vision-Based Framework for Human Behavior Understanding in Industrial Assembly Lines
Konstantinos E. Papoutsakis, Nikolaos Bakalos, Konstantinos Fragkoulis, Athena Zacharia, Georgia Kapetadimitri, Maria Pateraki
AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data
Mirko Zaffaroni, Federico Signoretta, Marco Grangetto, Attilio Fiandrotti
ABAW7 Challenge: A Facial Affect Recognition Approach Based on Transformer Encoder and Multilayer Perceptron
Xuxiong Liu, Kang Shen, Jun Yao, Boyan Wang, Yu Wang, Yujie Guan, Xin Liu, Gengchen Li, Liuwei An, Zishun Cui, Minrui Liu, Xiao Sun, Weijie Feng
Across-Game Engagement Modelling via Few-Shot Learning
Kosmas Pinitas, Konstantinos Makantasis, Georgios N. Yannakakis
Adversarial Attacks on Hyperbolic Networks
Max van Spengler, Jan Zahálka, Pascal Mettes
Affective Behaviour Analysis via Progressive Learning
Chen Liu, Wei Zhang, Feng Qiu, Lincheng Li, Dadong Wang, Xin Yu
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results
Maksim Smirnov, Aleksandr Gushchin, Anastasia Antsiferova, Dmitriy S. Vatolin, Radu Timofte, Ziheng Jia, Zicheng Zhang, Wei Sun, Jiaying Qian, Yuqin Cao, Yinan Sun, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Kanjar De, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Wenhui Meng, Xiaoheng Tan, Haiqiang Wang, Xiaozhong Xu, Shan Liu, Zhenzhong Chen, Zhengxue Cheng, Jiahao Xiao, Jun Xu, Chenlong He, Qi Zheng, Ruoxi Zhu, Min Li, Yibo Fan, Zhengzhong Tu
AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content
Marcos V. Conde, Zhijun Lei, Wen Li, Christos G. Bampis, Ioannis Katsavounidis, Radu Timofte, Qing Luo, Jie Song, Linyan Jiang, Haibo Lei, Yaqing Li, Ziqi Luo, Rongkang Dong, Cuixin Yang, Zongqi He, Jun Xiao, Zhe Xiao, Yushen Zuo, Zihang Lyu, Kin-Man Lam, Yuxuan Jiang, Jakub Nawala, Chen Feng, Fan Zhang, Xiaoqing Zhu, Joel Sole, David Bull, Jae-Hyeon Lee, Dong-Hyeop Son, Ui-Jin Choi, Mingjun Zheng, Zhongbao Yang, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang
AIM 2024 Challenge on UHD Blind Photo Quality Assessment
Vlad Hosu, Marcos V. Conde, Lorenzo Agnolucci, Nabajeet Barman, Saman Zadtootaghaj, Radu Timofte, Wei Sun, Weixia Zhang, Yuqin Cao, Linhan Cao, Jun Jia, Zijian Chen, Zicheng Zhang, Xiongkuo Min, Guangtao Zhai, Songbai Tan, Lixin Zhang, Guanghui Yue, Daekyu Kwon, Dongyoung Kim, Seon Joo Kim, Yunchen Zhang, Xiangkai Xu, Hong Gao, Yiming Bao, Ji Shi, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu, Zewen Chen, Shunhan Xu, Haochen Guo, Yun Zeng, Shuai Liu, Jian Guo, Juan Wang, Bing Li, Dehua Liu, Hesong Liu, Grigory Malivenko, Asile Gerek, Xingyuan Ma, Cheng Li, Joonhee Lee, Junseo Bang, Se Young Chun
AIM 2024 Challenge on Video Saliency Prediction: Methods and Results
Andrey Moskalenko, Alexey Bryncev, Dmitry S. Vatolin, Radu Timofte, Gen Zhan, Li Yang, Yunlong Tang, Yiting Liao, Jiongzhi Lin, Baitao Huang, Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo, Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai, Hao Fang, Runmin Cong, Xiankai Lu, Xiaofei Zhou, Wei Zhang, Chunyu Zhao, Wentao Mu, Tao Deng, Hamed R. Tavakoli
AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Ivan Molodetskikh, Artem Borisov, Dmitriy S. Vatolin, Radu Timofte, Jianzhao Liu, Tianwu Zhi, Yabin Zhang, Yang Li, Jingwen Xu, Yiting Liao, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Yuqin Cao, Wei Sun, Weixia Zhang, Yinan Sun, Ziheng Jia, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Weihua Luo, Yupeng Zhang, Hong Yi
AIM 2024 Sparse Neural Rendering Challenge: Dataset and Benchmark
Michal Nazarczuk, Thomas Tanay, Sibi Catley-Chandar, Richard Shaw, Radu Timofte, Eduardo Pérez-Pellitero
AIM 2024 Sparse Neural Rendering Challenge: Methods and Results
Michal Nazarczuk, Sibi Catley-Chandar, Thomas Tanay, Richard Shaw, Eduardo Pérez-Pellitero, Radu Timofte, Xing Yan, Pan Wang, Yali Guo, Yongxin Wu, Youcheng Cai, Yanan Yang, Junting Li, Yanghong Zhou, P. Y. Mok, Zongqi He, Zhe Xiao, Kin-Chung Chan, Hana Lebeta Goshu, Cuixin Yang, Rongkang Dong, Jun Xiao, Kin-Man Lam, Jiayao Hao, Qiong Gao, Yanyan Zu, Junpei Zhang, Licheng Jiao, Xu Liu, Kuldeep Purohit
Alfie: Democratising RGBA Image Generation with No $$$
Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara
Aligning Object Detector Bounding Boxes with Human Preference
Ombretta Strafforello, Osman Semih Kayhan, Oana Inel, Klamer Schutte, Jan van Gemert
Aligning Vision Language Models with Contrastive Learning
Kenan E. Ak, Jay Mohta, Dimitris Dimitriadis, Saurav Manchanda, Yan Xu, Mingwei Shen
An Infrastructure-Based Localization Method for Articulated Vehicles
Alberto Justo, Iker Pacho, Javier Araluce, Jesús Murgoitio Larrauri, Luis Miguel Bergasa
Analysis of Hybrid Compositions in Animation Film with Weakly Supervised Learning
Mónica Apellaniz Portos, Roberto Labadie Tamayo, Claudius Stemmler, Erwin Feyersinger, Andreas Babic, Franziska Bruckner, Vrääth Öhner, Matthias Zeppelzauer
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving
Daniel Bogdoll, Iramm Hamdard, Lukas Namgyu Rößler, Felix Geisler, Muhammed Bayram, Felix Wang, Jan Imhof, Miguel de Campos, Anushervon Tabarov, Yitian Yang, Martin Gontscharow, Hanno Gottschalk, J. Marius Zöllner
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting
Takuma Nishimura, Andreea Dogaru, Martin Oeggerli, Bernhard Egger
Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?
Charalambos Tzamos, Viktor Kocur, Yaqing Ding, Torsten Sattler, Zuzana Kukelova
Are We Friends? End-to-End Prediction of Child Rapport in Guided Play
Marc Fraile, Giovanna Varni, Joakim Lindblad, Natasa Sladoje, Ginevra Castellano
Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency
Wei Sun, Weixia Zhang, Yuqin Cao, Linhan Cao, Jun Jia, Zijian Chen, Zicheng Zhang, Xiongkuo Min, Guangtao Zhai
Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification
Mahrukh Awan, Asmar Nadeem, Muhammad Junaid Awan, Armin Mustafa, Syed Sameed Husain
Autobiasing Event Cameras
Mehdi Sefidgar Dilmaghani, Waseem Shariff, Cian Ryan, Joseph Lemley, Peter Corcoran
Automatic Die Studies for Ancient Numismatics
Clément Cornet, Héloïse Aumaître, Romaric Besançon, Julien Olivier, Thomas Faucher, Hervé Le Borgne
Automatic Generation of Fashion Images Using Prompting in Generative Machine Learning Models
Georgia Argyrou, Angeliki Dimitriou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou
Autonomous Drone-Person Tracking and Following in Uniform Appearance Scenarios
Mohamad Alansari, Oussama Abdul Hay, Sajid Javed, Hazem Elrefaei, Khaled Alnuaimi, Bilal Hassan, Jorge Dias, Yahya H. Zweiri, Naoufel Werghi
AVSal: Enhancing Video Saliency Prediction Through Audio-Visual Fusion and Temporal Aggregation
Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai
BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic Assessment
Ombretta Strafforello, Gonzalo Muradas Odriozola, Fatemeh Behrad, Li-Wei Chen, Anne-Sofie Maerten, Derya Soydaner, Johan Wagemans
BehAVE: Behaviour Alignment of Video Game Encodings
Nemanja Rasajski, Chintan Trivedi, Konstantinos Makantasis, Antonios Liapis, Georgios N. Yannakakis
Beyond the Surface: A Comprehensive Analysis of Implicit Bias in Vision-Language Models
Giacomo Capitani, Alice Lucarini, Lorenzo Bonicelli, Federico Bolelli, Simone Calderara, Loris Vezzali, Elisa Ficarra
Boosting Pose Estimators via Cross-Representation Distillation
Kang Liu, Zhendong Yang, Jingyun Zhang, Jun Wang, Shaoming Wang, Chun Yuan, Rizen Guo
Boundary Attention: Learning Curves, Corners, Junctions and Grouping
Mia Gaia Polansky, Charles Herrmann, Junhwa Hur, Deqing Sun, Dor Verbin, Todd E. Zickler
Can Your Generative Model Detect Out-of-Distribution Covariate Shift?
Christiaan G. A. Viviers, M. M. Amaan Valiuddin, Francisco Caetano, Lemar Abdi, Lena Filatova, Peter H. N. de With, Fons van der Sommen
Coarse-to-Fine Human Mesh Recovery with Transformers
Vatsal Agarwal, Mara Levy, Max Ehrlich, Youbao Tang, Ning Zhang, Abhinav Shrivastava
Collaborative Control for Geometry-Conditioned PBR Image Generation
Shimon Vainer, Mark Boss, Mathias Parger, Konstantin Kutsy, Dante De Nigris, Ciara Rowles, Nicolas Perony, Simon Donné
ComiCap: A VLMs Pipeline for Dense Captioning of Comic Panels
Emanuele Vivoli, Niccolò Biondi, Marco Bertini, Dimosthenis Karatzas
Compositional Text-to-Image Generation with Feedforward Layout Generation
Sifei Liu, Weili Nie, An-Chieh Cheng, Morteza Mardani, Chao Liu, Benjamin Eckart, Arash Vahdat
Compound Expression Recognition via Curriculum Learning
Chen Liu, Feng Qiu, Wei Zhang, Lincheng Li, Dadong Wang, Xin Yu
Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results
Marcos V. Conde, Florin-Alexandru Vasluianu, Jinhui Xiong, Wei Ye, Rakesh Ranjan, Radu Timofte, Huan Zheng, Wencheng Han, Tianyi Yan, Jianbing Shen, Pihai Sun, Yuanqi Yao, Kui Jiang, Wenbo Zhao, Xianming Liu, Evgeny Burnaev, Junjun Jiang, Woojae Han, Kyeonghyun Lee, Seongmin Hong, Se Young Chun, Jinseong Kim, Dohyeong Kim, Jeahwan Kim, Yubo Wang, Chi Zhang, Huizhen Luo, Yansai Wu, Mengcheng Huang, Chengji Liu, Chongli Yve, Jianhang Sun, Cheng Guo, Yingcai Du, Huang Jianhao, Liu Shuai, Li Chenghua
Compression-RQ-VQA: Leveraging Rich Quality-Aware Features for Compressed Video Quality Assessment
Ziheng Jia, Jiaying Qian, Wei Sun, Zicheng Zhang, Yuqin Cao, Yinan Sun, Yuxin Zhu, Guangtao Zhai, Xiongkuo Min
Conditional Unscented Autoencoders for Trajectory Prediction
Faris Janjos, Marcel Hallgarten, Anthony Knittel, Maxim Dolgov, Andreas Zell, J. Marius Zöllner
Context-Aware Full Body Anonymization
Pascal Zwick, Kevin Rösch, Marvin Klemp, Oliver Bringmann
CycleBNN: Cyclic Precision Training in Binary Neural Networks
Federico Fontana, Romeo Lanzino, Anxhelo Diko, Gian Luca Foresti, Luigi Cinque
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
Jiantao Wu, Shentong Mo, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais
DARES: Depth Anything in Robotic Endoscopic Surgery with Self-Supervised Vector-LoRA of the Foundation Model
Mona Sheikh Zeinoddin, Chiara Lena, Jiongqi Qu, Luca Carlini, Mattia Magro, Seunghoi Kim, Elena De Momi, Sophia Bano, Matthew Grech-Sollars, Evangelos B. Mazomenos, Daniel C. Alexander, Danail Stoyanov, Matthew J. Clarkson, Mobarakol Islam
Data-Efficient Generation for Dataset Distillation
Zhe Li, Weitong Zhang, Sarah Cechnicka, Bernhard Kainz
DAVIDE: Depth-Aware Video Deblurring
German F. Torres, Jussi Kalliola, Soumya Tripathy, Erman Acar, Joni-Kristian Kämäräinen
DebiasPI: Inference-Time Debiasing by Prompt Iteration of a Text-to-Image Generative Model
Sarah Bonna, Yu-Cheng Huang, Ekaterina Novozhilova, Sejin Paik, Zhengyang Shan, Michelle Yilin Feng, Ge Gao, Yonish Tayal, Rushil Kulkarni, Jialin Yu, Nupur Divekar, Deepti Ghadiyaram, Derry Wijaya, Margrit Betke
Deep Learning for Automated Shark Detection and Biometrics Without Keypoints
Jaden V. Clark, Chinmay K. Lalgudi, Mark E. Leone, Jayson Meribe, Sergio Madrigal-Mora, Mario Espinoza
Depth-Based Privileged Information for Boosting 3D Human Pose Estimation on RGB
Alessandro Simoni, Francesco Marchetti, Guido Borghi, Federico Becattini, Davide Davoli, Lorenzo Garattoni, Gianpiero Francesca, Lorenzo Seidenari, Roberto Vezzani
Detecting Forged Sentinel-2 Images Through Parallax-Based Cloud Analysis
Matthieu Serfaty, Quentin Bammey, Tina Nikoukhah, Rafael Grompone von Gioi, Carlo de Franchis
DIFF-NST: Diffusion Interleaving for deFormable Neural Style Transfer
Dan Ruta, Gemma Canet Tarres, Andrew Gilbert, Eli Shechtman, Nicholas I. Kolkin, John P. Collomosse
DiffAugment: Diffusion Based Long-Tailed Visual Relationship Recognition
Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do
Diffusion-Based Light Field Synthesis
Ruisheng Gao, Yutong Liu, Zeyu Xiao, Zhiwei Xiong
Diffusion-Promoted HDR Video Reconstruction
Yuanshen Guan, Ruikang Xu, Mingde Yao, Ruisheng Gao, Lizhi Wang, Zhiwei Xiong
DiM: Distilling Dataset into Generative Model
Kai Wang, Jianyang Gu, Hansong Zhang, Daquan Zhou, Zheng Zhu, Wei Jiang, Yang You
DIVA: Deep Indic Virtual Apparel Try-on
Kuppa Sai Sri Teja, Hrishit Mitra, Rongali Simhachala Venkata Girish, Kaushik Mitra
Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?
Kerem Cekmeceli, Meva Himmetoglu, Guney I. Tombak, Anna Susmelj, Ertunc Erdil, Ender Konukoglu
DreamWalk: Style Space Exploration Using Diffusion Guidance
Michelle Shu, Charles Herrmann, Richard Strong Bowen, Forrester Cole, Ramin Zabih
Drone Detection Using a Low-Power Neuromorphic Virtual Tripwire
Anton Eldeborg Lundin, Rasmus Winzell, Hanna Hamrell, David Gustafsson, Hannes Ovrén
Dynamic Label Injection for Imbalanced Industrial Defect Segmentation
Emanuele Caruso, Francesco Pelosin, Alessandro Simoni, Marco Boschetti
Edge-Aware Consistent Stereo Video Depth Estimation
Elena Kosheleva, Sunil Prasad Jaiswal, Faranak Shamsafar, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek
Effective Prior Regularized Sparse Learning
Junting Li, Yanghong Zhou, Jintu Fan, Dahua Shou, Sa Xu, P. Y. Mok
Empowering Autonomous Shuttles with Next-Generation Infrastructure
Sven Ochs, Melih Yazgan, Rupert Polley, Albert Schotschneider, Stefan Orf, Marc Uecker, Maximilian Zipfl, Julian Burger, Abhishek Vivekanandan, Jennifer Amritzer, Marc René Zofka, J. Marius Zöllner
EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans
Nicola Garau, Giulia Martinelli, Niccolò Bisagno, Denis Tomè, Carsten Stoll
Evaluating Image-Based Face and Eye Tracking with Event Cameras
Khadija Iddrisu, Waseem Shariff, Noel E. O'Connor, Joseph Lemley, Suzanne Little
EventSleep: Sleep Activity Recognition with Event Cameras
Carlos Plou, Nerea Gallego, Alberto Sabater, Pablo Urcola, Eduardo Montijano, Luis Montesano, Ruben Martinez-Cantin, Ana C. Murillo
Evolution of Detection Performance Throughout the Online Lifespan of Synthetic Images
Dimitrios Karageorgiou, Quentin Bammey, Valentin Porcellini, Bertrand Goupil, Denis Teyssou, Symeon Papadopoulos
ExeChecker: Where Did I Go Wrong?
Yiwen Gu, Mahir Patel, Margrit Betke
Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance
Simone Maurizio La Cava, Sara Concas, Ruben Tolosana, Roberto Casula, Giulia Orrù, Martin Drahanský, Julian Fierrez, Gian Luca Marcialis
Exploring Strengths and Weaknesses of Super-Resolution Attack in Deepfake Detection
Davide Alessandro Coccomini, Roberto Caldelli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
FABRIC: Personalizing Diffusion Models with Iterative Feedback
Dimitri von Rütte, Elisabetta Fedele, Jonathan Thomm, Lukas Wolf
FaceOracle: Chat with a Face Image Oracle
Wassim Kabbani, Kiran B. Raja, Raghavendra Ramachandra, Christoph Busch
Fairness of AI Systems in the Legal Context
Veronica Paternolli, Mila Dalla Preda, Roberto Giacobazzi
FALCON: Fair Active Learning for Content Moderation
Zuhui Wang, Sandra Sajeev, Gaurav Mittal, Matthew Hall, Ye Yu, Zhaozheng Yin, Mei Chen
Fashion Attribute Extraction Under an Evolving Ontology
Aditya Kanade, Manasi Patwardhan, Mayur Patidar, Lovekesh Vig, Bagyalakshmi Vasudevan
Find the Assembly Mistakes: Error Segmentation for Industrial Applications
Dan Lehman, Tim J. Schoonbeek, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen
Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models
Jun Xiao, Zihang Lyu, Hao Xie, Cong Zhang, Yakun Ju, Changjian Shui, Kin-Man Lam
FruitBin: A Tunable Large-Scale Dataset for Advancing 6D Pose Estimation in Fruit Bin-Picking Automation
Guillaume Duret, Mahmoud Ali, Nicolas Cazin, Danylo Mazurak, Anna Samsonenko, Alexandre Chapin, Florence Zara, Emmanuel Dellandréa, Liming Chen, Jan Peters
Garment Attribute Manipulation with Multi-Level Attention
Vittorio Casula, Lorenzo Berlincioni, Luca Cultrera, Federico Becattini, Chiara Pero, Carmen Bisogni, Marco Bertini, Alberto Del Bimbo
GECO: GPT-Driven Estimation of 3D Human-Scene Contact in the Wild
Chaehong Lee, Simranjit Singh, Michael Fore, Georgios Pavlakos, Dimitrios Stamoulis
Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones
Carlos Plou, Pablo Pueyo, Ruben Martinez-Cantin, Mac Schwager, Ana C. Murillo, Eduardo Montijano
Generating Binary Species Range Maps
Filip Dorm, Christian Lange, Scott Loarie, Oisin Mac Aodha
Generative Dataset Distillation Based on Diffusion Model
Duo Su, Junjie Hou, Guang Li, Ren Togo, Rui Song, Takahiro Ogawa, Miki Haseyama
Generative Dataset Distillation Using Min-Max Diffusion Model
Junqiao Fan, Yunjiao Zhou, Min Chang Jordan Ren, Jianfei Yang
Generative Hierarchical Temporal Transformer for Hand Pose and Action Modeling
Yilin Wen, Hao Pan, Takehiko Ohkawa, Lei Yang, Jia Pan, Yoichi Sato, Taku Komura, Wenping Wang
Glia Cell Inspired Reinforcement Learning Agent for Neural Network Optimization
Alessio Fagioli, Luigi Cinque, Damiano Distante, Gian Luca Foresti, Marco Cascio
Good Data Is All Imitation Learning Needs
Amir Samadi, Konstantinos Koufos, Kurt Debattista, Mehrdad Dianati
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Yu Liu, Kai Chen, Ping Luo
GSTAM: Efficient Graph Distillation with Structural Attention-Matching
Arash Rasti-Meymandi, Ahmad Sajedi, Zhaopan Xu, Konstantinos N. Plataniotis
Hand2Any: Hand-to-Any Motion Mapping with Few-Shot User Adaptation for Avatar Manipulation
Riku Shinohara, Atsushi Hashimoto, Tadashi Kozuno, Shigeo Yoshida, Yutaro Hirao, Monica Perusquía-Hernández, Hideaki Uchiyama, Kiyoshi Kiyokawa
Helios: An Extremely Low Power Event-Based Gesture Recognition for Always-on Smart Eyewear
Prarthana Bhattacharyya, Joshua Mitton, Ryan Page, Owen Morgan, Ben Menzies, Gabriel Homewood, Kemi Jacobs, Paolo Baesso, Dave Trickett, Chris Mair, Taru Muhonen, Rory Clark, Louis Berridge, Richard Vigars, Iain Wallace
High-Frequency Near-Eye Ground Truth for Event-Based Eye Tracking
Andrea Simpsi, Andrea Aspesi, Simone Mentasti, Luca Merigo, Tommaso Ongarello, Matteo Matteucci
How to Squeeze an Explanation Out of Your Model
Tiago Roxo, Joana Cabral Costa, Pedro R. M. Inácio, Hugo Proença
Human-Based Low-Level Visual Processing Neural Network for Image Segmentation
Alessio Fagioli, Luigi Cinque, Damiano Distante, Gian Luca Foresti, Marco Cascio
Hyperbolic Learning with Multimodal Large Language Models
Paolo Mandica, Luca Franco, Konstantinos Kallidromitis, Suzanne Petryk, Fabio Galasso
Hyperbolic Metric Learning for Visual Outlier Detection
Álvaro González-Jiménez, Simone Lionetti, Dena Bazazian, Philippe Gottfrois, Fabian Gröger, Alexander A. Navarini, Marc Pouly
Hyperspectral Imaging and Computer Vision Based Remote Monitoring of SO2 Emissions in Maritime Vessels
Arnoud Jochemsen, Hege Indresand, Martin Chamberland, Etienne Drouin, Jan Robert Fiksdal, Xuan Zhang, Nabil Belbachir
I-Design: Personalized LLM Interior Designer
Ata Çelen, Guo Han, Konrad Schindler, Luc Van Gool, Iro Armeni, Anton Obukhov, Xi Wang
Image Color Consistency in Datasets: The Smooth-TPS3D Method
Ismael Benito-Altamirano, David Martínez-Carpena, Hanna Lizarzaburu-Aguilar, Carles Ventura, Cristian Fàbrega, Joan Daniel Prades
Improving Hyperparameter Optimization with Checkpointed Model Weights
Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat, James Lucas, Arun George Zachariah
Improving in Situ Real-Time Classification of Long-Tail Marine Plankton Images for Ecosystem Studies
Noushin Eftekhari, Sophie Pitois, Mojtaba Masoudi, Robert E. Blackwell, James Scott, Sarah L. C. Giering, Matthew Fry
Improving Post-Earthquake Crack Detection Using Semi-Synthetic Generated Images
Piercarlo Dondi, Alessio Gullotti, Michele Inchingolo, Ilaria Senaldi, Chiara Casarotti, Luca Lombardi, Marco Piastra
Incremental and Decremental Continual Learning for Privacy-Preserving Video Recognition
Lorenzo Caselli, Simone Magistri, Tommaso Bianconcini, Andrea Benericetti, Douglas Coimbra de Andrade, Andrew D. Bagdanov
Introducing Gating and Context into Temporal Action Detection
Aglind Reka, Diana Laura Borza, Dominick Reilly, Michal Balazia, François Brémond
IPAdapter-Instruct: Resolving Ambiguity in Image-Based Conditioning Using Instruct Prompts
Ciara Rowles, Shimon Vainer, Dante De Nigris, Slava Elizarov, Konstantin Kutsy, Simon Donné
KAN You See It? KANs and Sentinel for Effective and Explainable Crop Field Segmentation
Daniele Rege Cambrin, Eleonora Poeta, Eliana Pastor, Tania Cerquitelli, Elena Baralis, Paolo Garza
Khattat: Enhancing Readability and Concept Representation of Semantic Typography
Ahmed Hussein, Alaa Elsetohy, Sama Hadhoud, Tameem Bakr, Yasser Rohaim, Badr AlKhamissi
KRONC: Keypoint-Based Robust Camera Optimization for 3D Car Reconstruction
Davide Di Nucci, Alessandro Simoni, Matteo Tomei, Luca Ciuffreda, Roberto Vezzani, Rita Cucchiara
LanPose: Language-Instructed 6D Object Pose Estimation for Robotic Assembly
Bowen Fu, Sek Kun Leong, Yan Di, Gu Wang, Jiwen Tang, Federico Tombari, Xiangyang Ji
Larval Hostplant Prediction from Luehdorfia Japonica Image Using Multi-Label ABN
Tsubasa Hirakawa, Takaaki Arai, Takayoshi Yamashita, Hironobu Fujiyoshi, Yuichi Oba, Hiromichi Fukui, Masaya Yago
Latent Distillation for Continual Object Detection at the Edge
Francesco Pasti, Marina Ceccon, Davide Dalle Pezze, Francesco Paissan, Elisabetta Farella, Gian Antonio Susto, Nicola Bellotto
Learning from Strong to Weak an Enhanced Quality Comparison Network via Efficient Transfer Learning
Yunchen Zhang, Xiangkai Xu, Hong Gao, Ji Shi, Yiming Bao, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu
Level up Your Tutorials: VLMs for Game Tutorials Quality Assessment
Daniele Rege Cambrin, Gabriele Scaffidi Militone, Luca Colomba, Giovanni Malnati, Daniele Apiletti, Paolo Garza
Leveraging Key-Points Encoded Human Pose Images for Human Activity Recognition
Gaia Virginia Dobici, Luca Minutillo, Ermanno Cordelli, Francesco Chirico, Goffredo Foglia, Paolo Soda
Leveraging Object Priors for Point Tracking
Bikram Boote, Anh Thai, Wenqi Jia, Ozgur Kara, Stefan Stojanov, James M. Rehg, Sangmin Lee
LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Huan Wang, Feitong Tan, Ziqian Bai, Yinda Zhang, Shichen Liu, Qiangeng Xu, Menglei Chai, Anish Prabhu, Rohit Pandey, Sean Fanello, Zeng Huang, Yun Fu
Limited but Consistent Gains in Adversarial Robustness by Co-Training Object Recognition Models with Human EEG
Manshan Guo, Bhavin Choksi, Sari Sadiya, Alessandro T. Gifford, Martina G. Vilas, Radoslaw Martin Cichy, Gemma Roig
Lincoln's Annotated Spatio-Temporal Strawberry Dataset (LAST-Straw)
Katherine Margaret Frances James, Karoline Heiwolt, Daniel James Sargent, Grzegorz Cielniak
Llama-NAS: Efficient Neural Architecture Search for Large Language Models
Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan
LLaMAPed: Multi-Modal Pedestrian Crossing Intention Prediction
Je-Seok Ham, Sunghun Kim, Jia Huang, Peng Jiang, Jinyoung Moon, Srikanth Saripalli, Changick Kim
LocalMamba: Visual State Space Model with Windowed Selective Scan
Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu
Loop Mining Large-Scale Unlabeled Data for Corner Case Detection in Autonomous Driving
Jiawei Zhao, Yiting Duan, Jinming Su, Wangwang Yang, Tingyi Guo, Xingyue Chen, Junfeng Luo
LSVOS Challenge Report: Large-Scale Complex and Long Video Object Segmentation
Henghui Ding, Lingyi Hong, Chang Liu, Ning Xu, Linjie Yang, Yuchen Fan, Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Jinming Chai, Qin Ma, Junpei Zhang, Licheng Jiao, Fang Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Xu Liu, Lingling Li, Hao Fang, Feiyu Pan, Xiankai Lu, Wei Zhang, Runmin Cong, Tuyen Tran, Bin Cao, Yisi Zhang, Hanyi Wang, Xingjian He, Jing Liu
LVG-SfM: Learning-Based View-Graph Generation for Robust On-the-Fly SfM
Wentian Gan, Yifei Yu, Giulio Perda, Luca Morelli, Rui Xia, Zongqian Zhan, Xin Wang, Fabio Remondino
Machine Learning-Driven Marketing Personas for the Luxury Fashion Market
Rocco Pietrini, Alessandro Galdelli, Adriano Mancini, Emanuele Frontoni, Primo Zingaretti
Magic-Me: Identity-Specific Video Customized Diffusion
Ze Ma, Daquan Zhou, Xue-She Wang, Chun-Hsiao Yeh, Xiuyu Li, Huanrui Yang, Zhen Dong, Kurt Keutzer, Jiashi Feng
MaskSDM: Adaptive Species Distribution Modeling Through Data Masking
Robin Zbinden, Nina Van Tiel, Gencer Sumbul, Benjamin Kellenberger, Devis Tuia
Maximally Separated Active Learning
Tejaswi Kasarla, Abhishek Jha, Faye Tervoort, Rita Cucchiara, Pascal Mettes
MCUBench: A Benchmark of Tiny Object Detectors on MCUs
Sudhakar Sah, Darshan C. Ganji, Matteo Grimaldi, Ravish Kumar, Alexander Hoffman, Honnesh Rohmetra, Ehsan Saboori
Medical Image Segmentation with SAM-Generated Annotations
Iira Häkkinen, Iaroslav Melekhov, Erik Englesson, Hossein Azizpour, Juho Kannala
Memory-Optimized Once-for-All Network
Maxime Girard, Victor Quétu, Samuel Tardieu, Van-Tam Nguyen, Enzo Tartaglione
MI-NeRF: Learning a Single NeRF for Multiple Identities
Aggelina Chatziagapi, Grigorios G. Chrysos, Dimitris Samaras
Mining Field Data for Tree Species Recognition at Scale
Dimitri Gominski, Daniel Ortiz-Gonzalo, Martin Brandt, Maurice Mugabowindekwe, Rasmus Fensholt
Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks
Sierra Bonilla, Chiara Di Vece, Rema Daher, Xinwei Ju, Danail Stoyanov, Francisco Vasconcelos, Sophia Bano
Mixed Non-Linear Quantization for Vision Transformers
Gihwan Kim, Jemin Lee, Sihyeong Park, Yongin Kwon, Hyungshin Kim
MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation
Dimitrios Kollias, Andreas Psaroudakis, Anastasios Arsenos, Paraskevi Theofilou, Chunchang Shao, Guanyu Hu, Ioannis Patras
MobileIQA: Exploiting Mobile-Level Diverse Opinion Network for No-Reference Image Quality Assessment Using Knowledge Distillation
Zewen Chen, Sunhan Xu, Yun Zeng, Haochen Guo, Jian Guo, Shuai Liu, Juan Wang, Bing Li, Weiming Hu, Dehua Liu, Hesong Li
Modelling the Distribution of Human Motion for Sign Language Assessment
Oliver Cory, Ozge Mercanoglu Sincan, Matthew J. Vowels, Alessia Battisti, Franz Holzknecht, Katja Tissi, Sandra Sidler-Miserez, Tobias Haug, Sarah Ebling, Richard Bowden
Monitoring Viewer Attention During Online Ads
Mina Bishay, Graham Page, Waleed Emad, Mohammad Mavadati
MPL: Lifting 3D Human Pose from Multi-View 2D Poses
Seyed Abolfazl Ghasemzadeh, Alexandre Alahi, Christophe De Vleeschouwer
Multi-Scale and Multimodal Species Distribution Modeling
Nina Van Tiel, Robin Zbinden, Emanuele Dalsasso, Benjamin Kellenberger, Loïc Pellissier, Devis Tuia
Multi-View Pose Fusion for Occlusion-Aware 3D Human Pose Estimation
Laura Bragagnolo, Matteo Terreran, Davide Allegro, Stefano Ghidoni
MVP: Multimodal Emotion Recognition Based on Video and Physiological Signals
Valeriya Strizhkova, Hadi Kachmar, Hava Chaptoukaev, Raphael Kalandadze, Natia Kukhilava, Tatia Tsmindashvili, Nibras Abo-Alzahab, Maria A. Zuluaga, Michal Balazia, Antitza Dantcheva, François Brémond, Laura M. Ferrari
NeAT: Neural Artistic Tracing for High Resolution Style Transfer
Dan Ruta, Andrew Gilbert, John P. Collomosse, Eli Shechtman, Nicholas I. Kolkin
NeRFmentation: Improving Monocular Depth Estimation with NeRF-Based Data Augmentation
Casimir Feldmann, Niall Siegenheim, Nikolas Hars, Lovro Rabuzin, Mert Ertugrul, Luca Wolfart, Marc Pollefeys, Zuria Bauer, Martin R. Oswald
Neural Transcoding Vision Transformers for EEG-to-fMRI Synthesis
Romeo Lanzino, Federico Fontana, Luigi Cinque, Francesco Scarcello, Atsuto Maki
Neuromorphic Drone Detection: An Event-RGB Multimodal Approach
Gabriele Magrini, Federico Becattini, Pietro Pala, Alberto Del Bimbo, Antonio Porta
Neuromorphic Facial Analysis with Cross-Modal Supervision
Federico Becattini, Luca Cultrera, Lorenzo Berlincioni, Claudio Ferrari, Andrea Leonardo, Alberto Del Bimbo
On Camera and LiDAR Positions in End-to-End Autonomous Driving
Malte Stelzer, Jan Pirklbauer, Jan Bickerdt, Volker Schomerus, Jan Piewek, Thorsten Bagdonat, Tim Fingscheidt
On Scaling up 3D Gaussian Splatting Training
Hexu Zhao, Haoyang Weng, Daohan Lu, Ang Li, Jinyang Li, Aurojit Panda, Saining Xie
Online Learning via Memory: Retrieval-Augmented Detector Adaptation
Yanan Jian, Fuxun Yu, Qi Zhang, William Levine, Brandon Dubbs, Nikolaos Karianakis
Online Stochastic Optimization for Data with Temporal Dependencies
Shivang Patel, Ram J. Zaveri, Samuel Chambers, Zaigham A. Randhawa, Gianfranco Doretto
Open-Set Plankton Recognition
Joona Kareinen, Annaliina Skyttä, Tuomas Eerola, Kaisa Kraft, Lasse Lensu, Sanna Suikkanen, Maiju Lehtiniemi, Heikki Kälviäinen
Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts
Prakash Chandra Chhipa, Kanjar De, Meenakshi Subhash Chippa, Rajkumar Saini, Marcus Liwicki
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation
Muhammad Rameez Ur Rahman, Piero Simonetto, Anna Polato, Francesco Pasti, Luca Tonin, Sebastiano Vascon
Optimal OnTheFly Feedback Control of Event Sensors
Valery Vishenvskiy, Greg Burman, Sebastian Kozerke, Diederik Paul Moeys
Ordinal-Meta Learning for Fine-Grained Fruit Quality Prediction
Aayush Mishra, Manasi Patwardhan, Parijat Deshpande, Beena Rai
OSSA: Unsupervised One-Shot Style Adaptation
Robin Gerster, Holger Caesar, Matthias Rapp, Alexander Wolpert, Michael Teutsch
PackMamba: Efficient Processing of Variable-Length Sequences in Mamba Training
Haoran Xu, Ziqian Liu, Rong Fu, Zhongling Su, Zerui Wang, Zheng Cai, Zhilin Pei, Xingcheng Zhang
Pixels of Faith: Exploiting Visual Saliency to Detect Religious Image Manipulation
Giuseppe Cartella, Vittorio Cuculo, Marcella Cornia, Marco Papasidero, Federico Ruozzi, Rita Cucchiara
PlaMo: Plan and Move in Rich 3D Physical Environments
Assaf Hallak, Gal Dalal, Chen Tessler, Kelly Guo, Shie Mannor, Gal Chechik
POLO - Point-Based, Multi-Class Animal Detection
Giacomo May, Emanuele Dalsasso, Benjamin Kellenberger, Devis Tuia
Pose-Independent 3D Anthropometry from Sparse Data
David Bojanic, Stefanie Wuhrer, Tomislav Petkovic, Tomislav Pribanic
PoTATO: A Dataset for Analyzing Polarimetric Traces of Afloat Trash Objects
Luis F. W. Batista, Salim Khazem, Mehran Adibi, Seth Hutchinson, Cédric Pradalier
Predicting Emotions in Interpersonal Interaction Videos: I Know What You Feel
Hajer Guerdelli, Claudio Ferrari, Stefano Berretti, Walid Barhoumi, Alberto Del Bimbo
PRISM: Progressive Restoration for Scene Graph-Based Image Manipulation
Pavel Jahoda, Yousef Yeganeh, Ehsan Adeli, Nassir Navab, Azade Farshad
Prompt and Prejudice
Lorenzo Berlincioni, Luca Cultrera, Federico Becattini, Marco Bertini, Alberto Del Bimbo
ProxyDR: Deep Hyperspherical Metric Learning with Distance Ratio-Based Formulation
Hyeongji Kim, Changkyu Choi, Michael Kampffmeyer, Terje Berge, Pekka Parviainen, Ketil Malde
Pruning by Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo, Jinyu Yang, Jungong Han, Feng Zheng, Bin Cao, Yisi Zhang, Xuanxu Lin, Xingjian He, Bo Zhao, Jing Liu, Feiyu Pan, Hao Fang, Xiankai Lu
Real-Time 2nd-Order Gaze Metrics
Andrew T. Duchowski, Krzysztof Krejtz, Izabela Krejtz
Real-Time Neural Cloth Deformation Using a Compact Latent Space and a Latent Vector Predictor
Chanhaeng Lee, Maksym Perepichka, Saeed Ghorbani, Sudhir P. Mudur, Eric Paquette, Tiberiu Popa
Recent Event Camera Innovations: A Survey
Bharatesh Chakravarthi, Aayush Atul Verma, Kostas Daniilidis, Cornelia Fermüller, Yezhou Yang
RenDetNet: Weakly-Supervised Shadow Detection with Shadow Caster Verification
Nikolina Kubiak, Elliot Wortman, Armin Mustafa, Graeme Phillipson, Stephen Jolly, Simon Hadfield
Rethinking HTG Evaluation: Bridging Generation and Recognition
Konstantina Nikolaidou, George Retsinas, Giorgos Sfikas, Marcus Liwicki Retrieval of Sun-Induced Plant Fluorescence in the O2-A Absorption Band from DESIS Imagery
Jim Buffat, Miguel Pato, Kevin Alonso, Stefan Auer, Emiliano Carmona, Stefan W. Maier, Rupert Müller, Patrick Rademske, Uwe Rascher, Hanno Scharr Revisiting Relevance Feedback for CLIP-Based Interactive Image Retrieval
Ryoya Nara, Yu-Chieh Lin, Yuji Nozawa, Youyang Ng, Goh Itoh, Osamu Torii, Yusuke Matsui RMT-BVQA: Recurrent Memory Transformer Based Blind Video Quality Assessment for Enhanced Video Content
Tianhao Peng, Chen Feng, Duolikun Danier, Fan Zhang, Benoit Quentin Arthur Vallade, Alex Mackin, David Bull RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (Early Version)
Yao Mu, Tianxing Chen, Shijia Peng, Zanxin Chen, Zeyu Gao, Yude Zou, Lunkai Lin, Zhiqiang Xie, Ping Luo ROMEO: Revisiting Optimization Methods for Reconstructing 3D Human-Object Interaction Models from Images
Alexey Gavryushin, Yifei Liu, Daoji Huang, Yen-Ling Kuo, Julien Valentin, Luc Van Gool, Otmar Hilliges, Xi Wang S-ROPE: Spectral Frame Representation of Periodic Events
Luis Garcia Rodriguez, Jonas Konrad, Dominik Drees, Benjamin Risse SABER-6D: Shape Representation Based Implicit Object Pose Estimation
Shishir Reddy Vutukur, Mengkejiergeli Ba, Benjamin Busam, Matthias Kayser, Gurprit Singh San Vitale Challenge: Automatic Reconstruction of Ancient Colored Glass Windows
Nicolò Di Domenico, Guido Borghi, Annalisa Franco, Marco Boschetti, Federica Giacomini, Sebastian Barzaghi, Silvia Ferucci, Simone Zambruno, Lorenzo Mularoni, Qiong Gao, Chenyue Che, Guoxin Li, Yanyan Zu, Jiayao Hao, Junpei Zhang, Ákos Dúcz, Levente Gego, Klevis Imeri, Viktória Nemkin, Azam Rakhmatillaev, Soma Szatmári, William Rowan Scaling up Resonate-and-Fire Networks for Fast Deep Learning
Thomas E. Huber, Jules Lecomte, Borislav Polovnikov, Axel von Arnim Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method
Hongjun Choi, Dongbin Na, Kyungjin Cho, Byunguk Bae, Seo Taek Kong, Hyunjoon Ahn, Sungchul Choi, Jaeyoung Kim Self-Supervised HDR Imaging from Motion and Exposure Cues
Michal Nazarczuk, Sibi Catley-Chandar, Ales Leonardis, Eduardo Pérez-Pellitero Self-Supervised Road Accident Anticipation with Non-Decreasing Danger
Aurel Pjetri, Davide Abbondandolo, Douglas Coimbra de Andrade, Stefano Caprasecca, Francesco Sambo, Andrew D. Bagdanov Skeleton-Aware Motion Retargeting Using Masked Pose Modeling
Giulia Martinelli, Nicola Garau, Niccolò Bisagno, Nicola Conci Source-Free Domain Adaptation for YOLO Object Detection
Simon Varailhon, Masih Aminbeidokhti, Marco Pedersoli, Eric Granger Sources of Uncertainty in 3D Scene Reconstruction
Marcus Klasson, Riccardo Mereu, Juho Kannala, Arno Solin Space3D-Bench: Spatial 3D Question Answering Benchmark
Emilia Szymanska, Mihai Dusmanu, Jan-Willem Buurlage, Mahdi Rad, Marc Pollefeys SR-VQA: Super-Resolution Video Quality Assessment Model
Yuqin Cao, Wei Sun, Weixia Zhang, Yinan Sun, Ziheng Jia, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai Storytelling Video Generation with Retrieval Augmentation and Character Consistency
Yingqing He, Menghan Xia, Haoxin Chen, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers
Martín Méndez, Pau Torras, Adrià Molina, Jialuo Chen, Oriol Ramos Terrades, Alicia Fornés Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization
Rohan Reddy Mekala, Frederik Pahde, Simon Baur, Sneha Chandrashekar, Madeline Diep, Markus Wenzel, Eric L. Wisotzky, Galip Ümit Yolcu, Sebastian Lapuschkin, Jackie Ma, Peter Eisert, Mikael Lindvall, Adam A. Porter, Wojciech Samek TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans
Aggelina Chatziagapi, Bindita Chaudhuri, Amit Kumar, Rakesh Ranjan, Dimitris Samaras, Nikolaos Sarafianos Target-Oriented Object Grasping via Multimodal Human Guidance
Pengwei Xie, Siang Chen, Yixiang Dai, Dingchang Hu, Kaiqin Yang, Guijin Wang TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen, Wenjun Huang, Yang Ni, Sanggeon Yun, Yezi Liu, Fei Wen, Alvaro Velasquez, Hugo Latapie, Mohsen Imani TASOD: A Data Collection for Tiny and Small Object Detection
Lars Fichtel, Dominik Erbacher, Dennis Grünwald, Leon Heller, Christian Bachmeir, Radu Timofte Textualized and Feature-Based Models for Compound Multimodal Emotion Recognition in the Wild
Nicolas Richet, Soufiane Belharbi, Haseeb Aslam, Meike Emilie Schadt, Manuela González-González, Gustave Cortal, Alessandro Lameiras Koerich, Marco Pedersoli, Alain Finkel, Simon Bacon, Eric Granger The BRAVO Semantic Segmentation Challenge Results in UNCV2024
Tuan-Hung Vu, Eduardo Valle, Andrei Bursuc, Tommie Kerssies, Daan de Geus, Gijs Dubbelman, Long Qian, Bingke Zhu, Yingying Chen, Ming Tang, Jinqiao Wang, Tomás Vojír, Jan Sochman, Jirí Matas, Michael Smith, Frank P. Ferrie, Shamik Basu, Christos Sakaridis, Luc Van Gool The Second Visual Object Tracking Segmentation VOTS2024 Challenge Results
Matej Kristan, Jirí Matas, Pavel Tokmakov, Michael Felsberg, Luka Cehovin Zajc, Alan Lukezic, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Hyung Jin Chang, Gustavo Fernández, Minasadat Attari, Antoni B. Chan, Liang Chen, Xin Chen, Jaired Collins, Yutao Cui, Ganesh Sai Manas Devarapu, Yinglong Du, Heng Fan, Wan-Cyuan Fan, Zhenhua Feng, Mingqi Gao, Rama Krishna Gorthi, Raghav Goyal, Jungong Han, Bijaya Kumar Hatuwal, Zhenyu He, Xiantao Hu, Xingsen Huang, Yuqing Huang, Dongmei Jiang, Ben Kang, Kannappan Palaniappan, Josef Kittler, Simiao Lai, Ning Li, Xiaohai Li, Xin Li, Cheng Liang, Liting Lin, Haibin Ling, Ting Liu, Ziquan Liu, Huchuan Lu, Yifei Luo, Deshui Miao, Juan David Mogollon, Ziqi Pang, Jaswanth Reddy Pochimireddy, Viktor Prutyanov, Gani Rahmon, Aleksandr Romanov, Liangtao Shi, Mennatullah Siam, Leonid Sigal, Arun Kumar Sivapuram, Roman A. Solovyev, Elham Soltani Kazemi, Imad Eddine Toubal, Jia Wan, Limin Wang, Xinying Wang, Yaowei Wang, Yu-Xiong Wang, Zhiquan Wang, Gangshan Wu, Qiangqiang Wu, Xiaojun Wu, Zihao Xia, Jinxia Xie, Chenlong Xu, Tianyang Xu, Yong Xu, Chaocan Xue, Chao Yang, Jinyu Yang, Ming-Hsuan Yang, Chenyang Yu, Ke Yu, Chunhui Zhang, Jiaming Zhang, Zhipeng Zhang, Feng Zheng, Yaozong Zheng, Bineng Zhong, Jinglin Zhou, Junbao Zhou, Yong Zhou, Zikun Zhou, Guibo Zhu, Jiawen Zhu, Xuefeng Zhu, Vladimir V. Zunin THP3D: Text-Driven Multi-Granularity 3D Human Parsing
Keito Suzuki, Bang Du, Kunyao Chen, Runfa Blark Li, Truong Q. Nguyen Time-Resolved MNIST Dataset for Single-Photon Recognition
Aleksi Suonsivu, Lauri Salmela, Edoardo Peretti, Leevi Uosukainen, Radu Ciprian Bilcu, Giacomo Boracchi ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment
Hsiang-Wei Huang, Jiacheng Sun, Cheng-Yen Yang, Zhongyu Jiang, Li-Yu Huang, Jenq-Neng Hwang, Yu-Ching Yeh Towards Motion from Video Diffusion Models
Paul Janson, Tiberiu Popa, Eugene Belilovsky Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning
Yushen Zuo, Jun Xiao, Kin-Chung Chan, Rongkang Dong, Cuixin Yang, Zongqi He, Hao Xie, Kin-Man Lam Towards Multimodal In-Context Learning for Vision and Language Models
Sivan Doveh, Shaked Perek, Muhammad Jehanzeb Mirza, Wei Lin, Amit Alfassy, Assaf Arbelle, Shimon Ullman, Leonid Karlinsky Towards Real-Time Online Egocentric Action Recognition on Smart Eyewear
Riccardo Santambrogio, Federico Caspani, Greta Corti, Francesca Palermo, Simone Mentasti, Diana Trojaniello, Matteo Matteucci Towards Resource-Aware Visual Inertial SLAM
Giovanni Affatato, Marco Paracchini, Francesca Palermo, Diana Trojaniello, Tommaso Ongarello, Marco Marcon, Stefano Tubaro Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Junrui Zhang, Jiaqi Li, Yachuan Huang, Yiran Wang, Jinghong Zheng, Liao Shen, Zhiguo Cao Towards Unsupervised Eye-Region Segmentation for Eye Tracking
Jiangfan Deng, Zhuang Jia, Zhaoxue Wang, Xiang Long, Daniel K. Du Tracking-Assisted Object Detection with Event Cameras
Ting-Kang Yen, Igor Morawski, Shusil Dangi, Kai He, Chung-Yi Lin, Jia-Fong Yeh, Hung-Ting Su, Winston H. Hsu Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection
Sondos Mohamed, Walter Zimmer, Ross Greer, Ahmed Alaaeldin Ghita, Modesto Castrillón Santana, Mohan M. Trivedi, Alois Knoll, Salvatore M. Carta, Mirko Marras TRICKY 2024 Challenge on Monocular Depth from Images of Specular and Transparent Surfaces
Pierluigi Zama Ramirez, Alex Costanzino, Fabio Tosi, Matteo Poggi, Luigi Di Stefano, Jean-Baptiste Weibel, Dominik Bauer, Doris Antensteiner, Markus Vincze, Jiaqi Li, Yachuan Huang, Junrui Zhang, Yiran Wang, Jinghong Zheng, Liao Shen, Zhiguo Cao, Ziyang Song, Zerong Wang, Ruijie Zhu, Hao Zhang, Rui Li, Jiang Wu, Xian Li, Yu Zhu, Jinqiu Sun, Yanning Zhang, Pihai Sun, Yuanqi Yao, Wenbo Zhao, Kui Jiang, Junjun Jiang, Mykola Lavreniuk, Pengzhi Li, Jui-Lin Wang Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO
Julian Moosmann, Pietro Bonazzi, Yawei Li, Sizhen Bian, Philipp Mayer, Luca Benini, Michele Magno Underwater Uncertainty: A Multi-Annotator Image Dataset for Benthic Habitat Classification
Galadrielle Humblot-Renaux, Anders Skaarup Johansen, Jonathan Eichild Schmidt, Amanda Frederikke Irlind, Niels Madsen, Thomas B. Moeslund, Malte Pedersen Unlocking Comics: The AI4VA Dataset for Visual Understanding
Peter Grönquist, Deblina Bhattacharjee, Bahar Aydemir, Baran Ozaydin, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk UTrack: Multi-Object Tracking with Uncertain Detections
Edgardo Solano-Carrillo, Felix Sattler, Antje Alex, Alexander Klein, Bruno Pereira Costa, Ángel Bueno Rodríguez, Jannis Stoppe Valeo4Cast: A Modular Approach to End-to-End Forecasting
Yihong Xu, Éloi Zablocki, Alexandre Boulch, Gilles Puy, Mickaël Chen, Florent Bartoccioni, Nermin Samet, Oriane Siméoni, Spyros Gidaris, Tuan-Hung Vu, Andrei Bursuc, Eduardo Valle, Renaud Marlet, Matthieu Cord VATE: A Large Scale Multimodal Spontaneous Dataset for Affective Evaluation
Francesco Agnelli, Giuliano Grossi, Alessandro D'Amelio, Marco De Paoli, Raffaella Lanzarotti Vibration Vision: Real-Time Machinery Fault Diagnosis with Event Cameras
Muhammad Aitsam, Gaurvi Goyal, Chiara Bartolozzi, Alessandro G. Di Nuovo Video Editing for Video Retrieval
Bin Zhu, Kevin Flanagan, Adriano Fragomeni, Michael Wray, Dima Damen Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods
Adam Phillips, Daniel Grandes Rodriguez, Miriam Sánchez-Manzano, Alan Salvadó, Manuel Garin, Gloria Haro, Coloma Ballester VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis
Donggoo Kang, Dasol Jeong, Hyunmin Lee, Sangwoo Park, Hasil Park, Sunkyu Kwon, Yeong-Joon Kim, Joonki Paik VQA-Driven Facet-Level Texture Segmentation in 3D Surfaces
Iyyakutti Iyappan Ganapathi, Sajid Javed, Syed Sadaf Ali, Mohamad Alansari, Said Boumaraf, Naoufel Werghi Watt for What: Rethinking Deep Learning's Energy-Performance Relationship
Shreyank N. Gowda, Xinyue Hao, Gen Li, Shashank Narayana Gowda, Xiaobo Jin, Laura Sevilla-Lara Wild Berry Image Dataset Collected in Finnish Forests and Peatlands Using Drones
Luigi Riz, Sergio Povoli, Andrea Caraffa, Davide Boscaini, Mohamed Lamine Mekhalfi, Paul Chippendale, Marjut Turtiainen, Birgitta Partanen, Laura Smith Ballester, Juan Fco. Blanes Noguera, Alessio Franchi, Elisa Castelli, Giacomo Piccinini, Luca Marchesotti, Micael Santos Couceiro, Fabio Poiesi xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations
Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong μgat: Improving Single-Page Document Parsing by Providing Multi-Page Context
Fabio Quattrini, Carmine Zaccagnino, Silvia Cascianelli, Laura Righi, Rita Cucchiara