Jenni, Simon

18 publications

ICCV 2025 Improving Large Vision and Language Models by Learning from a Panel of Peers Jefferson Hernandez, Jing Shi, Simon Jenni, Vicente Ordonez, Kushal Kafle
NeurIPS 2025 The Indra Representation Hypothesis for Multimodal Alignment Jianglin Lu, Hailing Wang, Kuo Yang, Yitian Zhang, Simon Jenni, Yun Fu
CVPR 2025 The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique like Photographers Daiqing Qi, Handong Zhao, Jing Shi, Simon Jenni, Yifei Fan, Franck Dernoncourt, Scott Cohen, Sheng Li
CVPRW 2025 ViDROP: Video Dense Representation Through Spatio-Temporal Sparsity Sepehr Sameni, Simon Jenni, Paolo Favaro
CVPR 2024 Building Vision-Language Models on Solid Foundations with Masked Distillation Sepehr Sameni, Kushal Kafle, Hao Tan, Simon Jenni
CVPR 2024 Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Gihyun Kwon, Simon Jenni, Dingzeyu Li, Joon-Young Lee, Jong Chul Ye, Fabian Caba Heilbron
ECCV 2024 FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo
AAAI 2024 No More Shortcuts: Realizing the Potential of Temporal Self-Supervision Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah
ECCV 2024 Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets Ishan Rajendrakumar Dave, Fabian Caba, Mubarak Shah, Simon Jenni
AAAI 2023 Audio-Visual Contrastive Learning with Temporal Self-Supervision Simon Jenni, Alexander Black, John P. Collomosse
CVPRW 2023 EKILA: Synthetic Media Provenance and Attribution for Generative Art Kar Balan, Shruti Agarwal, Simon Jenni, Andy Parsons, Andrew Gilbert, John P. Collomosse
CVPR 2023 Meta-Personalizing Vision-Language Models to Find Named Instances in Video Chun-Hsiao Yeh, Bryan Russell, Josef Sivic, Fabian Caba Heilbron, Simon Jenni
AAAI 2023 Representation Learning by Detecting Incorrect Location Embeddings Sepehr Sameni, Simon Jenni, Paolo Favaro
ICCV 2023 Spatio-Temporal Crop Aggregation for Video Representation Learning Sepehr Sameni, Simon Jenni, Paolo Favaro
ICCV 2023 VADER: Video Alignment Differencing and Retrieval Alexander Black, Simon Jenni, Tu Bui, Md. Mehrab Tanjim, Stefano Petrangeli, Ritwik Sinha, Viswanathan Swaminathan, John Collomosse
ICCV 2021 Time-Equivariant Contrastive Video Representation Learning Simon Jenni, Hailin Jin
ECCV 2020 Learning Video Representations by Transforming Time Simon Jenni, Givi Meishvili, Paolo Favaro
ECCV 2018 Deep Bilevel Learning Simon Jenni, Paolo Favaro