ML Anthology
Authors
Search
About
Jenni, Simon
18 publications
ICCV
2025
Improving Large Vision and Language Models by Learning from a Panel of Peers
Jefferson Hernandez
,
Jing Shi
,
Simon Jenni
,
Vicente Ordonez
,
Kushal Kafle
NeurIPS
2025
The Indra Representation Hypothesis for Multimodal Alignment
Jianglin Lu
,
Hailing Wang
,
Kuo Yang
,
Yitian Zhang
,
Simon Jenni
,
Yun Fu
CVPR
2025
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique like Photographers
Daiqing Qi
,
Handong Zhao
,
Jing Shi
,
Simon Jenni
,
Yifei Fan
,
Franck Dernoncourt
,
Scott Cohen
,
Sheng Li
CVPRW
2025
ViDROP: Video Dense Representation Through Spatio-Temporal Sparsity
Sepehr Sameni
,
Simon Jenni
,
Paolo Favaro
CVPR
2024
Building Vision-Language Models on Solid Foundations with Masked Distillation
Sepehr Sameni
,
Kushal Kafle
,
Hao Tan
,
Simon Jenni
CVPR
2024
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Gihyun Kwon
,
Simon Jenni
,
Dingzeyu Li
,
Joon-Young Lee
,
Jong Chul Ye
,
Fabian Caba Heilbron
ECCV
2024
FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction
Hang Hua
,
Jing Shi
,
Kushal Kafle
,
Simon Jenni
,
Daoan Zhang
,
John Collomosse
,
Scott Cohen
,
Jiebo Luo
AAAI
2024
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision
Ishan Rajendrakumar Dave
,
Simon Jenni
,
Mubarak Shah
ECCV
2024
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Ishan Rajendrakumar Dave
,
Fabian Caba
,
Mubarak Shah
,
Simon Jenni
AAAI
2023
Audio-Visual Contrastive Learning with Temporal Self-Supervision
Simon Jenni
,
Alexander Black
,
John P. Collomosse
CVPRW
2023
EKILA: Synthetic Media Provenance and Attribution for Generative Art
Kar Balan
,
Shruti Agarwal
,
Simon Jenni
,
Andy Parsons
,
Andrew Gilbert
,
John P. Collomosse
CVPR
2023
Meta-Personalizing Vision-Language Models to Find Named Instances in Video
Chun-Hsiao Yeh
,
Bryan Russell
,
Josef Sivic
,
Fabian Caba Heilbron
,
Simon Jenni
AAAI
2023
Representation Learning by Detecting Incorrect Location Embeddings
Sepehr Sameni
,
Simon Jenni
,
Paolo Favaro
ICCV
2023
Spatio-Temporal Crop Aggregation for Video Representation Learning
Sepehr Sameni
,
Simon Jenni
,
Paolo Favaro
ICCV
2023
VADER: Video Alignment Differencing and Retrieval
Alexander Black
,
Simon Jenni
,
Tu Bui
,
Md. Mehrab Tanjim
,
Stefano Petrangeli
,
Ritwik Sinha
,
Viswanathan Swaminathan
,
John Collomosse
ICCV
2021
Time-Equivariant Contrastive Video Representation Learning
Simon Jenni
,
Hailin Jin
ECCV
2020
Learning Video Representations by Transforming Time
Simon Jenni
,
Givi Meishvili
,
Paolo Favaro
ECCV
2018
Deep Bilevel Learning
Simon Jenni
,
Paolo Favaro