Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Luo, Ge Ya; Favero, Gian Mario; Luo, ZhiHao; Jolicoeur-Martineau, Alexia; Pal, Christopher

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian Mario Favero, ZhiHao Luo, Alexia Jolicoeur-Martineau, Christopher Pal

ICLR 2025

/iclr/2025/luo2025iclr-beyond-a/

Abstract

The Fréchet Video Distance (FVD) is a widely adopted metric for evaluating video generation distribution quality. However, its effectiveness relies on critical assumptions. Our analysis reveals three significant limitations: (1) the non-Gaussianity of the Inflated 3D Convnet (I3D) feature space; (2) the insensitivity of I3D features to temporal distortions; (3) the impractical sample sizes required for reliable estimation. These findings undermine FVD's reliability and show that FVD falls short as a standalone metric for video generation evaluation. After extensive analysis of a wide range of metrics and backbone architectures, we propose JEDi, the JEPA Embedding Distance, based on features derived from a Joint Embedding Predictive Architecture, measured using Maximum Mean Discrepancy with polynomial kernel. Our experiments on multiple open-source datasets show clear evidence that it is a superior alternative to the widely used FVD metric, requiring only 16% of the samples to reach its steady value, while increasing alignment with human evaluation by 34%, on average. Project page: https://oooolga.github.io/JEDi.github.io/.

PDF ICLR Semantic Scholar

Cite

Text

Luo et al. "Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality." International Conference on Learning Representations, 2025.

Markdown

[Luo et al. "Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/luo2025iclr-beyond-a/)

BibTeX

@inproceedings{luo2025iclr-beyond-a,
  title     = {{Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality}},
  author    = {Luo, Ge Ya and Favero, Gian Mario and Luo, ZhiHao and Jolicoeur-Martineau, Alexia and Pal, Christopher},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/luo2025iclr-beyond-a/}
}