Denoising Diffusion Models Are Good General Gaze Feature Learners

Abstract

Since collecting labeled gaze data is laborious and time-consuming, methods that can learn generalizable features from large-scale unlabeled data are desirable. In recent years, diffusion models have shown tremendous capabilities in image generation as well as potential for feature representation learning. In this paper, we investigate whether they can acquire discriminative representations for gaze estimation via generative pre-training. To this end, we propose GazeDiff, a self-supervised learning framework that uses diffusion models for gaze estimation. Specifically, as the pre-training task, we use a conditional diffusion model to generate a target image whose gaze direction is specified by a reference image. To help the diffusion model learn gaze-related features as its condition, we propose a disentangled feature learning strategy, which first learns appearance, head pose, and eye direction features separately and then combines them into the conditional features. Extensive experiments demonstrate that denoising diffusion models are also good general gaze feature learners.
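
As a rough illustration of the pre-training setup described in the abstract, the sketch below combines three disentangled feature vectors into one condition and applies a standard DDPM-style forward noising step with a noise-prediction loss. Everything specific here is an assumption made for illustration and is not taken from the paper: combining features by concatenation, the linear beta schedule, the noise-prediction objective, and all function names (`combine_condition`, `linear_alpha_bar`, `add_noise`, `denoising_loss`) are hypothetical. The denoising network itself is omitted.

```python
import math
import random


def combine_condition(appearance, head_pose, eye_direction):
    """Combine the three feature vectors into one condition.

    Concatenation is an assumption; the abstract only says they are combined.
    """
    return list(appearance) + list(head_pose) + list(eye_direction)


def linear_alpha_bar(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear beta schedule (assumed)."""
    alpha_bar, running = [], 1.0
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / max(num_steps - 1, 1)
        running *= 1.0 - beta
        alpha_bar.append(running)
    return alpha_bar


def add_noise(x0, t, alpha_bar, rng):
    """Sample x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps and return (x_t, eps)."""
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    a = math.sqrt(alpha_bar[t])
    b = math.sqrt(1.0 - alpha_bar[t])
    x_t = [a * x + b * e for x, e in zip(x0, eps)]
    return x_t, eps


def denoising_loss(eps_true, eps_pred):
    """Mean squared error between the true noise and a model's predicted noise."""
    return sum((p - q) ** 2 for p, q in zip(eps_true, eps_pred)) / len(eps_true)


# Toy usage with made-up numbers: a flattened 3-pixel "target image" and tiny features.
rng = random.Random(0)
condition = combine_condition([0.1, 0.2], [0.3], [0.4, 0.5])
alpha_bar = linear_alpha_bar(1000)
x_t, eps = add_noise([0.0, 1.0, -1.0], t=500, alpha_bar=alpha_bar, rng=rng)
# A conditional denoiser (not shown) would predict eps from (x_t, t, condition).
```

In a setup like this, the feature encoders would only receive gradients through the condition passed to the denoiser, which is one plausible way generative pre-training could shape gaze-related features.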

Cite

Text

Zeng et al. "Denoising Diffusion Models Are Good General Gaze Feature Learners." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/259

Markdown

[Zeng et al. "Denoising Diffusion Models Are Good General Gaze Feature Learners." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/zeng2025ijcai-denoising/) doi:10.24963/IJCAI.2025/259

BibTeX

@inproceedings{zeng2025ijcai-denoising,
  title     = {{Denoising Diffusion Models Are Good General Gaze Feature Learners}},
  author    = {Zeng, Guanzhong and Wang, Jingjing and Yin, Pengwei and Xu, Zefu and Zhou, Mingyang},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {2323-2331},
  doi       = {10.24963/IJCAI.2025/259},
  url       = {https://mlanthology.org/ijcai/2025/zeng2025ijcai-denoising/}
}