P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification

Abstract

Age estimation is a challenging task that has numerous applications. In this paper, we propose a new direction for age classification that utilizes a video-based model to address challenges such as occlusions, low-resolution, and lighting conditions. To address these challenges, we propose AgeFormer which utilizes spatio-temporal information on the dynamics of the entire body dominating face-based methods for age classification. Our novel two-stream architecture uses TimeSformer and EfficientNet as backbones, to effectively capture both facial and body dynamics information for efficient and accurate age estimation in videos. Furthermore, to fill the gap in predicting age in real-world situations from videos, we construct a video dataset called Pexels Age (P-Age) for age classification. The proposed method achieves superior results compared to existing face-based age estimation methods and is evaluated in situations where the face is highly occluded, blurred, or masked. The method is also cross-tested on a variety of challenging video datasets such as Charades, Smarthome, and Thumos-14.

Cite

Text

Ali et al. "P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification." Winter Conference on Applications of Computer Vision, 2024.

Markdown

[Ali et al. "P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/ali2024wacv-page/)

BibTeX

@inproceedings{ali2024wacv-page,
  title     = {{P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification}},
  author    = {Ali, Abid and Marisetty, Ashish and Brémond, François},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2024},
  pages     = {8606-8615},
  url       = {https://mlanthology.org/wacv/2024/ali2024wacv-page/}
}