Combining Online and Offline Learning for Tracking a Talking Face in Video

Abstract

Facial appearance changes in video sequences represent a non stationary data problem, because of factors such as variations in pose, illumination and facial expressions. While most algorithm, that employ fixed appearance models of the target object, are not robust to track objects in uncontrolled environments. Existing Adaptive Appearance Models (AAMs) approaches solve this problem to an extent. However, they are not able to detect a misalignment, partial occlusions and do not adequately track facial feature points such as those relating to the eyes or mouth in the presence of significant expression changes. In this paper, we propose to combine an online and an offline learning approches for robust tracking of feature points of a talking face. The online learning used in a stochastic approach to estimate facial feature points globally and in a deterministic approach to refine the feature points. The tracked results are filtered by offline learning approach to ensure rejection of poorly aligned targets. This allows the proposed tracker to significantly improves robustness against appearance changes and occlusions. Experiment results on tracking facial feature points in long video sequences with a wide range of facial expressions in head movement demonstrate the effectiveness and robustness of our tracker.

Cite

Text

Nguyen and Milgram. "Combining Online and Offline Learning for Tracking a Talking Face in Video." IEEE/CVF International Conference on Computer Vision Workshops, 2009. doi:10.1109/ICCVW.2009.5457448

Markdown

[Nguyen and Milgram. "Combining Online and Offline Learning for Tracking a Talking Face in Video." IEEE/CVF International Conference on Computer Vision Workshops, 2009.](https://mlanthology.org/iccvw/2009/nguyen2009iccvw-combining/) doi:10.1109/ICCVW.2009.5457448

BibTeX

@inproceedings{nguyen2009iccvw-combining,
  title     = {{Combining Online and Offline Learning for Tracking a Talking Face in Video}},
  author    = {Nguyen, Quoc Dinh and Milgram, Maurice},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2009},
  pages     = {1401-1408},
  doi       = {10.1109/ICCVW.2009.5457448},
  url       = {https://mlanthology.org/iccvw/2009/nguyen2009iccvw-combining/}
}