Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos
Abstract
Self-supervised learning holds the promise of learning good representations from real-world continuous uncurated data streams. However, most existing works in visual self-supervised learning focus on static images or artificial data streams. Towards exploring a more realistic learning substrate, we investigate streaming self-supervised learning from long-form real-world egocentric video streams. Inspired by the event segmentation mechanism in human perception and memory, we propose “Memory Storyboard,” a novel continual self-supervised learning framework that groups recent past frames into temporal segments for a more effective summarization of the past visual streams for memory replay. To accommodate efficient temporal segmentation, we propose a two-tier memory hierarchy: the recent past is stored in a short-term memory, where the storyboard temporal segments are produced and then transferred to a long-term memory. Experiments on two real-world egocentric video datasets show that contrastive learning objectives on top of storyboard frames result in semantically meaningful representations that outperform those produced by state-of-the-art unsupervised continual learning methods.
Cite
Text
Yang and Ren. "Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos." Proceedings of The 4th Conference on Lifelong Learning Agents, 2025.Markdown
[Yang and Ren. "Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos." Proceedings of The 4th Conference on Lifelong Learning Agents, 2025.](https://mlanthology.org/collas/2025/yang2025collas-memory/)BibTeX
@inproceedings{yang2025collas-memory,
title = {{Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos}},
author = {Yang, Yanlai and Ren, Mengye},
booktitle = {Proceedings of The 4th Conference on Lifelong Learning Agents},
year = {2025},
pages = {182-203},
volume = {330},
url = {https://mlanthology.org/collas/2025/yang2025collas-memory/}
}