Learning Keypoints for Multi-Agent Behavior Analysis Using Self-Supervision

Abstract

The study of social interactions and collective behaviors through multi-agent video analysis is crucial in biology. While self-supervised keypoint discovery has emerged as a promising solution to reduce the need for manual keypoint annotations existing methods often struggle with videos containing multiple interacting agents especially those of the same species and color. To address this we introduce B-KinD-multi a novel approach that leverages pre-trained video segmentation models to guide keypoint discovery in multi-agent scenarios. This eliminates the need for time-consuming manual annotations on new experimental settings and organisms. Extensive evaluations demonstrate improved keypoint regression and downstream behavioral classification in videos of flies mice and rats. Furthermore our method generalizes well to other species including ants bees and humans highlighting its potential for broad applications in automated keypoint annotation for multi-agent behavior analysis. Code available under: B-KinD-Multi

Cite

Text

Khalil et al. "Learning Keypoints for Multi-Agent Behavior Analysis Using Self-Supervision." Winter Conference on Applications of Computer Vision, 2025.

Markdown

[Khalil et al. "Learning Keypoints for Multi-Agent Behavior Analysis Using Self-Supervision." Winter Conference on Applications of Computer Vision, 2025.](https://mlanthology.org/wacv/2025/khalil2025wacv-learning/)

BibTeX

@inproceedings{khalil2025wacv-learning,
  title     = {{Learning Keypoints for Multi-Agent Behavior Analysis Using Self-Supervision}},
  author    = {Khalil, Daniel and Liu, Christina and Perona, Pietro and Sun, Jennifer and Marks, Markus},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2025},
  pages     = {578-588},
  url       = {https://mlanthology.org/wacv/2025/khalil2025wacv-learning/}
}