LaMMOn: Language Model Combined Graph Neural Network for Multi-Target Multi-Camera Tracking in Online Scenarios

Abstract

Multi-target multi-camera tracking is crucial to intelligent transportation systems. Numerous recent studies have been undertaken to address this issue. Nevertheless, using the approaches in real-world situations is challenging due to the scarcity of publicly available data and the laborious process of manually annotating the new dataset and creating a tailored rule-based matching system for each camera scenario. To address this issue, we present a novel solution termed LaMMOn, an end-to-end transformer and graph neural network-based multi-camera tracking model. LaMMOn consists of three main modules: (1) Language Model Detection (LMD) for object detection; (2) Language and Graph Model Association module (LGMA) for object tracking and trajectory clustering; (3) Text-to-embedding module (T2E) that overcome the problem of data limitation by synthesizing the object embedding from defined texts. LaMMOn can be run online in real-time scenarios and achieve a competitive result on many datasets, e.g., CityFlow (HOTA 76.46%), I24 (HOTA 25.7%), and TrackCUIP (HOTA 80.94%) with an acceptable FPS (from 12.20 to 13.37) for an online application.

Cite

Text

Nguyen et al. "LaMMOn: Language Model Combined Graph Neural Network for Multi-Target Multi-Camera Tracking in Online Scenarios." Machine Learning, 2024. doi:10.1007/S10994-024-06592-1

Markdown

[Nguyen et al. "LaMMOn: Language Model Combined Graph Neural Network for Multi-Target Multi-Camera Tracking in Online Scenarios." Machine Learning, 2024.](https://mlanthology.org/mlj/2024/nguyen2024mlj-lammon/) doi:10.1007/S10994-024-06592-1

BibTeX

@article{nguyen2024mlj-lammon,
  title     = {{LaMMOn: Language Model Combined Graph Neural Network for Multi-Target Multi-Camera Tracking in Online Scenarios}},
  author    = {Nguyen, Tuan T. and Nguyen, Hoang H. and Sartipi, Mina and Fisichella, Marco},
  journal   = {Machine Learning},
  year      = {2024},
  pages     = {6811-6837},
  doi       = {10.1007/S10994-024-06592-1},
  volume    = {113},
  url       = {https://mlanthology.org/mlj/2024/nguyen2024mlj-lammon/}
}