Detector-in-the-Loop Tracking: Active Memory Rectification for Stable Glottic Opening Localization
Abstract
Temporal stability in glottic opening localization remains challenging due to the complementary weaknesses of single-frame detectors and foundation-model trackers: the former lacks temporal context, while the latter suffers from memory drift. Specifically, in video laryngoscopy, rapid tissue deformation, occlusions, and visual ambiguities in emergency settings require a robust, temporally aware solution that can prevent progressive tracking errors. We propose Closed-Loop Memory Correction (CL-MC), a detector-in-the-loop framework that supervises Segment Anything Model 2(SAM2) through confidence-aligned state decisions and active memory rectification. High-confidence detections trigger semantic resets that overwrite corrupted tracker memory, effectively mitigating drift accumulation with a training-free foundation tracker in complex endoscopic scenes. On emergency intubation videos, CL-MC achieves state-of-the-art performance, significantly reducing drift and missing rate compared with the SAM2 variants and open loop based methods. Our results establish memory correction as a crucial component for reliable clinical video tracking.
Cite
Text
Wang et al. "Detector-in-the-Loop Tracking: Active Memory Rectification for Stable Glottic Opening Localization." Proceedings of The 9th International Conference on Medical Imaging with Deep Learning, 2026.Markdown
[Wang et al. "Detector-in-the-Loop Tracking: Active Memory Rectification for Stable Glottic Opening Localization." Proceedings of The 9th International Conference on Medical Imaging with Deep Learning, 2026.](https://mlanthology.org/midl/2026/wang2026midl-detectorintheloop/)BibTeX
@inproceedings{wang2026midl-detectorintheloop,
title = {{Detector-in-the-Loop Tracking: Active Memory Rectification for Stable Glottic Opening Localization}},
author = {Wang, Huayu and Alattar, Bahaa and Yang, Cheng-Yen and Huang, Hsiang-Wei and Kim, Jung Heon and Shapiro, Linda and White, Nathan and Hwang, Jenq-Neng},
booktitle = {Proceedings of The 9th International Conference on Medical Imaging with Deep Learning},
year = {2026},
pages = {3750-3763},
volume = {315},
url = {https://mlanthology.org/midl/2026/wang2026midl-detectorintheloop/}
}