Transtreaming: Adaptive Delay-Aware Transformer for Real-Time Streaming Perception

Abstract

Real-time object detection is critical for the decision-making process for many real-world applications, such as collision avoidance and path planning in autonomous driving. This work presents an innovative real-time streaming perception method, Transtreaming, which addresses the challenge of real-time object detection with dynamic computational delays. The core innovation of Transtreaming lies in its adaptive delay-aware transformer, which can concurrently predict multiple future frames and select the output that best matches the real-world present time, compensating for any system-induced computational delays. The proposed model outperforms existing state-of-the-art methods, even in single-frame detection scenarios, by leveraging a transformer-based methodology. It demonstrates robust performance across a range of devices, from powerful V100 to modest 2080Ti, achieving the highest level of perceptual accuracy on all platforms. Unlike most state-of-the-art methods that struggle to complete computation within a single frame on less powerful devices, Transtreaming meets the stringent real-time processing requirements on all kinds of devices. The experimental results emphasize the system's adaptability and its potential to significantly improve the safety and reliability of many real-world systems, such as autonomous driving.

Cite

Text

Zhang et al. "Transtreaming: Adaptive Delay-Aware Transformer for Real-Time Streaming Perception." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I10.33105

Markdown

[Zhang et al. "Transtreaming: Adaptive Delay-Aware Transformer for Real-Time Streaming Perception." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhang2025aaai-transtreaming/) doi:10.1609/AAAI.V39I10.33105

BibTeX

@inproceedings{zhang2025aaai-transtreaming,
  title     = {{Transtreaming: Adaptive Delay-Aware Transformer for Real-Time Streaming Perception}},
  author    = {Zhang, Xiang and Cui, Yufei and Fu, Chenchen and Wang, Zihao and Sun, Yuyang and Liu, Xue and Wu, Weiwei},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {10185-10193},
  doi       = {10.1609/AAAI.V39I10.33105},
  url       = {https://mlanthology.org/aaai/2025/zhang2025aaai-transtreaming/}
}