Real-Time Object Detection and Skeletonization for Motion Prediction in Video Streaming
Abstract
The increasing demand for real-time analysis in video streaming has driven significant advancements in object detection and motion prediction. This paper presents SkelAI, an innovative application that combines YOLOv8, OpenCV, OpenAI API, and our own innovative algorithms to achieve real-time object detection and medial axis skeletonization tailored explicitly for live video streaming environments. In addition, SkelAI integrates AI-generated image capabilities through the DALL-E 3 model, enabling the extraction of skeletons from synthetic content that simulates streaming scenarios. The application supports exporting skeleton data in PyTorch-compatible formats, facilitating the training of sequence predicting deep learning models. Comprehensive evaluations demonstrate SkelAI’s enhanced accuracy, efficiency, and versatility compared to existing tools, underscoring its potential applications in digital animation, biomechanical research and robotics, human-computer interaction, and video compression within streaming platforms.
Cite
Text
Wong et al. "Real-Time Object Detection and Skeletonization for Motion Prediction in Video Streaming." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35376Markdown
[Wong et al. "Real-Time Object Detection and Skeletonization for Motion Prediction in Video Streaming." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/wong2025aaai-real/) doi:10.1609/AAAI.V39I28.35376BibTeX
@inproceedings{wong2025aaai-real,
title = {{Real-Time Object Detection and Skeletonization for Motion Prediction in Video Streaming}},
author = {Wong, Gavin and Kumar, Yulia and Li, J. Jenny and Kruger, Dov},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {29712-29714},
doi = {10.1609/AAAI.V39I28.35376},
url = {https://mlanthology.org/aaai/2025/wong2025aaai-real/}
}