Harnessing Vision Models for Time Series Analysis: A Survey

Abstract

Time series analysis has evolved from traditional autoregressive models to deep learning, Transformers, and Large Language Models (LLMs). While vision models have also been explored along the way, their contributions are less recognized due to the predominance of sequence modeling. However, challenges such as the mismatch between continuous time series and LLMs’ discrete token space, and the difficulty in capturing multivariate correlations, have led to growing interest in Large Vision Models (LVMs) and Vision-Language Models (VLMs). This survey highlights the advantages of vision models over LLMs in time series analysis, offering a comprehensive dual-view taxonomy that answers key research questions like how to encode time series as images and how to model imaged time series. Additionally, we address pre- and post-processing challenges in this framework and outline future directions for advancing the field.
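To make the question of "how to encode time series as images" concrete, the following is a minimal sketch of one well-known imaging method, the Gramian Angular Summation Field. The abstract does not name this method, so its choice here is an assumption made purely for illustration. The function name `gramian_angular_summation_field` and the toy input are hypothetical.

```python
import math


def gramian_angular_summation_field(series):
    """Encode a 1-D series as a 2-D image with G[i][j] = cos(phi_i + phi_j).

    Illustrative sketch only; not taken from the survey itself.
    """
    lo, hi = min(series), max(series)
    if hi == lo:
        # Constant series: no range to rescale, so map every point to 0.
        scaled = [0.0 for _ in series]
    else:
        # Min-max rescale into [-1, 1] so that arccos is defined.
        scaled = [2.0 * (x - lo) / (hi - lo) - 1.0 for x in series]
    # Clamp to guard against floating-point values just outside [-1, 1].
    scaled = [max(-1.0, min(1.0, s)) for s in scaled]
    # Polar encoding: each rescaled value becomes an angle in [0, pi].
    phi = [math.acos(s) for s in scaled]
    # Pairwise angular sums form a symmetric T x T "image".
    return [[math.cos(a + b) for b in phi] for a in phi]


# Toy input (hypothetical): a short linear ramp.
series = [0.0, 1.0, 2.0, 3.0, 4.0]
image = gramian_angular_summation_field(series)
```

A series of length T becomes a symmetric T x T matrix that can be treated as a single-channel image and passed to a vision model, typically after resizing or stacking channels for multivariate input.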

Cite

Text

Ni et al. "Harnessing Vision Models for Time Series Analysis: A Survey." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1178

Markdown

[Ni et al. "Harnessing Vision Models for Time Series Analysis: A Survey." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/ni2025ijcai-harnessing/) doi:10.24963/IJCAI.2025/1178

BibTeX

@inproceedings{ni2025ijcai-harnessing,
  title     = {{Harnessing Vision Models for Time Series Analysis: A Survey}},
  author    = {Ni, Jingchao and Zhao, Ziming and Shen, ChengAo and Tong, Hanghang and Song, Dongjin and Cheng, Wei and Luo, Dongsheng and Chen, Haifeng},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {10612--10620},
  doi       = {10.24963/IJCAI.2025/1178},
  url       = {https://mlanthology.org/ijcai/2025/ni2025ijcai-harnessing/}
}