S$^2$Transformer: Scalable Structured Transformers for Global Station Weather Forecasting

Abstract

Global Station Weather Forecasting (GSWF) is a key meteorological research area, critical to energy, aviation, and agriculture. Existing time series forecasting methods often ignore or unidirectionally model spatial correlation when conducting large-scale global station forecasting. This contradicts the intrinsic nature underlying observations of the global weather system, limiting forecast performance. To address this, we propose a novel Spatial Structured Attention Block in this paper. It partitions the spatial graph into a set of subgraphs and instantiates Intra-subgraph Attention to learn local spatial correlation within each subgraph, and aggregates nodes into subgraph representations for message passing among the subgraphs via Inter-subgraph Attention---considering both spatial proximity and global correlation. Building on this block, we develop a multiscale spatiotemporal forecasting model S$^2$Transformer by progressively expanding subgraph scales. The resulting model is both scalable and able to produce structured spatial correlation, and meanwhile, it is easy to implement. The experimental results show that it can achieve performance improvements up to 16.8% over time series forecasting baselines at low running costs.

Cite

Text

Chen et al. "S$^2$Transformer: Scalable Structured Transformers for Global Station Weather Forecasting." Transactions on Machine Learning Research, 2026.

Markdown

[Chen et al. "S$^2$Transformer: Scalable Structured Transformers for Global Station Weather Forecasting." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/chen2026tmlr-2transformer/)

BibTeX

@article{chen2026tmlr-2transformer,
  title     = {{S$^2$Transformer: Scalable Structured Transformers for Global Station Weather Forecasting}},
  author    = {Chen, Hongyi and Li, Xiucheng and Chen, Xinyang and Cheng, Yun and Li, Jing and Chen, Kehai and Nie, Liqiang},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/chen2026tmlr-2transformer/}
}