Cross-Dimensional Self-Attention for Multivariate, Geo-Tagged Time Series Imputation
Abstract
Many real-world applications involve multivariate, geo-tagged time series data: at each location, multiple sensors record corresponding measurements. For example, air quality monitoring system records PM2.5, CO, etc. The resulting time-series data often has missing values due to device outages or communication errors. In order to impute the missing values, state-of-the-art methods are built on Recurrent Neural Networks (RNN), which process each time stamp sequentially, prohibiting the direct modeling of the relationship between distant time stamps. Recently, the self-attention mechanism has been proposed for sequence modeling tasks such as machine translation, significantly outperforming RNN because the relationship between each two time stamps can be modeled explicitly. In this paper, we are the first to adapt the self-attention mechanism for multivariate, geo-tagged time series data. In order to jointly capture the self-attention across different dimensions (i.e. time, location and sensor measurements) while keep the size of attention maps reasonable, we propose a novel approach called Cross-Dimensional Self-Attention (CDSA) to process each dimension sequentially, yet in an order-independent manner. On three real-world datasets, including one our newly collected NYC-traffic dataset, extensive experiments demonstrate the superiority of our approach compared to state-of-the-art methods for both imputation and forecasting tasks.
Cite
Text
Ma et al. "Cross-Dimensional Self-Attention for Multivariate, Geo-Tagged Time Series Imputation." International Conference on Learning Representations, 2020.Markdown
[Ma et al. "Cross-Dimensional Self-Attention for Multivariate, Geo-Tagged Time Series Imputation." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/ma2020iclr-crossdimensional/)BibTeX
@inproceedings{ma2020iclr-crossdimensional,
title = {{Cross-Dimensional Self-Attention for Multivariate, Geo-Tagged Time Series Imputation}},
author = {Ma, Jiawei and Shou, Zheng and Zareian, Alireza and Mansour, Hassan and Vetro, Anthony and Chang, Shih-Fu},
booktitle = {International Conference on Learning Representations},
year = {2020},
url = {https://mlanthology.org/iclr/2020/ma2020iclr-crossdimensional/}
}