AllMatch: Exploiting All Unlabeled Data for Semi-Supervised Learning
Abstract
Text-to-Time Series generation holds significant potential to address challenges such as data sparsity, imbalance, and limited availability of multimodal time series data across domains. While diffusion models have achieved remarkable success in Text-to-X (e.g., vision and audio data) generation, their use in time series generation remains limit. Existing approaches face two critical limitations: (1) reliance on domain-specific captions that generalize poorly, and (2) inability to generate time series of arbitrary length, limiting real-world use. In this work, we first introduce a new multimodal dataset containing over 600,000 high-resolution text-time series pairs. Second, we propose Text-to-Series (T2S), a diffusion-based framework that bridges the gap between natural language and time series in a domain-agnostic manner. It employs a length-adaptive VAE to encode time series of varying lengths into consistent latent embeddings. On top of that, T2S effectively aligns textual representations with latent embeddings by utilizing Flow Matching and employing DiT as the denoiser. We train T2S in an interleaved paradigm across multiple lengths, allowing it to generate sequences of arbitrary lengths. Extensive evaluations demonstrate that T2S achieves state-of-the-art performance across 13 datasets spanning 12 domains.
Cite
Text
Wu and Cui. "AllMatch: Exploiting All Unlabeled Data for Semi-Supervised Learning." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/580Markdown
[Wu and Cui. "AllMatch: Exploiting All Unlabeled Data for Semi-Supervised Learning." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/wu2024ijcai-allmatch/) doi:10.24963/ijcai.2024/580BibTeX
@inproceedings{wu2024ijcai-allmatch,
title = {{AllMatch: Exploiting All Unlabeled Data for Semi-Supervised Learning}},
author = {Wu, Zhiyu and Cui, Jinshi},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {5245-5253},
doi = {10.24963/ijcai.2024/580},
url = {https://mlanthology.org/ijcai/2024/wu2024ijcai-allmatch/}
}