InterDiff: Synthesizing Financial Time Series with Inter-Stock Correlations via Classifier-Free Guided Diffusion

Abstract

Stock prediction is hindered by data scarcity, and although existing data augmentation techniques have made significant strides, they often overlook the dynamic inter-stock interactions crucial for robust modeling. To address these challenges, we propose InterDiff , a diffusion-based framework that synthesizes realistic financial time series by dynamically modeling both intra- and inter-stock correlations. InterDiff employs hierarchical transformers to learn these correlations, encoding them into a guidance vector that steers a diffusion model via classifier-free guidance. This approach ensures that the synthetic data preserves fidelity while introducing controlled variability. Evaluations on CSI300 and CSI800 show that models trained on InterDiff-augmented data boost the information coefficient by 1.13–4.70% on CSI300 and 40.15–49.60% on CSI800, while delivering cumulative return improvements of 0.57–13.87% on CSI300 and 28.72–51.33% on CSI800 under 0.1% per-trade cost. The framework outperforms alternatives such as DiffsFormer and Quant GAN. Ablation studies reveal a fidelity-diversity tradeoff: while larger guidance strength improves synthetic data fidelity, it does not necessarily enhance prediction performance. Visualizations confirm the preservation of inter-stock correlations and a reduction in overfitting. These results demonstrate InterDiff’s ability to enhance robustness and profitability in real-world trading environments and mitigate data scarcity.

Cite

Text

Long et al. "InterDiff: Synthesizing Financial Time Series with Inter-Stock Correlations via Classifier-Free Guided Diffusion." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06118-8_13

Markdown

[Long et al. "InterDiff: Synthesizing Financial Time Series with Inter-Stock Correlations via Classifier-Free Guided Diffusion." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/long2025ecmlpkdd-interdiff/) doi:10.1007/978-3-032-06118-8_13

BibTeX

@inproceedings{long2025ecmlpkdd-interdiff,
  title     = {{InterDiff: Synthesizing Financial Time Series with Inter-Stock Correlations via Classifier-Free Guided Diffusion}},
  author    = {Long, Hou-Wan and Tang, Zhoufei and Zhang, Jianhui and Zhan, Zhuoyang and Lu, Tao and Zhang, Xiaoquan Michael},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2025},
  pages     = {212-229},
  doi       = {10.1007/978-3-032-06118-8_13},
  url       = {https://mlanthology.org/ecmlpkdd/2025/long2025ecmlpkdd-interdiff/}
}