Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization

Abstract

Low-Dimension-to-High-Dimension (LDHD) generalization, a special case of Out-of-Distribution (OOD) generalization, involves training on a low-dimensional subspace and testing in a high-dimensional space. Assuming instances are generated from latent variables reflecting problem scale, LDHD generalization captures the inherent scaling challenge of length generalization. We theoretically show that LDHD generalization is unattainable without appropriate inductive bias. Focusing on Boolean functions, we demonstrate that different architectures trained with (S)GD converge to min-degree interpolators w.r.t. different linearly independent sets, achieving LDHD generalization only when the target function aligns with this bias. Viewing length generalization through the lens of LDHD generalization, we explain the success of chain-of-thought (CoT) as a restructuring of the latent space that improves LDHD generalization. We further propose a principle for designing position embeddings that addresses LDHD generalization and data format nuisances separately. Following this principle, we introduce RPE-Square, a novel embedding that enhances the relative position embedding (RPE) to better handle data formats.

Cite

Text

Chen et al. "Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Chen et al. "Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/chen2025icml-lowdimensiontohighdimension/)

BibTeX

@inproceedings{chen2025icml-lowdimensiontohighdimension,
  title     = {{Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization}},
  author    = {Chen, Yang and Yang, Long and Liang, Yitao and Lin, Zhouchen},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {9566--9589},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/chen2025icml-lowdimensiontohighdimension/}
}