Ehrenfeucht-Haussler Rank and Chain of Thought

Abstract

The notion of rank of a Boolean function has been a cornerstone in PAC learning, enabling quasipolynomial-time learning algorithms for polynomial-size decision trees. We present a novel characterization of rank, grounded in the well-known Transformer architecture. We show that the rank of a function $f$ corresponds to the minimum number of Chain of Thought (CoT) steps required by a single-layer Transformer with hard attention to compute $f$. Based on this characterization we establish tight bounds on the number of CoT steps required for specific problems, showing that $\ell$-fold function composition necessitates exactly $\ell$ CoT steps. Furthermore, we analyze the problem of identifying the position of the $k$-th occurrence of 1 in a Boolean sequence, proving that it requires $k$ CoT steps.

Cite

Text

Barcelo et al. "Ehrenfeucht-Haussler Rank and Chain of Thought." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Barcelo et al. "Ehrenfeucht-Haussler Rank and Chain of Thought." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/barcelo2025icml-ehrenfeuchthaussler/)

BibTeX

@inproceedings{barcelo2025icml-ehrenfeuchthaussler,
  title     = {{Ehrenfeucht-Haussler Rank and Chain of Thought}},
  author    = {Barcelo, Pablo and Kozachinskiy, Alexander and Steifer, Tomasz},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {2968-2977},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/barcelo2025icml-ehrenfeuchthaussler/}
}