TSLM: Tree-Structured Language Modeling for Divergent Thinking

Kim, Doyoung; Doo, JaeHyeok; Seo, Minjoon

ICLR 2026

Abstract

Language models generate reasoning sequentially, preventing them from decoupling irrelevant exploration paths during search. We introduce Tree-Structured Language Modeling (TSLM), which uses special tokens to encode branching structure, enabling models to generate and selectively expand multiple search paths within a single generation process. By training on complete search trees including both successful and failed attempts, TSLM learns to internalize systematic exploration without redundant recomputation of shared prefixes. TSLM achieves 100% accuracy on Game of 24 (vs. 17% sequential baseline), robust extrapolation to 20×20 grids (91.5% vs. 42.7% for Tree-of-Thought), and superior inference efficiency by avoiding the multiple independent forward passes required by external search methods. These results suggest a new paradigm of inference-time scaling for robust reasoning, demonstrating that supervised learning on complete tree-structured traces provides an efficient alternative for developing systematic exploration capabilities in language models.
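
As a rough illustration of what "special tokens to encode branching structure" could look like, the sketch below linearizes a small Game of 24 search tree into a single sequence. The token names `<branch>` and `</branch>`, the `Node` class, and the helper functions are hypothetical and chosen only for this example; the abstract does not specify the actual token vocabulary or serialization used by TSLM.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical marker tokens (assumption): the abstract does not name the real ones.
OPEN, CLOSE = "<branch>", "</branch>"


@dataclass
class Node:
    """One search step; children are the alternative continuations from it."""
    text: str
    children: List["Node"] = field(default_factory=list)


def serialize(node: Node) -> str:
    """Linearize the tree depth-first, wrapping each child subtree in branch markers.

    Every node's text is emitted exactly once, so a prefix shared by several
    branches is not repeated.
    """
    parts = [node.text]
    for child in node.children:
        parts.append(f"{OPEN} {serialize(child)} {CLOSE}")
    return " ".join(parts)


def flatten_paths(node: Node, prefix: tuple = ()) -> List[str]:
    """Comparison case: write every root-to-leaf path as an independent sequence."""
    path = prefix + (node.text,)
    if not node.children:
        return [" ".join(path)]
    paths: List[str] = []
    for child in node.children:
        paths.extend(flatten_paths(child, path))
    return paths


# Toy search tree for the inputs 1 2 3 4, containing failed and successful attempts.
tree = Node("1 2 3 4", [
    Node("1*2=2", [
        Node("2*3=6", [Node("6*4=24 success")]),
        Node("3+4=7", [Node("2*7=14 fail")]),
    ]),
    Node("1+2=3", [
        Node("3*3=9", [Node("9+4=13 fail")]),
    ]),
])

tree_sequence = serialize(tree)
independent_paths = flatten_paths(tree)
```

In this toy setup the root state `1 2 3 4` appears once in `tree_sequence` but once per path in `independent_paths`, which mirrors, in a very simplified way, the abstract's point about avoiding redundant recomputation of shared prefixes.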

Cite

Text

Kim et al. "TSLM: Tree-Structured Language Modeling for Divergent Thinking." International Conference on Learning Representations, 2026.

Markdown

[Kim et al. "TSLM: Tree-Structured Language Modeling for Divergent Thinking." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/kim2026iclr-tslm/)

BibTeX

@inproceedings{kim2026iclr-tslm,
  title     = {{TSLM: Tree-Structured Language Modeling for Divergent Thinking}},
  author    = {Kim, Doyoung and Doo, JaeHyeok and Seo, Minjoon},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/kim2026iclr-tslm/}
}