Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings
Abstract
Selective state-space models excel at long-sequence modeling, but their capacity for language representation, particularly in complex hierarchical reasoning, remains underexplored. Most large language models rely on flat Euclidean embeddings, limiting their ability to capture latent hierarchies. To address this, we propose Hierarchical Mamba (HiM), which integrates the efficient Mamba2 with hyperbolic geometry to learn hierarchy-aware language embeddings for deeper linguistic understanding. Mamba2-processed sequences are projected onto the Poincaré ball or the Lorentzian manifold with learnable curvature and optimized with a hyperbolic loss. HiM captures relational distances across varying hierarchical levels, enabling effective long-range reasoning for tasks such as mixed-hop prediction and multi-hop inference in hierarchical classification. Experimental results show that both HiM variants effectively capture hierarchical relationships across four linguistic and medical datasets, surpassing Euclidean baselines: HiM-Poincaré provides fine-grained distinctions with higher h-norms, while HiM-Lorentz offers more stable, compact, and hierarchy-preserving embeddings.
Cite
Text
Patil et al. "Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings." Transactions on Machine Learning Research, 2026.
Markdown
[Patil et al. "Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/patil2026tmlr-hierarchical/)
BibTeX
@article{patil2026tmlr-hierarchical,
title = {{Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings}},
author = {Patil, Sarang Rajendra and Pandey, Ashish Parmanand and Koutis, Ioannis and Xu, Mengjia},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/patil2026tmlr-hierarchical/}
}