MDTree: A Masked Dynamic Autoregressive Model for Phylogenetic Inference

Abstract

Phylogenetic tree inference requires optimizing both branch lengths and topologies, yet traditional MCMC-based methods suffer from slow convergence and high computational cost. Recent deep learning approaches improve scalability but remain constrained: Bayesian models are computationally intensive, autoregressive methods depend on fixed species orders, and flow-based models underutilize genomic signals. Fixed-order autoregression introduces an inductive bias misaligned with evolutionary proximity: early misplacements distort subsequent attachment probabilities and compound topology errors (exposure bias). Absent sequence-informed priors, the posterior over the super-exponential topology space remains diffuse and multimodal, yielding high-variance gradients and sluggish convergence for both MCMC proposals and neural samplers. We propose MDTree, a masked dynamic autoregressive framework that integrates genomic priors into a Dynamic Ordering Network to learn biologically informed node sequences. A dynamic masking mechanism further enables parallel node insertion, improving efficiency without sacrificing accuracy. Experiments on standard benchmarks demonstrate that MDTree outperforms existing methods in accuracy and runtime while producing biologically coherent phylogenies, providing a scalable solution for large-scale evolutionary analysis.

Cite

Text

Zang et al. "MDTree: A Masked Dynamic Autoregressive Model for Phylogenetic Inference." Transactions on Machine Learning Research, 2025.

Markdown

[Zang et al. "MDTree: A Masked Dynamic Autoregressive Model for Phylogenetic Inference." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/zang2025tmlr-mdtree/)

BibTeX

@article{zang2025tmlr-mdtree,
  title     = {{MDTree: A Masked Dynamic Autoregressive Model for Phylogenetic Inference}},
  author    = {Zang, Zelin and Duan, ChenRui and Li, Siyuan and Wu, Jinlin and Ling, BingoWing-Kuen and Yang, Fuji and Luo, Jiebo and Lei, Zhen and Li, Stan Z.},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/zang2025tmlr-mdtree/}
}