Solving the Tree Containment Problem Using Graph Neural Networks

Arkadiy Dushatskiy, Esther Julien, Leen Stougie, Leo van Iersel

TMLR 2024

/tmlr/2024/dushatskiy2024tmlr-solving/

Abstract

\textsc{Tree containment} is a fundamental problem in phylogenetics useful for verifying a proposed phylogenetic network, representing the evolutionary history of certain species. \textsc{Tree containment} asks whether the given phylogenetic tree (for instance, constructed from a DNA fragment showing tree-like evolution) is contained in the given phylogenetic network. In the general case, this is an NP-complete problem. We propose to solve it approximately using Graph Neural Networks. In particular, we propose to combine the given network and the tree and apply a Graph Neural Network to this network-tree graph. This way, we achieve the capability of solving the tree containment instances representing a larger number of species than the instances contained in the training dataset (i.e., our algorithm has the inductive learning ability). Our algorithm demonstrates an accuracy of over $95\%$ in solving the tree containment problem on instances with up to 100 leaves.

PDF TMLR Code Semantic Scholar

Cite

Text

Dushatskiy et al. "Solving the Tree Containment Problem Using Graph Neural Networks." Transactions on Machine Learning Research, 2024.

Markdown

[Dushatskiy et al. "Solving the Tree Containment Problem Using Graph Neural Networks." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/dushatskiy2024tmlr-solving/)

BibTeX

@article{dushatskiy2024tmlr-solving,
  title     = {{Solving the Tree Containment Problem Using Graph Neural Networks}},
  author    = {Dushatskiy, Arkadiy and Julien, Esther and Stougie, Leen and van Iersel, Leo},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/dushatskiy2024tmlr-solving/}
}