Improving Tandem Mass Spectra Analysis with Hierarchical Learning

Abstract

Tandem mass spectrometry is the most widely used technology to identify proteins in a complex biological sample, which produces a large number of spectra representative of protein subsequences named peptide. In this paper, we propose a hierarchical multi-stage framework, referred as DeepTag, to identify the peptide sequence for each given spectrum. Compared with the traditional one-stage generation, our sequencing model starts the inference with a selected high-confidence guiding tag and provides the complete sequence based on this guiding tag. Besides, we introduce a cross-modality refining module to asist the decoder focus on effective peaks and fine-tune with a reinforcement learning technique. Experiments on different public datasets demonstrate that our method achieves a new state-of-the-art performance in peptide identification task, leading to a marked improvement in terms of both precision and recall.

Cite

Text

Fei. "Improving Tandem Mass Spectra Analysis with Hierarchical Learning." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/599

Markdown

[Fei. "Improving Tandem Mass Spectra Analysis with Hierarchical Learning." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/fei2020ijcai-improving/) doi:10.24963/IJCAI.2020/599

BibTeX

@inproceedings{fei2020ijcai-improving,
  title     = {{Improving Tandem Mass Spectra Analysis with Hierarchical Learning}},
  author    = {Fei, Zhengcong},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {4345-4351},
  doi       = {10.24963/IJCAI.2020/599},
  url       = {https://mlanthology.org/ijcai/2020/fei2020ijcai-improving/}
}