A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications

Abstract

More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of these drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs. In many cases, there is already evidence of efficacy for cancer, but trying to manually extract such evidence from the scientific literature is intractable. In this emerging applications paper, we introduce a system to automate non-cancer generic drug evidence extraction from PubMed abstracts. Our primary contribution is to define the natural language processing pipeline required to obtain such evidence, comprising the following modules: querying, filtering, cancer type entity extraction, therapeutic association classification, and study type classification. Using the subject matter expertise on our team, we create our own datasets for these specialized domain-specific tasks. We obtain promising performance in each of the modules by utilizing modern language processing techniques and plan to treat them as baseline approaches for future improvement of individual components.

Cite

Text

Subramanian et al. "A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I08.7052

Markdown

[Subramanian et al. "A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/subramanian2020aaai-natural/) doi:10.1609/AAAI.V34I08.7052

BibTeX

@inproceedings{subramanian2020aaai-natural,
  title     = {{A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications}},
  author    = {Subramanian, Shivashankar and Baldini, Ioana and Ravichandran, Sushma and Katz-Rogozhnikov, Dmitriy A. and Ramamurthy, Karthikeyan Natesan and Sattigeri, Prasanna and Varshney, Kush R. and Wang, Annmarie and Mangalath, Pradeep and Kleiman, Laura B.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {13369-13381},
  doi       = {10.1609/AAAI.V34I08.7052},
  url       = {https://mlanthology.org/aaai/2020/subramanian2020aaai-natural/}
}