Multi-Label Learning from Medical Plain Text with Convolutional Residual Models

Abstract

Predicting diagnoses from Electronic Health Records (EHRs) is an important medical application of multi-label learning. We propose a convolutional residual model for multi-label classification from doctor notes in EHR data. A given patient may have multiple diagnoses, and therefore multi-label learning is required. We employ a Convolutional Neural Network (CNN) to encode plain text into a fixed-length sentence embedding vector. Since diagnoses are typically correlated, a deep residual network is employed on top of the CNN encoder, to capture label (diagnosis) dependencies and incorporate information directly from the encoded sentence vector. A real EHR dataset is considered, and we compare the proposed model with several well-known baselines, to predict diagnoses based on doctor notes. Experimental results demonstrate the superiority of the proposed convolutional residual model.

Cite

Text

Zhang et al. "Multi-Label Learning from Medical Plain Text with Convolutional Residual Models." Proceedings of the 3rd Machine Learning for Healthcare Conference, 2018.

Markdown

[Zhang et al. "Multi-Label Learning from Medical Plain Text with Convolutional Residual Models." Proceedings of the 3rd Machine Learning for Healthcare Conference, 2018.](https://mlanthology.org/mlhc/2018/zhang2018mlhc-multilabel/)

BibTeX

@inproceedings{zhang2018mlhc-multilabel,
  title     = {{Multi-Label Learning from Medical Plain Text with Convolutional Residual Models}},
  author    = {Zhang, Yinyuan and Henao, Ricardo and Gan, Zhe and Li, Yitong and Carin, Lawrence},
  booktitle = {Proceedings of the 3rd Machine Learning for Healthcare Conference},
  year      = {2018},
  pages     = {280-294},
  volume    = {85},
  url       = {https://mlanthology.org/mlhc/2018/zhang2018mlhc-multilabel/}
}