Unsupervised Risk Stratification in Clinical Datasets: Identifying Patients at Risk of Rare Outcomes
Abstract
Most existing algorithms for clinical risk stratification rely on labeled training data. Collecting this data is challenging for clinical conditions where only a small percentage of patients experience adverse outcomes. We propose an unsupervised anomaly detection approach to risk stratify patients without the need of positively and negatively labeled training examples. High risk patients are identified without any expert knowledge using a minimum enclosing ball to find cases that lie in sparse regions of the feature space. When evaluated on data from patients admitted with acute coronary syndrome and patients undergoing inpatient surgical procedures, our approach was able to successfully identify individuals at increased risk of adverse endpoints in both populations. In some cases, unsupervised anomaly detection outperformed other machine learning methods that used additional knowledge in the form of labeled examples.
Cite
Text
Syed and Rubinfeld. "Unsupervised Risk Stratification in Clinical Datasets: Identifying Patients at Risk of Rare Outcomes." International Conference on Machine Learning, 2010.Markdown
[Syed and Rubinfeld. "Unsupervised Risk Stratification in Clinical Datasets: Identifying Patients at Risk of Rare Outcomes." International Conference on Machine Learning, 2010.](https://mlanthology.org/icml/2010/syed2010icml-unsupervised/)BibTeX
@inproceedings{syed2010icml-unsupervised,
title = {{Unsupervised Risk Stratification in Clinical Datasets: Identifying Patients at Risk of Rare Outcomes}},
author = {Syed, Zeeshan and Rubinfeld, Ilan},
booktitle = {International Conference on Machine Learning},
year = {2010},
pages = {1023-1030},
url = {https://mlanthology.org/icml/2010/syed2010icml-unsupervised/}
}