Towards Reliable Learning for High Stakes Applications

Abstract

In this paper, we focus on delivering reliable learning results for high stakes applications such as self-driving, financial investment and clinical diagnosis, where the accuracy of predictions is considered as a more crucial requirement than giving predictions for all query samples. We adopt the learning with reject option framework where the learning model only predict those samples which they convince to give the correct answer. However, for most prevailing deep learning predictors, the confidence estimated by the model themselves are far from reflecting the real generalization performance. To model the reliability of prediction concisely, we propose an exploratory solution called GALVE (Generative Adversarial Learning with Variance Expansion) which adopts generative adversarial learning to implicitly measure the region where the model achieve good generalization performance. By applying GALVE to measure the reliability of predictions, we achieved an error rate less than half of which straightforwardly measured by confidence in CIFAR10 and SVHN computer vision tasks.

Cite

Text

Gao et al. "Towards Reliable Learning for High Stakes Applications." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33013614

Markdown

[Gao et al. "Towards Reliable Learning for High Stakes Applications." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/gao2019aaai-reliable/) doi:10.1609/AAAI.V33I01.33013614

BibTeX

@inproceedings{gao2019aaai-reliable,
  title     = {{Towards Reliable Learning for High Stakes Applications}},
  author    = {Gao, Jinyang and Yao, Junjie and Shao, Yingxia},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {3614-3621},
  doi       = {10.1609/AAAI.V33I01.33013614},
  url       = {https://mlanthology.org/aaai/2019/gao2019aaai-reliable/}
}