Enlightening Deep Neural Networks with Knowledge of Confounding Factors
Abstract
Deep learning techniques have demonstrated significant capacity in modeling some of the most challenging real world problems of high complexity. Despite the popularity of deep models, we still strive to better understand the underlying mechanism that drives their success. Motivated by observations that neurons in trained deep nets predict variation explaining factors indirectly related to the training tasks, we recognize that a deep network learns representations more general than the task at hand in order to disentangle impacts of multiple confounding factors governing the data, isolate the effects of the concerning factors, and optimize the given objective. Consequently, we propose to augment training of deep models with auxiliary information on explanatory factors of the data, in an effort to boost this disentanglement. Such deep networks, trained to comprehend data interactions and distributions more accurately, possess improved generalizability and compute better feature representations. Since pose is one of the most dominant confounding factors for object recognition, we adopt this principle to train a pose-aware deep convolutional neural network to learn both the class and pose of an object, so that it can make more informed classification decisions taking into account image variations induced by the object pose. We demonstrate that auxiliary pose information improves the classification accuracy in our experiments on Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR) tasks. This general principle is readily applicable to improve the recognition and classification performance in various deep-learning applications.
Cite
Text
Zhong and Ettinger. "Enlightening Deep Neural Networks with Knowledge of Confounding Factors." IEEE/CVF International Conference on Computer Vision Workshops, 2017. doi:10.1109/ICCVW.2017.131Markdown
[Zhong and Ettinger. "Enlightening Deep Neural Networks with Knowledge of Confounding Factors." IEEE/CVF International Conference on Computer Vision Workshops, 2017.](https://mlanthology.org/iccvw/2017/zhong2017iccvw-enlightening/) doi:10.1109/ICCVW.2017.131BibTeX
@inproceedings{zhong2017iccvw-enlightening,
title = {{Enlightening Deep Neural Networks with Knowledge of Confounding Factors}},
author = {Zhong, Yu and Ettinger, Gil J.},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2017},
pages = {1077-1086},
doi = {10.1109/ICCVW.2017.131},
url = {https://mlanthology.org/iccvw/2017/zhong2017iccvw-enlightening/}
}