Deep Hurdle Networks for Zero-Inflated Multi-Target Regression: Application to Multiple Species Abundance Estimation

Abstract

A key problem in computational sustainability is to understand the distribution of species across landscapes over time. This question gives rise to challenging large-scale prediction problems since (i) hundreds of species have to be simultaneously modeled and (ii) the survey data are usually inflated with zeros due to the absence of species for a large number of sites. The problem of tackling both issues simultaneously, which we refer to as the zero-inflated multi-target regression problem, has not been addressed by previous methods in statistics and machine learning. In this paper, we propose a novel deep model for the zero-inflated multi-target regression problem. To this end, we first model the joint distribution of multiple response variables as a multivariate probit model and then couple the positive outcomes with a multivariate log-normal distribution. By penalizing the difference between the two distributions’ covariance matrices, a link between both distributions is established. The whole model is cast as an end-to-end learning framework and we provide an efficient learning algorithm for our model that can be fully implemented on GPUs. We show that our model outperforms the existing state-of-the-art baselines on two challenging real-world species distribution datasets concerning bird and fish populations.

Cite

Text

Kong et al. "Deep Hurdle Networks for Zero-Inflated Multi-Target Regression: Application to Multiple Species Abundance Estimation." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/603

Markdown

[Kong et al. "Deep Hurdle Networks for Zero-Inflated Multi-Target Regression: Application to Multiple Species Abundance Estimation." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/kong2020ijcai-deep/) doi:10.24963/IJCAI.2020/603

BibTeX

@inproceedings{kong2020ijcai-deep,
  title     = {{Deep Hurdle Networks for Zero-Inflated Multi-Target Regression: Application to Multiple Species Abundance Estimation}},
  author    = {Kong, Shufeng and Bai, Junwen and Lee, Jae Hee and Chen, Di and Allyn, Andrew and Stuart, Michelle and Pinsky, Malin and Mills, Katherine and Gomes, Carla P.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {4375-4381},
  doi       = {10.24963/IJCAI.2020/603},
  url       = {https://mlanthology.org/ijcai/2020/kong2020ijcai-deep/}
}