GO Hessian for Expectation-Based Objectives

Abstract

An unbiased low-variance gradient estimator, termed GO gradient, was proposed recently for expectation-based objectives E_q_γ(y) [f(y)], where the random variable (RV) y may be drawn from a stochastic computation graph (SCG) with continuous (non-reparameterizable) internal nodes and continuous/discrete leaves. Based on the GO gradient, we present for E_q_γ(y) [f(y)] an unbiased low-variance Hessian estimator, named GO Hessian, which contains the deterministic Hessian as a special case. Considering practical implementation, we reveal that the GO Hessian in expectation obeys the chain rule and is therefore easy-to-use with auto-differentiation and Hessian-vector products, enabling efficient cheap exploitation of curvature information over deep SCGs. As representative examples, we present the GO Hessian for non-reparameterizable gamma and negative binomial RVs/nodes. Leveraging the GO Hessian, we develop a new second-order method for E_q_γ(y) [f(y)], with challenging experiments conducted to verify its effectiveness and efficiency.

Cite

Text

Cong et al. "GO Hessian for Expectation-Based Objectives." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I13.17432

Markdown

[Cong et al. "GO Hessian for Expectation-Based Objectives." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/cong2021aaai-go/) doi:10.1609/AAAI.V35I13.17432

BibTeX

@inproceedings{cong2021aaai-go,
  title     = {{GO Hessian for Expectation-Based Objectives}},
  author    = {Cong, Yulai and Zhao, Miaoyun and Li, Jianqiao and Chen, Junya and Carin, Lawrence},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {12060-12068},
  doi       = {10.1609/AAAI.V35I13.17432},
  url       = {https://mlanthology.org/aaai/2021/cong2021aaai-go/}
}