Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions

Abstract

This paper investigates the asymptotic distribution of the maximum-likelihood estimate (MLE) in multinomial logistic models in the high-dimensional regime where dimension and sample size are of the same order. While classical large-sample theory provides asymptotic normality of the MLE under certain conditions, such classical results are expected to fail in high-dimensions as documented for the binary logistic case in the seminal work of Sur and Candès [2019]. We address this issue in classification problems with 3 or more classes, by developing asymptotic normality and asymptotic chi-square results for the multinomial logistic MLE (also known as cross-entropy minimizer) on null covariates. Our theory leads to a new methodology to test the significance of a given feature. Extensive simulation studies on synthetic data corroborate these asymptotic results and confirm the validity of proposed p-values for testing the significance of a given feature.

Cite

Text

Tan and Bellec. "Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions." Neural Information Processing Systems, 2023.

Markdown

[Tan and Bellec. "Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/tan2023neurips-multinomial/)

BibTeX

@inproceedings{tan2023neurips-multinomial,
  title     = {{Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions}},
  author    = {Tan, Kai and Bellec, Pierre C},
  booktitle = {Neural Information Processing Systems},
  year      = {2023},
  url       = {https://mlanthology.org/neurips/2023/tan2023neurips-multinomial/}
}