Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions

Abstract

The class of finite mixtures of multivariate Bernoulli distributions is known to be nonidentifiable; that is, different values of the mixture parameters can correspond to exactly the same probability distribution. In principle, this would mean that sample estimates using this model would give rise to different interpretations. We give empirical support to the fact that estimation of this class of mixtures can still produce meaningful results in practice, thus lessening the importance of the identifiability problem. We also show that the expectation-maximization algorithm is guaranteed to converge to a proper maximum likelihood estimate, owing to a property of the log-likelihood surface. Experiments with synthetic data sets show that an original generating distribution can be estimated from a sample. Experiments with an electropalatography data set show important structure in the data.

Cite

Text

Carreira-Perpiñán and Renals. "Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions." Neural Computation, 2000. doi:10.1162/089976600300015925

Markdown

[Carreira-Perpiñán and Renals. "Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions." Neural Computation, 2000.](https://mlanthology.org/neco/2000/carreiraperpinan2000neco-practical/) doi:10.1162/089976600300015925

BibTeX

@article{carreiraperpinan2000neco-practical,
  title     = {{Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions}},
  author    = {Carreira-Perpiñán, Miguel Á. and Renals, Steve},
  journal   = {Neural Computation},
  year      = {2000},
  pages     = {141-152},
  doi       = {10.1162/089976600300015925},
  volume    = {12},
  url       = {https://mlanthology.org/neco/2000/carreiraperpinan2000neco-practical/}
}