On Modeling Profiles Instead of Values
Abstract
We consider the problem of estimating the distribution underlying an observed sample of data. Instead of maximum likelihood, which maximizes the probability of the observed values, we propose a different estimate, the high-profile distribution, which maximizes the probability of the observed profile---the number of symbols appearing any given number of times. We determine the high-profile distribution of several data samples, establish some of its general properties, and show that when the number of distinct symbols observed is small compared to the data size, the high-profile and maximum-likelihood distributions are roughly the same, but when the number of symbols is large, the distributions differ, and high-profile better explains the data.
Cite
Text
Orlitsky et al. "On Modeling Profiles Instead of Values." Conference on Uncertainty in Artificial Intelligence, 2004.Markdown
[Orlitsky et al. "On Modeling Profiles Instead of Values." Conference on Uncertainty in Artificial Intelligence, 2004.](https://mlanthology.org/uai/2004/orlitsky2004uai-modeling/)BibTeX
@inproceedings{orlitsky2004uai-modeling,
title = {{On Modeling Profiles Instead of Values}},
author = {Orlitsky, Alon and Santhanam, Narayana P. and Viswanathan, Krishnamurthy and Zhang, Junan},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {2004},
pages = {426-435},
url = {https://mlanthology.org/uai/2004/orlitsky2004uai-modeling/}
}