Estimation of the Data Region Using Extreme-Value Distributions

Abstract

In the field of pattern recognition or outlier detection, it is necessary to estimate the region where data of a particular class are generated. In other words, it is required to accurately estimate the support of the distribution that generates the data. Considering the 1-dimensional distribution whose support is a finite interval, the data region is estimated effectively by the maximum value and the minimum value in the samples. Limiting distributions of these values have been studied in the extreme-value theory in statistics. In this research, we propose a method to estimate the data region using the maximum value and the minimum value in the samples. We calculate the average loss of the estimator, and derive the optimally improved estimators for given loss functions.

Cite

Text

Watanabe and Watanabe. "Estimation of the Data Region Using Extreme-Value Distributions." International Conference on Algorithmic Learning Theory, 2004. doi:10.1007/978-3-540-30215-5_17

Markdown

[Watanabe and Watanabe. "Estimation of the Data Region Using Extreme-Value Distributions." International Conference on Algorithmic Learning Theory, 2004.](https://mlanthology.org/alt/2004/watanabe2004alt-estimation/) doi:10.1007/978-3-540-30215-5_17

BibTeX

@inproceedings{watanabe2004alt-estimation,
  title     = {{Estimation of the Data Region Using Extreme-Value Distributions}},
  author    = {Watanabe, Kazuho and Watanabe, Sumio},
  booktitle = {International Conference on Algorithmic Learning Theory},
  year      = {2004},
  pages     = {206-220},
  doi       = {10.1007/978-3-540-30215-5_17},
  url       = {https://mlanthology.org/alt/2004/watanabe2004alt-estimation/}
}