A Principal Components Approach to Combining Regression Estimates

Abstract

The goal of combining the predictions of multiple learned models is to form an improved estimator. A combining strategy must be able to robustly handle the inherent correlation, or multicollinearity, of the learned models while identifying the unique contributions of each. A progression of existing approaches and their limitations with respect to these two issues are discussed. A new approach, PCR*, based on principal components regression is proposed to address these limitations. An evaluation of the new approach on a collection of domains reveals that (1) PCR* was the most robust combining method, (2) correlation could be handled without eliminating any of the learned models, and (3) the principal components of the learned models provided a continuum of “regularized” weights from which PCR* could choose.

Cite

Text

Merz and Pazzani. "A Principal Components Approach to Combining Regression Estimates." Machine Learning, 1999. doi:10.1023/A:1007507221352

Markdown

[Merz and Pazzani. "A Principal Components Approach to Combining Regression Estimates." Machine Learning, 1999.](https://mlanthology.org/mlj/1999/merz1999mlj-principal/) doi:10.1023/A:1007507221352

BibTeX

@article{merz1999mlj-principal,
  title     = {{A Principal Components Approach to Combining Regression Estimates}},
  author    = {Merz, Christopher J. and Pazzani, Michael J.},
  journal   = {Machine Learning},
  year      = {1999},
  pages     = {9-32},
  doi       = {10.1023/A:1007507221352},
  volume    = {36},
  url       = {https://mlanthology.org/mlj/1999/merz1999mlj-principal/}
}