A Principal Components Approach to Combining Regression Estimates
Abstract
The goal of combining the predictions of multiple learned models is to form an improved estimator. A combining strategy must be able to robustly handle the inherent correlation, or multicollinearity, of the learned models while identifying the unique contributions of each. A progression of existing approaches and their limitations with respect to these two issues are discussed. A new approach, PCR*, based on principal components regression is proposed to address these limitations. An evaluation of the new approach on a collection of domains reveals that (1) PCR* was the most robust combining method, (2) correlation could be handled without eliminating any of the learned models, and (3) the principal components of the learned models provided a continuum of “regularized” weights from which PCR* could choose.
Cite
Text
Merz and Pazzani. "A Principal Components Approach to Combining Regression Estimates." Machine Learning, 1999. doi:10.1023/A:1007507221352Markdown
[Merz and Pazzani. "A Principal Components Approach to Combining Regression Estimates." Machine Learning, 1999.](https://mlanthology.org/mlj/1999/merz1999mlj-principal/) doi:10.1023/A:1007507221352BibTeX
@article{merz1999mlj-principal,
title = {{A Principal Components Approach to Combining Regression Estimates}},
author = {Merz, Christopher J. and Pazzani, Michael J.},
journal = {Machine Learning},
year = {1999},
pages = {9-32},
doi = {10.1023/A:1007507221352},
volume = {36},
url = {https://mlanthology.org/mlj/1999/merz1999mlj-principal/}
}