Logistic Regression with an Auxiliary Data Source

Liao, Xuejun; Xue, Ya; Carin, Lawrence

doi:10.1145/1102351.1102415

Logistic Regression with an Auxiliary Data Source

Xuejun Liao, Ya Xue, Lawrence Carin

ICML 2005 pp. 505-512

doi:10.1145/1102351.1102415 /icml/2005/liao2005icml-logistic/

Abstract

To achieve good generalization in supervised learning, the training and testing examples are usually required to be drawn from the same source distribution. In this paper we propose a method to relax this requirement in the context of logistic regression. Assuming Dp and Da are two sets of examples drawn from two mismatched distributions, where Da are fully labeled and Dp partially labeled, our objective is to complete the labels of Dp. We introduce an auxiliary variable μ for each example in Da to reflect its mismatch with Dp. Under an appropriate constraint the μ's are estimated as a byproduct, along with the classifier. We also present an active learning approach for selecting the labeled examples in Dp. The proposed algorithm, called "Migratory-Logit" or M-Logit, is demonstrated successfully on simulated as well as real data sets.

PDF ICML Semantic Scholar

Cite

Text

Liao et al. "Logistic Regression with an Auxiliary Data Source." International Conference on Machine Learning, 2005. doi:10.1145/1102351.1102415

Markdown

[Liao et al. "Logistic Regression with an Auxiliary Data Source." International Conference on Machine Learning, 2005.](https://mlanthology.org/icml/2005/liao2005icml-logistic/) doi:10.1145/1102351.1102415

BibTeX

@inproceedings{liao2005icml-logistic,
  title     = {{Logistic Regression with an Auxiliary Data Source}},
  author    = {Liao, Xuejun and Xue, Ya and Carin, Lawrence},
  booktitle = {International Conference on Machine Learning},
  year      = {2005},
  pages     = {505-512},
  doi       = {10.1145/1102351.1102415},
  url       = {https://mlanthology.org/icml/2005/liao2005icml-logistic/}
}