Differentially Private Hypothesis Transfer Learning

Abstract

In recent years, the focus of machine learning has been shifting to the paradigm of transfer learning where the data distribution in the target domain differs from that in the source domain. This is a prevalent setting in real-world classification problems and numerous well-established theoretical results in the classical supervised learning paradigm will break down under this setting. In addition, the increasing privacy protection awareness restricts access to source domain samples and poses new challenges for the development of privacy-preserving transfer learning algorithms. In this paper, we propose a novel differentially private multiple-source hypothesis transfer learning method for logistic regression. The target learner operates on differentially private hypotheses and importance weighting information from the sources to construct informative Gaussian priors for its logistic regression model. By leveraging a publicly available auxiliary data set, the importance weighting information can be used to determine the relationship between the source domain and the target domain without leaking source data privacy. Our approach provides a robust performance boost even when high quality labeled samples are extremely scarce in the target data set. The extensive experiments on two real-world data sets confirm the performance improvement of our approach over several baselines. Data related to this paper is available at: http://qwone.com/~jason/20Newsgroups/ and https://www.cs.jhu.edu/~mdredze/datasets/sentiment/index2.html .

Cite

Text

Wang et al. "Differentially Private Hypothesis Transfer Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018. doi:10.1007/978-3-030-10928-8_48

Markdown

[Wang et al. "Differentially Private Hypothesis Transfer Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018.](https://mlanthology.org/ecmlpkdd/2018/wang2018ecmlpkdd-differentially/) doi:10.1007/978-3-030-10928-8_48

BibTeX

@inproceedings{wang2018ecmlpkdd-differentially,
  title     = {{Differentially Private Hypothesis Transfer Learning}},
  author    = {Wang, Yang and Gu, Quanquan and Brown, Donald E.},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2018},
  pages     = {811-826},
  doi       = {10.1007/978-3-030-10928-8_48},
  url       = {https://mlanthology.org/ecmlpkdd/2018/wang2018ecmlpkdd-differentially/}
}