Semi-Supervised Multi-Task Regression

Abstract

Labeled data are needed for many machine learning applications but the amount available in some applications is scarce. Semi-supervised learning and multi-task learning are two of the approaches that have been proposed to alleviate this problem. In this paper, we seek to integrate these two approaches for regression applications. We first propose a new supervised multi-task regression method called SMTR, which is based on Gaussian processes (GP) with the assumption that the kernel parameters for all tasks share a common prior. We then incorporate unlabeled data into SMTR by changing the kernel function of the GP prior to a data-dependent kernel function, resulting in a semi-supervised extension of SMTR, called SSMTR. Moreover, we incorporate pairwise information into SSMTR to further boost the learning performance for applications in which such information is available. Experiments conducted on two commonly used data sets for multi-task regression demonstrate the effectiveness of our methods.

Cite

Text

Zhang and Yeung. "Semi-Supervised Multi-Task Regression." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009. doi:10.1007/978-3-642-04174-7_40

Markdown

[Zhang and Yeung. "Semi-Supervised Multi-Task Regression." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009.](https://mlanthology.org/ecmlpkdd/2009/zhang2009ecmlpkdd-semisupervised/) doi:10.1007/978-3-642-04174-7_40

BibTeX

@inproceedings{zhang2009ecmlpkdd-semisupervised,
  title     = {{Semi-Supervised Multi-Task Regression}},
  author    = {Zhang, Yu and Yeung, Dit-Yan},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2009},
  pages     = {617-631},
  doi       = {10.1007/978-3-642-04174-7_40},
  url       = {https://mlanthology.org/ecmlpkdd/2009/zhang2009ecmlpkdd-semisupervised/}
}