Semi-Supervised Discriminant Analysis via CCCP

Abstract

Linear discriminant analysis (LDA) is commonly used for dimensionality reduction. In real-world applications where labeled data are scarce, LDA does not work very well. However, unlabeled data are often available in large quantities. We propose a novel semi-supervised discriminant analysis algorithm called SSDA $_{\mathit{CCCP}}$ . We utilize unlabeled data to maximize an optimality criterion of LDA and use the constrained concave-convex procedure to solve the optimization problem. The optimization procedure leads to estimation of the class labels for the unlabeled data. We propose a novel confidence measure for selecting those unlabeled data points with high confidence. The selected unlabeled data can then be used to augment the original labeled data set for performing LDA. We also propose a variant of SSDA $_{\mathit{CCCP}}$ , called M-SSDA $_{\mathit{CCCP}}$ , which adopts the manifold assumption to utilize the unlabeled data. Extensive experiments on many benchmark data sets demonstrate the effectiveness of our proposed methods.

Cite

Text

Zhang and Yeung. "Semi-Supervised Discriminant Analysis via CCCP." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008. doi:10.1007/978-3-540-87481-2_42

Markdown

[Zhang and Yeung. "Semi-Supervised Discriminant Analysis via CCCP." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008.](https://mlanthology.org/ecmlpkdd/2008/zhang2008ecmlpkdd-semisupervised/) doi:10.1007/978-3-540-87481-2_42

BibTeX

@inproceedings{zhang2008ecmlpkdd-semisupervised,
  title     = {{Semi-Supervised Discriminant Analysis via CCCP}},
  author    = {Zhang, Yu and Yeung, Dit-Yan},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2008},
  pages     = {644-659},
  doi       = {10.1007/978-3-540-87481-2_42},
  url       = {https://mlanthology.org/ecmlpkdd/2008/zhang2008ecmlpkdd-semisupervised/}
}