Learning Multi-View Neighborhood Preserving Projections
Abstract
We address the problem of metric learning for multi-view data, namely the construction of embedding projections from data in different representations into a shared feature space, such that the Euclidean distance in this space provides a meaningful within-view as well as between-view similarity. Our motivation stems from the problem of cross-media retrieval tasks, where the availability of a joint Euclidean distance function is a prerequisite to allow fast, in particular hashing-based, nearest neighbor queries. We formulate an objective function that expresses the intuitive concept that matching samples are mapped closely together in the output space, whereas non-matching samples are pushed apart, no matter in which view they are available. The resulting optimization problem is not convex, but it can be decomposed explicitly into a convex and a concave part, thereby allowing efficient optimization using the convex-concave procedure. Experiments on an image retrieval task show that nearest-neighbor based cross-view retrieval is indeed possible, and the proposed technique improves the retrieval accuracy over baseline techniques.
Cite
Text
Quadrianto and Lampert. "Learning Multi-View Neighborhood Preserving Projections." International Conference on Machine Learning, 2011.Markdown
[Quadrianto and Lampert. "Learning Multi-View Neighborhood Preserving Projections." International Conference on Machine Learning, 2011.](https://mlanthology.org/icml/2011/quadrianto2011icml-learning/)BibTeX
@inproceedings{quadrianto2011icml-learning,
title = {{Learning Multi-View Neighborhood Preserving Projections}},
author = {Quadrianto, Novi and Lampert, Christoph H.},
booktitle = {International Conference on Machine Learning},
year = {2011},
pages = {425-432},
url = {https://mlanthology.org/icml/2011/quadrianto2011icml-learning/}
}