Multi-View Learning of Word Embeddings via CCA
Abstract
Recently, there has been substantial interest in using large amounts of unlabeled data to learn word representations which can then be used as features in supervised classifiers for NLP tasks. However, most current approaches are slow to train, do not model context of the word, and lack theoretical grounding. In this paper, we present a new learning method, Low Rank Multi-View Learning (LR-MVL) which uses a fast spectral method to estimate low dimensional context-specific word representations from unlabeled data. These representation features can then be used with any supervised learner. LR-MVL is extremely fast, gives guaranteed convergence to a global optimum, is theoretically elegant, and achieves state-of-the-art performance on named entity recognition (NER) and chunking problems.
Cite
Text
Dhillon et al. "Multi-View Learning of Word Embeddings via CCA." Neural Information Processing Systems, 2011.Markdown
[Dhillon et al. "Multi-View Learning of Word Embeddings via CCA." Neural Information Processing Systems, 2011.](https://mlanthology.org/neurips/2011/dhillon2011neurips-multiview/)BibTeX
@inproceedings{dhillon2011neurips-multiview,
title = {{Multi-View Learning of Word Embeddings via CCA}},
author = {Dhillon, Paramveer and Foster, Dean P. and Ungar, Lyle H.},
booktitle = {Neural Information Processing Systems},
year = {2011},
pages = {199-207},
url = {https://mlanthology.org/neurips/2011/dhillon2011neurips-multiview/}
}