An Online Riemannian PCA for Stochastic Canonical Correlation Analysis
Abstract
We present an efficient stochastic algorithm (RSG+) for canonical correlation analysis (CCA) using a reparametrization of the projection matrices. We show how this reparametrization (into structured matrices), simple in hindsight, directly presents an opportunity to repurpose/adjust mature techniques for numerical optimization on Riemannian manifolds. Our developments nicely complement existing methods for this problem which either require $O(d^3)$ time complexity per iteration with $O(\frac{1}{\sqrt{t}})$ convergence rate (where $d$ is the dimensionality) or only extract the top $1$ component with $O(\frac{1}{t})$ convergence rate. In contrast, our algorithm offers a strict improvement for this classical problem: it achieves $O(d^2k)$ runtime complexity per iteration for extracting the top $k$ canonical components with $O(\frac{1}{t})$ convergence rate. While the paper primarily focuses on the formulation and technical analysis of its properties, our experiments show that the empirical behavior on common datasets is quite promising, We also explore a potential application in training fair models where the label of protected attribute is missing or otherwise unavailable.
Cite
Text
Meng et al. "An Online Riemannian PCA for Stochastic Canonical Correlation Analysis." Neural Information Processing Systems, 2021.Markdown
[Meng et al. "An Online Riemannian PCA for Stochastic Canonical Correlation Analysis." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/meng2021neurips-online/)BibTeX
@inproceedings{meng2021neurips-online,
title = {{An Online Riemannian PCA for Stochastic Canonical Correlation Analysis}},
author = {Meng, Zihang and Chakraborty, Rudrasis and Singh, Vikas},
booktitle = {Neural Information Processing Systems},
year = {2021},
url = {https://mlanthology.org/neurips/2021/meng2021neurips-online/}
}