Greed Is Good: Correspondence Recovery for Unlabeled Linear Regression

Abstract

We consider the unlabeled linear regression reading as $\mathbf{Y} = \mathbf{\Pi}^{*}\mathbf{X}\mathbf{B}^* + \mathbf{W}$, where $\mathbf{\Pi}^{*}, \mathbf{B}^*$ and $\mathbf{W}$ represents missing (or incomplete) correspondence information, signals, and additive noise, respectively. Our goal is to perform data alignment between $\mathbf{Y}$ and $\mathbf{X}$, or equivalently, reconstruct the correspondence information encoded by $\mathbf{\Pi}^*$. Based on whether signal $\mathbf{B}^*$ is given a prior, we separately propose two greedy-selection-based estimators, which both reach the mini-max optimality. Compared with previous works, our work $(i)$ supports partial recovery of the correspondence information; and $(ii)$ applies to a general matrix family rather than the permutation matrices, to put more specifically, selection matrices, where multiple rows of $\mathbf{X}$ can correspond to the same row in $\mathbf{Y}$. Moreover, numerical experiments are provided to corroborate our claims.

Cite

Text

Zhang and Li. "Greed Is Good: Correspondence Recovery for Unlabeled Linear Regression." Uncertainty in Artificial Intelligence, 2023.

Markdown

[Zhang and Li. "Greed Is Good: Correspondence Recovery for Unlabeled Linear Regression." Uncertainty in Artificial Intelligence, 2023.](https://mlanthology.org/uai/2023/zhang2023uai-greed/)

BibTeX

@inproceedings{zhang2023uai-greed,
  title     = {{Greed Is Good: Correspondence Recovery for Unlabeled Linear Regression}},
  author    = {Zhang, Hang and Li, Ping},
  booktitle = {Uncertainty in Artificial Intelligence},
  year      = {2023},
  pages     = {2509-2518},
  volume    = {216},
  url       = {https://mlanthology.org/uai/2023/zhang2023uai-greed/}
}