Dynamic Flows on Curved Space Generated by Labeled Data

Abstract

The scarcity of labeled data is a long-standing challenge for many machine learning tasks. We propose our gradient flow method to leverage the existing dataset (i.e., source) to generate new samples that are close to the dataset of interest (i.e., target). We lift both datasets to the space of probability distributions on the feature-Gaussian manifold, and then develop a gradient flow method that minimizes the maximum mean discrepancy loss. To perform the gradient flow of distributions on the curved feature-Gaussian space, we unravel the Riemannian structure of the space and compute explicitly the Riemannian gradient of the loss function induced by the optimal transport metric. For practical applications, we also propose a discretized flow, and provide conditional results guaranteeing the global convergence of the flow to the optimum. We illustrate the results of our proposed gradient flow method on several real-world datasets and show our method can improve the accuracy of classification models in transfer learning settings.

Cite

Text

Hua et al. "Dynamic Flows on Curved Space Generated by Labeled Data." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/423

Markdown

[Hua et al. "Dynamic Flows on Curved Space Generated by Labeled Data." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/hua2023ijcai-dynamic/) doi:10.24963/IJCAI.2023/423

BibTeX

@inproceedings{hua2023ijcai-dynamic,
  title     = {{Dynamic Flows on Curved Space Generated by Labeled Data}},
  author    = {Hua, Xinru and Nguyen, Truyen and Le, Tam and Blanchet, Jose H. and Nguyen, Viet Anh},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {3803-3811},
  doi       = {10.24963/IJCAI.2023/423},
  url       = {https://mlanthology.org/ijcai/2023/hua2023ijcai-dynamic/}
}