An Information-Geometric Distance on the Space of Tasks
Abstract
We compute a distance between tasks modeled as joint distributions on data and labels. We develop a stochastic process that transports the marginal on the data of the source task to that of the target task, and simultaneously updates the weights of a classifier initialized on the source task to track this evolving data distribution. The distance between two tasks is defined to be the shortest path on the Riemannian manifold of the conditional distribution of labels given data as the weights evolve. We derive connections of this distance with Rademacher complexity-based generalization bounds; distance between tasks computed using our method can be interpreted as the trajectory in weight space that keeps the generalization gap constant as the task distribution changes from the source to the target. Experiments on image classification datasets verify that this task distance helps predict the performance of transfer learning and shows consistency with fine-tuning results.
Cite
Text
Gao and Chaudhari. "An Information-Geometric Distance on the Space of Tasks." NeurIPS 2020 Workshops: DL-IG, 2020.Markdown
[Gao and Chaudhari. "An Information-Geometric Distance on the Space of Tasks." NeurIPS 2020 Workshops: DL-IG, 2020.](https://mlanthology.org/neuripsw/2020/gao2020neuripsw-informationgeometric/)BibTeX
@inproceedings{gao2020neuripsw-informationgeometric,
title = {{An Information-Geometric Distance on the Space of Tasks}},
author = {Gao, Yansong and Chaudhari, Pratik Anil},
booktitle = {NeurIPS 2020 Workshops: DL-IG},
year = {2020},
url = {https://mlanthology.org/neuripsw/2020/gao2020neuripsw-informationgeometric/}
}