Towards Generalization Beyond Pointwise Learning: A Unified Information-Theoretic Perspective
Abstract
The recent surge in contrastive learning has intensified interest in understanding the generalization of non-pointwise learning paradigms. While information-theoretic analysis has achieved remarkable success in characterizing the generalization behavior of learning algorithms, its applicability is largely confined to pointwise learning; extensions to even the simplest pairwise settings remain unexplored because of the challenges posed by non-i.i.d. losses and dimensionality explosion. In this paper, we develop the first series of information-theoretic bounds that extend beyond pointwise scenarios, encompassing pointwise, pairwise, triplet, quadruplet, and higher-order scenarios within a unified framework. Specifically, our hypothesis-based bounds elucidate the generalization behavior of iterative and noisy learning algorithms via gradient covariance analysis, and our prediction-based bounds accurately estimate the generalization gap using computationally tractable low-dimensional information metrics. Comprehensive numerical studies demonstrate the effectiveness of our bounds in capturing the generalization dynamics across diverse learning scenarios.
Cite
Text
Dong et al. "Towards Generalization Beyond Pointwise Learning: A Unified Information-Theoretic Perspective." International Conference on Machine Learning, 2024.
Markdown
[Dong et al. "Towards Generalization Beyond Pointwise Learning: A Unified Information-Theoretic Perspective." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/dong2024icml-generalization/)
BibTeX
@inproceedings{dong2024icml-generalization,
title = {{Towards Generalization Beyond Pointwise Learning: A Unified Information-Theoretic Perspective}},
author = {Dong, Yuxin and Gong, Tieliang and Chen, Hong and He, Zhongjiang and Li, Mengxiang and Song, Shuangyong and Li, Chen},
booktitle = {International Conference on Machine Learning},
year = {2024},
pages = {11311--11345},
volume = {235},
url = {https://mlanthology.org/icml/2024/dong2024icml-generalization/}
}