Comparing Apples and Oranges: Is Stitching Similarity a Load of Spheres?

Abstract

Model stitching is used in the literature to assess the extent to which models capture similar information. The intuition is that if two models classify samples in the same way, they must be capturing the same information. We construct a series of experiments to show that two models can make the same predictions but represent very different information. We therefore argue that unlike previously claimed, stitching cannot reflect the extent to which models represent or capture similar information. This paper draws the community’s attention to the need to correctly interpret the results of such functional similarity measures and highlights the need for similarity measures that capture informational similarity.

Cite

Text

Smith and Marcu. "Comparing Apples and Oranges: Is Stitching Similarity a Load of Spheres?." NeurIPS 2024 Workshops: SciForDL, 2024.

Markdown

[Smith and Marcu. "Comparing Apples and Oranges: Is Stitching Similarity a Load of Spheres?." NeurIPS 2024 Workshops: SciForDL, 2024.](https://mlanthology.org/neuripsw/2024/smith2024neuripsw-comparing/)

BibTeX

@inproceedings{smith2024neuripsw-comparing,
  title     = {{Comparing Apples and Oranges: Is Stitching Similarity a Load of Spheres?}},
  author    = {Smith, Damian and Marcu, Antonia},
  booktitle = {NeurIPS 2024 Workshops: SciForDL},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/smith2024neuripsw-comparing/}
}