Emergent Neural Network Mechanisms for Generalization to Objects in Novel Orientations
Abstract
The capability of Deep Neural Networks (DNNs) to recognize objects in orientations outside the training data distribution is not well understood. We investigate the limitations of DNNs’ generalization capacities by systematically inspecting DNNs' patterns of success and failure across out-of-distribution (OoD) orientations. We present evidence that DNNs (across architecture types, including convolutional neural networks and transformers) are capable of generalizing to objects in novel orientations, and we describe their generalization behaviors. Specifically, generalization strengthens when training the DNN with an increasing number of familiar objects, but only in orientations that involve 2D rotations of familiar orientations. We also hypothesize how this generalization behavior emerges from internal neural mechanisms – that neurons tuned to common features between familiar and unfamiliar objects enable out of distribution generalization – and present supporting data for this theory. The reproducibility of our findings across model architectures, as well as analogous prior studies on the brain, suggests that these orientation generalization behaviors, as well as the neural mechanisms that drive them, may be a feature of neural networks in general.
Cite
Text
Cooper et al. "Emergent Neural Network Mechanisms for Generalization to Objects in Novel Orientations." Transactions on Machine Learning Research, 2025.Markdown
[Cooper et al. "Emergent Neural Network Mechanisms for Generalization to Objects in Novel Orientations." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/cooper2025tmlr-emergent/)BibTeX
@article{cooper2025tmlr-emergent,
title = {{Emergent Neural Network Mechanisms for Generalization to Objects in Novel Orientations}},
author = {Cooper, Avi and Harari, Daniel and Sasaki, Tomotake and Madan, Spandan and Pfister, Hanspeter and Sinha, Pawan and Boix, Xavier},
journal = {Transactions on Machine Learning Research},
year = {2025},
url = {https://mlanthology.org/tmlr/2025/cooper2025tmlr-emergent/}
}