Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning
Abstract
With the rise of deep neural networks, especially in safety-critical applications, robustness and interpretability are crucial to ensure their trustworthiness. Recent advances in 3D-aware classifiers that map image features to volumetric representation of objects, rather than relying solely on 2D appearance, have greatly improved robustness on out-of-distribution (OOD) data. Such classifiers have not yet been studied from the perspective of interpretability. Meanwhile, current concept-based XAI methods often neglect OOD robustness. We aim to address both aspects with CAVE - Concept Aware Volumes for Explanations - a new direction that unifies interpretability and robustness in image classification. We design CAVE as a robust and inherently interpretable classifier that learns sparse concepts from 3D object representation. We further propose 3D Consistency (3D-C), a metric to measure spatial consistency of concepts. Unlike existing metrics that rely on human-annotated parts on images, 3D-C leverages ground-truth object meshes as a common surface to project and compare explanations across concept-based methods. CAVE achieves competitive classification performance while discovering consistent and meaningful concepts across images in various OOD settings.
Cite
Text
Pham et al. "Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning." International Conference on Learning Representations, 2026.Markdown
[Pham et al. "Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/pham2026iclr-interpretable/)BibTeX
@inproceedings{pham2026iclr-interpretable,
title = {{Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning}},
author = {Pham, Nhi and Jesslen, Artur and Schiele, Bernt and Kortylewski, Adam and Fischer, Jonas},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/pham2026iclr-interpretable/}
}