H3R: Hybrid Multi-View Correspondence for Generalizable 3D Reconstruction

Jia, Heng; Zhu, Linchao; Zhao, Na

ICCV 2025 pp. 7655-7665

Abstract

Despite recent advances in feed-forward 3D Gaussian Splatting, generalizable 3D reconstruction remains challenging, particularly in multi-view correspondence modeling. Existing approaches face a fundamental trade-off: explicit methods achieve geometric precision but struggle with ambiguous regions, while implicit methods provide robustness but suffer from slow convergence. We present H3R, a hybrid framework that addresses this limitation by integrating volumetric latent fusion with attention-based feature aggregation. Our framework consists of two complementary components: an efficient latent volume that enforces geometric consistency through epipolar constraints, and a camera-aware Transformer that leverages Plücker coordinates for adaptive correspondence refinement. By integrating both paradigms, our approach enhances generalization while converging 2× faster than existing methods. Furthermore, we show that spatial-aligned foundation models (e.g., SD-VAE) substantially outperform semantic-aligned models (e.g., DINOv2), resolving the mismatch between semantic representations and spatial reconstruction requirements. Our method supports variable-number and high-resolution input views while demonstrating robust cross-dataset generalization. Extensive experiments show that our method achieves state-of-the-art performance across multiple benchmarks, with significant PSNR improvements of 0.59 dB, 1.06 dB, and 0.22 dB on the RealEstate10K, ACID, and DTU datasets, respectively. Code is available at https://github.com/JiaHeng-DLUT/H3R.
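For readers unfamiliar with the Plücker coordinates mentioned in the abstract, the following is a minimal sketch of how per-pixel ray embeddings of this kind are commonly computed for a pinhole camera. It is not taken from the H3R code base: the function name `plucker_rays`, the (direction, moment) channel order, the half-pixel offset, and the example intrinsics and pose are all assumptions made for illustration.

```python
import numpy as np

def plucker_rays(K, c2w, height, width):
    """Per-pixel Plücker coordinates (d, o x d) for a pinhole camera.

    K is a (3, 3) intrinsics matrix and c2w a (4, 4) camera-to-world pose.
    Returns an array of shape (height, width, 6). Hypothetical helper.
    """
    # Pixel centers in image coordinates (half-pixel offset is an assumption).
    u, v = np.meshgrid(np.arange(width) + 0.5, np.arange(height) + 0.5)
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)  # (H, W, 3)

    # Back-project pixels to camera-space directions, then rotate to world space.
    dirs = pix @ np.linalg.inv(K).T @ c2w[:3, :3].T
    dirs = dirs / np.linalg.norm(dirs, axis=-1, keepdims=True)

    # Every ray starts at the camera center; the moment is o x d.
    origin = np.broadcast_to(c2w[:3, 3], dirs.shape)
    moment = np.cross(origin, dirs)
    return np.concatenate([dirs, moment], axis=-1)

# Example inputs (made up for illustration).
K = np.array([[100.0, 0.0, 32.0],
              [0.0, 100.0, 24.0],
              [0.0, 0.0, 1.0]])
c2w = np.eye(4)
c2w[:3, 3] = [1.0, 2.0, 3.0]
rays = plucker_rays(K, c2w, height=48, width=64)
```

The moment `o x d` does not change when the origin slides along the ray, so the six channels describe the ray itself rather than a particular point on it, which is one reason this encoding is a popular way to condition attention layers on camera geometry.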

Cite

Text

Jia et al. "H3R: Hybrid Multi-View Correspondence for Generalizable 3D Reconstruction." International Conference on Computer Vision, 2025.

Markdown

[Jia et al. "H3R: Hybrid Multi-View Correspondence for Generalizable 3D Reconstruction." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/jia2025iccv-h3r/)

BibTeX

@inproceedings{jia2025iccv-h3r,
  title     = {{H3R: Hybrid Multi-View Correspondence for Generalizable 3D Reconstruction}},
  author    = {Jia, Heng and Zhu, Linchao and Zhao, Na},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {7655--7665},
  url       = {https://mlanthology.org/iccv/2025/jia2025iccv-h3r/}
}