DiMeR: Disentangled Mesh Reconstruction Model with Normal-Only Geometry Training

Abstract

We propose DiMeR, a novel geometry-texture disentangled feed-forward model with 3D supervision for sparse-view mesh reconstruction. Existing methods confront two persistent obstacles: (i) textures can conceal geometric errors, i.e., visually plausible images can be rendered even with wrong geometry, producing multiple ambiguous optimization objectives in geometry-texture mixed solution space for similar objects; and (ii) prevailing mesh extraction methods are redundant, unstable, and lack 3D supervision. To solve these challenges, we rethink the inductive bias for mesh reconstruction. First, we disentangle the unified geometry-texture solution space, where a single input admits multiple feasible solutions, into geometry and texture spaces individually. Specifically, given that normal maps are strictly consistent with geometry and accurately capture surface variations, the normal maps serve as the only input for geometry prediction in DiMeR, while the texture is estimated from RGB images. Second, we streamline the algorithm of mesh extraction by eliminating modules with low performance/cost ratios and redesigning regularization losses with 3D supervision. Notably, DiMeR still accepts raw RGB images as input by leveraging foundation models for normal prediction. Extensive experiments demonstrate that DiMeR generalises across sparse‑views-3D, single‑image-3D, and text‑to‑3D tasks, consistently outperforming baselines. On the GSO and OmniObject3D datasets, DiMeR significantly reduces Chamfer Distance by more than 30%.

Cite

Text

Jiang et al. "DiMeR: Disentangled Mesh Reconstruction Model with Normal-Only Geometry Training." International Conference on Learning Representations, 2026.

Markdown

[Jiang et al. "DiMeR: Disentangled Mesh Reconstruction Model with Normal-Only Geometry Training." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/jiang2026iclr-dimer/)

BibTeX

@inproceedings{jiang2026iclr-dimer,
  title     = {{DiMeR: Disentangled Mesh Reconstruction Model with Normal-Only Geometry Training}},
  author    = {Jiang, Lutao and Lin, Jiantao and Chen, Kanghao and Ge, Wenhang and Yang, Xin and Jiang, Yifan and Lyu, Yuanhuiyi and Zheng, Xu and Jing, Li and Li, Yinchuan and Chen, Ying-Cong},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/jiang2026iclr-dimer/}
}