EdgeRunner: Auto-Regressive Auto-Encoder for Artistic Mesh Generation

Abstract

Current auto-regressive mesh generation methods suffer from issues such as incompleteness, insufficient detail, and poor generalization. In this paper, we propose an Auto-regressive Auto-encoder (ArAE) model capable of generating high-quality 3D meshes with up to 4,000 faces at a spatial resolution of $512^3$. We introduce a novel mesh tokenization algorithm that efficiently compresses triangular meshes into 1D token sequences, significantly enhancing training efficiency. Furthermore, our model compresses variable-length triangular meshes into a fixed-length latent space, enabling training latent diffusion models for better generalization. Extensive experiments demonstrate the superior quality, diversity, and generalization capabilities of our model in both point cloud and image-conditioned mesh generation tasks.

Cite

Text

Tang et al. "EdgeRunner: Auto-Regressive Auto-Encoder for Artistic Mesh Generation." International Conference on Learning Representations, 2025.

Markdown

[Tang et al. "EdgeRunner: Auto-Regressive Auto-Encoder for Artistic Mesh Generation." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/tang2025iclr-edgerunner/)

BibTeX

@inproceedings{tang2025iclr-edgerunner,
  title     = {{EdgeRunner: Auto-Regressive Auto-Encoder for Artistic Mesh Generation}},
  author    = {Tang, Jiaxiang and Li, Zhaoshuo and Hao, Zekun and Liu, Xian and Zeng, Gang and Liu, Ming-Yu and Zhang, Qinsheng},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/tang2025iclr-edgerunner/}
}