X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation

Abstract

Recent advancements in automatic 3D avatar generation guided by text have made significant progress. However, existing methods have limitations such as oversaturation and low-quality output. To address these challenges, we propose X-Oscar, a progressive framework for generating high-quality animatable avatars from text prompts. It follows a sequential "Geometry→Texture→Animation" paradigm, simplifying optimization through step-by-step generation. To tackle oversaturation, we introduce Adaptive Variational Parameter (AVP), representing avatars as an adaptive distribution during training. Additionally, we present Avatar-aware Score Distillation Sampling (ASDS), a novel technique that incorporates avatar-aware noise into rendered images for improved generation quality during optimization. Extensive evaluations confirm the superiority of X-Oscar over existing text-to-3D and text-to-avatar approaches. Our anonymous project page: https://anonymous1440.github.io/.

Cite

Text

Ma et al. "X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation." International Conference on Machine Learning, 2024.

Markdown

[Ma et al. "X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/ma2024icml-xoscar/)

BibTeX

@inproceedings{ma2024icml-xoscar,
  title     = {{X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation}},
  author    = {Ma, Yiwei and Lin, Zhekai and Ji, Jiayi and Fan, Yijun and Sun, Xiaoshuai and Ji, Rongrong},
  booktitle = {International Conference on Machine Learning},
  year      = {2024},
  pages     = {33826-33838},
  volume    = {235},
  url       = {https://mlanthology.org/icml/2024/ma2024icml-xoscar/}
}