X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation
Abstract
Recent advancements in automatic 3D avatar generation guided by text have made significant progress. However, existing methods have limitations such as oversaturation and low-quality output. To address these challenges, we propose X-Oscar, a progressive framework for generating high-quality animatable avatars from text prompts. It follows a sequential "Geometry→Texture→Animation" paradigm, simplifying optimization through step-by-step generation. To tackle oversaturation, we introduce Adaptive Variational Parameter (AVP), representing avatars as an adaptive distribution during training. Additionally, we present Avatar-aware Score Distillation Sampling (ASDS), a novel technique that incorporates avatar-aware noise into rendered images for improved generation quality during optimization. Extensive evaluations confirm the superiority of X-Oscar over existing text-to-3D and text-to-avatar approaches. Our anonymous project page: https://anonymous1440.github.io/.
Cite
Text
Ma et al. "X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation." International Conference on Machine Learning, 2024.Markdown
[Ma et al. "X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/ma2024icml-xoscar/)BibTeX
@inproceedings{ma2024icml-xoscar,
title = {{X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation}},
author = {Ma, Yiwei and Lin, Zhekai and Ji, Jiayi and Fan, Yijun and Sun, Xiaoshuai and Ji, Rongrong},
booktitle = {International Conference on Machine Learning},
year = {2024},
pages = {33826-33838},
volume = {235},
url = {https://mlanthology.org/icml/2024/ma2024icml-xoscar/}
}