FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications
Abstract
High-quality, pixel-level annotated datasets are crucial for training deep learning models, while their creation is often labor-intensive, time-consuming, and costly. Generative diffusion models have then gained prominence for producing synthetic datasets, yet existing text-to-data methods struggle with generating complex scenes involving multiple objects and intricate spatial arrangements. To address these limitations, we introduce FlexDataset, a framework that pioneers the composition-to-data (C2D) paradigm. FlexDataset generates high-fidelity synthetic datasets with versatile annotations, tailored for tasks like salient object detection, depth estimation, and segmentation. Leveraging a meticulously designed composition-to-image (C2I) framework, it offers precise positional and categorical control. Our Versatile Annotation Generation (VAG) Plan A further enhances efficiency by exploiting rich latent representations through tuned perception decoders, reducing annotation time by nearly fivefold. FlexDataset allows unlimited generation of customized, multi-instance and multi-category (MIMC) annotated data. Extensive experiments show that FlexDataset sets a new standard in synthetic dataset generation across multiple datasets and tasks, including zero-shot and long-tail scenarios.
Cite
Text
Yi-Ge and Shawn. "FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I9.33027Markdown
[Yi-Ge and Shawn. "FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/yige2025aaai-flexdataset/) doi:10.1609/AAAI.V39I9.33027BibTeX
@inproceedings{yige2025aaai-flexdataset,
title = {{FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications}},
author = {Yi-Ge, Ellen and Shawn, Leo},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {9481-9489},
doi = {10.1609/AAAI.V39I9.33027},
url = {https://mlanthology.org/aaai/2025/yige2025aaai-flexdataset/}
}