Discrete Diffusion for Bundle Construction
Abstract
As a central task in product bundling, bundle construction aims to select a subset of items from large item catalogs to build an entire bundle or, more practically, complete a partial bundle. Existing methods often rely on the sequential construction paradigm that predicts items one at a time, nevertheless, this paradigm is fundamentally unsuitable for the essentially unordered bundles. In contrast, non-sequential methods model a bundle as a set, but still face two dimensionality curses: the combinatorial space grows exponentially with both bundle length and catalog size. Accordingly, we identify two technical challenges: 1) how to effectively and efficiently model the higher-order intra-bundle relations with the growth of bundle length; and 2) how to learn item representations that remain discriminative while avoiding search directly over a huge item catalog. To address these challenges, we propose DDBC, a Discrete Diffusion model for Bundle Construction. DDBC leverages a masked denoising diffusion process to build bundles non-sequentially, capturing joint dependencies among items without relying on a fixed decoding order, thereby partially alleviating the combinatorial challenge introduced by increasing bundle length. To mitigate the curse of large catalog size, we integrate residual vector quantization (RVQ), which compresses item embeddings into discrete codes drawn from a globally shared codebook, enabling more efficient search while retaining semantic granularity. We evaluate our method on real-world bundle construction datasets of music playlist continuation and fashion outfit completion, and the experimental results show that DDBC achieves more than 100\% relative performance improvements over state-of-the-art baselines on long-bundle datasets, with competitive performance on short bundles. Ablation and model studies further confirm the effectiveness of both the diffusion backbone and the RVQ tokenizer, with gains becoming more pronounced for longer bundles and larger catalogs. Our code is available at https://github.com/LiAi16/DDBC.
Cite
Text
Tu et al. "Discrete Diffusion for Bundle Construction." International Conference on Learning Representations, 2026.Markdown
[Tu et al. "Discrete Diffusion for Bundle Construction." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/tu2026iclr-discrete/)BibTeX
@inproceedings{tu2026iclr-discrete,
title = {{Discrete Diffusion for Bundle Construction}},
author = {Tu, Teng and Li, Ai and Ma, Yunshan and Xu, Shuo and Liu, Xiaohao and Ma, Haokai and Pang, Liang and Chua, Tat-Seng},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/tu2026iclr-discrete/}
}