UniRec: Unified Multimodal Encoding for LLM-Based Recommendations
Abstract
Large language models (LLMs) have recently shown promise for multimodal recommen- dation, particularly with text and image inputs. Yet real-world recommendation signals extend far beyond these modalities. To reflect this, we formalize recommendation features into four modalities: text, images, categorical features, and numerical attributes, and em- phasize unique challenges this heterogeneity poses for LLMs in understanding multimodal information. In particular, these challenges arise not only across modalities but also within them, as attributes (e.g., price, rating, time) may all be numeric yet carry distinct meanings. Beyond this intra-modality ambiguity, another major challenge is the nested structure of recommendation signals, where user histories are sequences of items, each carrying multiple attributes. To address these challenges, we propose UniRec, a unified multimodal encoder for LLM-based recommendation. UniRec first employs modality-specific encoders to produce consistent embeddings across heterogeneous signals. It then applies a triplet representa- tion—comprising attribute name, type, and value—to separate schema from raw inputs and preserve semantic distinctions. Finally, a hierarchical Q-Former models the nested structure of user interactions while maintaining their layered organization. On multiple real-world benchmarks, UniRec outperforms state-of-the-art multimodal and LLM-based recommenders by up to 15%, while extensive ablation studies further validate the contribu- tions of each component.
Cite
Text
Lei et al. "UniRec: Unified Multimodal Encoding for LLM-Based Recommendations." Transactions on Machine Learning Research, 2026.Markdown
[Lei et al. "UniRec: Unified Multimodal Encoding for LLM-Based Recommendations." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/lei2026tmlr-unirec/)BibTeX
@article{lei2026tmlr-unirec,
title = {{UniRec: Unified Multimodal Encoding for LLM-Based Recommendations}},
author = {Lei, Zijie and Feng, Tao and Hua, Zhigang and Xie, Yan and Lin, Guanyu and Yang, Shuang and Liu, Ge and You, Jiaxuan},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/lei2026tmlr-unirec/}
}