ML Anthology
Authors
Search
About
Wang, Duomin
10 publications
ICLR
2026
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence
Zixin Yin
,
Xili Dai
,
Duomin Wang
,
Xianfang Zeng
,
Lionel Ni
,
Gang Yu
,
Heung-Yeung Shum
ICLR
2026
SpeakerVid-5m: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation
Youliang Zhang
,
Zhaoyang Li
,
Duomin Wang
,
Jiahe Zhang
,
Deyu Zhou
,
Zixin Yin
,
Xili Dai
,
Gang Yu
,
Xiu Li
ICLR
2026
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Zixin Yin
,
Xili Dai
,
Ling-Hao Chen
,
Deyu Zhou
,
Jianan Wang
,
Duomin Wang
,
Gang Yu
,
Lionel Ni
,
Lei Zhang
,
Heung-Yeung Shum
CVPR
2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
Deyu Zhou
,
Quan Sun
,
Yuang Peng
,
Kun Yan
,
Runpei Dong
,
Duomin Wang
,
Zheng Ge
,
Nan Duan
,
Xiangyu Zhang
ECCVW
2024
Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents
Duomin Wang
,
Bin Dai
,
Yu Deng
,
Baoyuan Wang
CVPR
2024
PICTURE: PhotorealistIC Virtual Try-on from UnconstRained dEsigns
Shuliang Ning
,
Duomin Wang
,
Yipeng Qin
,
Zirong Jin
,
Baoyuan Wang
,
Xiaoguang Han
ECCV
2024
Portrait4D-V2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Yu Deng
,
Duomin Wang
,
Baoyuan Wang
CVPR
2024
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis Using Synthetic Data
Yu Deng
,
Duomin Wang
,
Xiaohang Ren
,
Xingyu Chen
,
Baoyuan Wang
CVPR
2023
Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis
Duomin Wang
,
Yu Deng
,
Zixin Yin
,
Heung-Yeung Shum
,
Baoyuan Wang
ICCV
2023
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors
Zhentao Yu
,
Zixin Yin
,
Deyu Zhou
,
Duomin Wang
,
Finn Wong
,
Baoyuan Wang