Shibuya, Takashi

13 publications

CVPR 2025 Classifier-Free Guidance Inside the Attraction Basin May Cause Memorization Anubhav Jain, Yuya Kobayashi, Takashi Shibuya, Yuhta Takida, Nasir Memon, Julian Togelius, Yuki Mitsufuji
CVPRW 2025 Dyadic Mamba: Long-Term Dyadic Human Motion Synthesis Julian Tanke, Takashi Shibuya, Kengo Uchida, Koichi Saito, Yuki Mitsufuji
ICLR 2025 HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning Ayano Hiranaka, Shang-Fu Chen, Chieh-Hsin Lai, Dongjun Kim, Naoki Murata, Takashi Shibuya, Wei-Hsiang Liao, Shao-Hua Sun, Yuki Mitsufuji
CVPR 2025 MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Ho Kei Cheng, Masato Ishii, Akio Hayakawa, Takashi Shibuya, Alexander Schwing, Yuki Mitsufuji
ICLR 2025 MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation Akio Hayakawa, Masato Ishii, Takashi Shibuya, Yuki Mitsufuji
CVPRW 2025 MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training Kengo Uchida, Takashi Shibuya, Yuhta Takida, Naoki Murata, Julian Tanke, Shusuke Takahashi, Yuki Mitsufuji
ICLR 2025 SoundCTM: Unifying Score-Based and Consistency Models for Full-Band Text-to-Sound Generation Koichi Saito, Dongjun Kim, Takashi Shibuya, Chieh-Hsin Lai, Zhi Zhong, Yuhta Takida, Yuki Mitsufuji
ICCV 2025 TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models Christian Simon, Masato Ishii, Akio Hayakawa, Zhi Zhong, Shusuke Takahashi, Takashi Shibuya, Yuki Mitsufuji
NeurIPS 2024 GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping Junyoung Seo, Kazumi Fukuda, Takashi Shibuya, Takuya Narihira, Naoki Murata, Shoukang Hu, Chieh-Hsin Lai, Seungryong Kim, Yuki Mitsufuji
TMLR 2024 HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji
ICLR 2024 SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer Yuhta Takida, Masaaki Imaizumi, Takashi Shibuya, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji
NeurIPSW 2024 SoundCTM: Uniting Score-Based and Consistency Models for Text-to-Sound Generation Koichi Saito, Dongjun Kim, Takashi Shibuya, Chieh-Hsin Lai, Zhi Zhong, Yuhta Takida, Yuki Mitsufuji
ICML 2022 SQ-VAE: Variational Bayes on Discrete Representation with Self-Annealed Stochastic Quantization Yuhta Takida, Takashi Shibuya, Weihsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji