ML Anthology
Richard, Alexander
18 publications
ICCV
2025
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions
Aggelina Chatziagapi, Louis-Philippe Morency, Hongyu Gong, Michael Zollhöfer, Dimitris Samaras, Alexander Richard
ICML
2025
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models
Susan Liang, Dejan Markovic, Israel D. Gebru, Steven Krenn, Todd Keebler, Jacob Sandakly, Frank Yu, Samuel Hassel, Chenliang Xu, Alexander Richard
ICLR
2025
FlowDec: A Flow-Based Full-Band General Audio Codec with High Perceptual Quality
Simon Welker, Matthew Le, Ricky T. Q. Chen, Wei-Ning Hsu, Timo Gerkmann, Alexander Richard, Yi-Chiao Wu
CVPR
2025
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning
Jihyun Lee, Weipeng Xu, Alexander Richard, Shih-En Wei, Shunsuke Saito, Shaojie Bai, Te-Li Wang, Minhyuk Sung, Tae-Kyun Kim, Jason Saragih
CVPR
2025
SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding
Mingfei Chen, Israel D. Gebru, Ishwarya Ananthabhotla, Christian Richardt, Dejan Markovic, Jake Sandakly, Steven Krenn, Todd Keebler, Eli Shlizerman, Alexander Richard
CVPR
2024
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Evonne Ng, Javier Romero, Timur Bagautdinov, Shaojie Bai, Trevor Darrell, Angjoo Kanazawa, Alexander Richard
ECCV
2024
Modeling and Driving Human Body Soundfields Through Acoustic Primitives
Chao Huang, Dejan Markovic, Chenliang Xu, Alexander Richard
CVPR
2024
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark
Ziyang Chen, Israel D. Gebru, Christian Richardt, Anurag Kumar, William Laney, Andrew Owens, Alexander Richard
CVPR
2023
Novel-View Acoustic Synthesis
Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea Vedaldi
NeurIPS
2023
Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio
Xudong Xu, Dejan Markovic, Jacob Sandakly, Todd Keebler, Steven Krenn, Alexander Richard
CVPR
2022
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
Karren Yang, Dejan Marković, Steven Krenn, Vasu Agrawal, Alexander Richard
ECCV
2022
LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space
Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, Shih-En Wei, Jason Saragih, Otmar Hilliges
WACV
2021
Audio- and Gaze-Driven Facial Animation of Codec Avatars
Alexander Richard, Colin Lea, Shugao Ma, Jurgen Gall, Fernando de la Torre, Yaser Sheikh
ICCV
2021
MeshTalk: 3D Face Animation from Speech Using Cross-Modality Disentanglement
Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando de la Torre, Yaser Sheikh
ICLR
2021
Neural Synthesis of Binaural Speech from Mono Audio
Alexander Richard, Dejan Markovic, Israel D. Gebru, Steven Krenn, Gladstone Alexander Butler, Fernando Torre, Yaser Sheikh
ICCVW
2019
Enhancing Temporal Action Localization with Transfer Learning from Action Recognition
Ahsan Iqbal, Alexander Richard, Juergen Gall
CVPR
2017
Weakly Supervised Action Learning with RNN Based Fine-to-Coarse Modeling
Alexander Richard, Hilde Kuehne, Juergen Gall
CVPR
2016
Temporal Action Detection Using a Statistical Language Model
Alexander Richard, Juergen Gall