Lakhotia, Kushal

1 publications

ICLR 2022 Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed