Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction

Cite

Text

Mu and Yang. "Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction." International Joint Conference on Artificial Intelligence, 2024.

Markdown

[Mu and Yang. "Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/mu2024ijcai-separate/)

BibTeX

@inproceedings{mu2024ijcai-separate,
  title     = {{Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction}},
  author    = {Mu, Zhaoxi and Yang, Xinyu},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {6415-6423},
  url       = {https://mlanthology.org/ijcai/2024/mu2024ijcai-separate/}
}