TCNSpeech: A Community-Curated Speech Corpus for Sermons
Abstract
In this work we present TCNSpeech, a community-curated multispeaker sermon corpus for speech recognition tasks. It contains a total of 24 hours of English audio data recording, chunked and transcribed. The context of the dataset is domain-specific for sermons in Nigerian English accent and a use case for community data curation. The dataset will be made publicly available.
Cite
Text
Oyewusi et al. "TCNSpeech: A Community-Curated Speech Corpus for Sermons." ICLR 2022 Workshops: AfricaNLP, 2022.Markdown
[Oyewusi et al. "TCNSpeech: A Community-Curated Speech Corpus for Sermons." ICLR 2022 Workshops: AfricaNLP, 2022.](https://mlanthology.org/iclrw/2022/oyewusi2022iclrw-tcnspeech/)BibTeX
@inproceedings{oyewusi2022iclrw-tcnspeech,
title = {{TCNSpeech: A Community-Curated Speech Corpus for Sermons}},
author = {Oyewusi, Wuraola Fisayo and Ibejih, Sharon and Uzomah, Soromfe and Joseph, Elizabeth Mawutin and Cynthia, Jon and Ojemuyiwa, Folakunmi and Johnson-Onuigwe, Benedicta and Taiwo, Omolola and Akinpelumi, Akintunde and Adesina, Olabisi and Noutouglo, Ayodele and Adeoba, Adeola Adeleke and Akoh, Andrew and Nwachukwu, Chukwuemeka and Agbabiaje, Opeyemi and Falade, Itunu and Erhunmwunsee, Olukemi and Dada, Oluwatobiloba and Osibeluwo, Olúwatóbi David and Akene, Ehis and Akpan, Udim and Amadi-Emina, Moira and Marquis, Jaiyeola and Bojerenu, Michael Senapon and Olumade, Gbolahan and Lesi, Oluwagbemi and Ezeh, Timothy and Oguntoyinbo, Oluwadamilola and Mogbeyiteren, Tosan and Oresanya, Felicia and Chika, Samuel and Akinjobi, Sodiq},
booktitle = {ICLR 2022 Workshops: AfricaNLP},
year = {2022},
url = {https://mlanthology.org/iclrw/2022/oyewusi2022iclrw-tcnspeech/}
}