sharonibejih/tcnspeech-transcriber

Real Time Speech Recognition for Nigerian Church Sermons

This research project aims to improve speech recognition for Nigerian speakers, with a focus on church preachers.

Approach:

  • Data Collection: We adopted a community data-curation approach, in which Christian volunteers offered to transcribe recordings of sermons. This exercise produced about 24 hours of data, which is now open-sourced.

  • Speech Recognition Experiment: We experimented with two pretrained models:

    • NeMo QuartzNet
    • wav2vec2 XLS-R

After training, the two models gave quite similar results. However, wav2vec2 XLS-R was easier to deploy. It has a test word error rate (WER) of 0.235. The model is pushed to the Hugging Face Hub and can be found there.

Below is a snippet of the wav2vec2 XLS-R model's prediction versus the actual text:

[Screenshot: model prediction compared with the actual text]

The model performs better on an actual church sermon (with some background music, as is typical in a church) than on a plain voice recording.
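For readers unfamiliar with the WER figure quoted above, here is a minimal sketch of how a word error rate can be computed: the word-level edit distance (substitutions, insertions, deletions) between the reference transcript and the model output, divided by the number of reference words. The function name `word_error_rate` is illustrative, and this is not necessarily the evaluation tooling used in this project.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of five reference words.
    print(word_error_rate("the lord is my shepherd", "the lord is my shepard"))
```

Under this definition, a WER of 0.235 corresponds to roughly one word error for every four reference words. Corpus-level WER is usually computed by summing errors and reference words over all utterances before dividing, rather than averaging per-utterance scores.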

  • Deployment: The model is downloaded from the Hub and used to transcribe speech in real time. The project is dockerized and the image is pushed to Docker Hub.
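The deployment step described above could look roughly like the following sketch, which loads a wav2vec2-style model from the Hugging Face Hub with the `transformers` ASR pipeline and transcribes audio in fixed-length chunks. This is an assumption about one reasonable setup, not the repository's actual code: `MODEL_ID` is a placeholder to be replaced with the real model ID, and the helper names `load_asr`, `chunk_audio`, and `transcribe_stream` are hypothetical.

```python
from typing import Callable, Iterable, Iterator, List, Sequence

SAMPLE_RATE = 16_000  # assumption: wav2vec2-family models are usually fed 16 kHz audio
MODEL_ID = "<hub-username>/<model-name>"  # placeholder, not the project's real model ID


def load_asr(model_id: str = MODEL_ID):
    """Build a Hugging Face ASR pipeline (downloads the model on first use)."""
    # Imported lazily so the helpers below work without transformers installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def chunk_audio(
    samples: Sequence, sample_rate: int, chunk_seconds: float
) -> Iterator[Sequence]:
    """Yield consecutive fixed-length windows; the last one may be shorter."""
    size = int(sample_rate * chunk_seconds)
    if size <= 0:
        raise ValueError("chunk_seconds is too small for this sample rate")
    for start in range(0, len(samples), size):
        yield samples[start:start + size]


def transcribe_stream(
    asr: Callable, chunks: Iterable, sample_rate: int = SAMPLE_RATE
) -> List[str]:
    """Run the ASR callable on each chunk and collect the text outputs."""
    texts = []
    for chunk in chunks:
        # The transformers ASR pipeline accepts raw audio in this dict form;
        # with a real pipeline, `chunk` should be a 1-D float NumPy array.
        result = asr({"raw": chunk, "sampling_rate": sample_rate})
        texts.append(result["text"])
    return texts
```

With a real model, usage would be along the lines of `asr = load_asr("<model id>")` followed by `transcribe_stream(asr, chunk_audio(audio, SAMPLE_RATE, 5.0))`, where `audio` is a mono waveform. Fixed windows can cut words at chunk boundaries, so a production setup might prefer overlapping windows or voice-activity-based segmentation.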

Citation:

The 24h Church sermon dataset can be found here.

Please reference our paper if you make use of this data.

@inproceedings{
oyewusi2022tcnspeech,
title={{TCNS}peech: A Community-Curated Speech Corpus for Sermons},
author={Wuraola Fisayo Oyewusi and Sharon Ibejih and Soromfe Uzomah and Elizabeth Mawutin Joseph and Jon Cynthia and Folakunmi Ojemuyiwa and Benedicta Johnson-Onuigwe and Omolola Taiwo and Akintunde Akinpelumi and Olabisi Adesina and Ayodele Noutouglo and Adeola Adeleke Adeoba and Andrew Akoh and Chukwuemeka Nwachukwu and Opeyemi Agbabiaje and Itunu Falade and Olukemi Erhunmwunsee and Oluwatobiloba Dada and Ol{\'u}wat{\'o}bi David OSIBELUWO and Ehis Akene and Udim Akpan and Moira Amadi-Emina and Jaiyeola Marquis and Michael Senapon Bojerenu and Gbolahan Olumade and Oluwagbemi Lesi and Timothy Ezeh and Oluwadamilola Oguntoyinbo and Tosan Mogbeyiteren and Felicia Oresanya and Samuel Chika and Sodiq Akinjobi},
booktitle={3rd Workshop on African Natural Language Processing},
year={2022},
url={https://openreview.net/forum?id=r_-PYcf4LZc}
}
