TY - JOUR
T1 - A Cookbook for Community-driven Data Collection of Impaired Speech in Low-Resource Languages
AU - Salihs, Sumaya Ahmed
AU - Wiafe, Isaac
AU - Abdulai, Jamal Deen
AU - Atsakpo, Elikem Doe
AU - Ayoka, Gifty
AU - Cave, Richard
AU - Ekpezu, Akon Obu
AU - Holloway, Catherine
AU - Tomanek, Katrin
AU - Winful, Fiifi Baffoe Payin
N1 - Publisher Copyright:
© 2025 International Speech Communication Association. All rights reserved.
PY - 2025
Y1 - 2025
N2 - This study presents an approach for collecting speech samples to build Automatic Speech Recognition (ASR) models for impaired speech, particularly, low-resource languages. It aims to democratize ASR technology and data collection by developing a "cookbook" of best practices and training for community-driven data collection and ASR model building. As a proof-of-concept, this study curated the first open-source dataset of impaired speech in Akan: a widely spoken indigenous language in Ghana. The study involved participants from diverse backgrounds with speech impairments. The resulting dataset, along with the cookbook and open-source tools, are publicly available to enable researchers and practitioners to create inclusive ASR technologies tailored to the unique needs of speech impaired individuals. In addition, this study presents the initial results of finetuning open-source ASR models to better recognize impaired speech in Akan.
AB - This study presents an approach for collecting speech samples to build Automatic Speech Recognition (ASR) models for impaired speech, particularly, low-resource languages. It aims to democratize ASR technology and data collection by developing a "cookbook" of best practices and training for community-driven data collection and ASR model building. As a proof-of-concept, this study curated the first open-source dataset of impaired speech in Akan: a widely spoken indigenous language in Ghana. The study involved participants from diverse backgrounds with speech impairments. The resulting dataset, along with the cookbook and open-source tools, are publicly available to enable researchers and practitioners to create inclusive ASR technologies tailored to the unique needs of speech impaired individuals. In addition, this study presents the initial results of finetuning open-source ASR models to better recognize impaired speech in Akan.
KW - automatic speech recognition
KW - community engagement
KW - democratizing AI
KW - impaired speech
KW - low resource language
UR - https://www.scopus.com/pages/publications/105020037127
U2 - 10.21437/Interspeech.2025-2261
DO - 10.21437/Interspeech.2025-2261
M3 - Conference article
AN - SCOPUS:105020037127
SN - 2308-457X
SP - 4623
EP - 4627
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 26th Interspeech Conference 2025
Y2 - 17 August 2025 through 21 August 2025
ER -