Go to main content
Formats
Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Automatic speech recognition (ASR) enables the transcription of spoken speech into a written format. Previous research has shown racial biases in modern ASR systems exist and negatively affect Black speakers. In this thesis, speech data from the CallHome and CORAAL ATL, DCB, PRV, and ROC corpora are processed and given to ASpIRE, a DNN-HMM model built on the open-source ASR toolkit Kaldi. The trends in the model’s word error rates between different phonological phenomena and corpora are considered in the context of the model’s original training process and modern sociolinguistic knowledge. All in all, the training set used to develop the ASpIRE model is insufficiently enriched with phonological and lexical representations of AAL and Southern characteristics.

Details

PDF

Statistics

from
to
Export
Download Full History