Race and Regionality on the ASpIRE ASR Model

Le, Lillian

Race and Regionality on the ASpIRE ASR Model

Le, Lillian

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

Automatic speech recognition (ASR) enables the transcription of spoken speech into a written format. Previous research has shown racial biases in modern ASR systems exist and negatively affect Black speakers. In this thesis, speech data from the CallHome and CORAAL ATL, DCB, PRV, and ROC corpora are processed and given to ASpIRE, a DNN-HMM model built on the open-source ASR toolkit Kaldi. The trends in the model’s word error rates between different phonological phenomena and corpora are considered in the context of the model’s original training process and modern sociolinguistic knowledge. All in all, the training set used to develop the ASpIRE model is insufficiently enriched with phonological and lexical representations of AAL and Southern characteristics.

Details

Record ID

4483

Record Created

2024-12-05

Title

Race and Regionality on the ASpIRE ASR Model

Author

Le, Lillian

Contributor

Renwick, Margaret E. L. Advisor
Rasheed, Khaled Committee Member
Hale, John Committee Member

College or School

Franklin College of Arts and Sciences

Department

Institute for Artificial Intelligence

Subjects

Artificial intelligence
Linguistics

Content Type

Thesis

Pagination

89

File Format

pdf

Language

English

Degree Type

Master of Science (MS)

Name of Granting Institution

University of Georgia

Year Degree Granted

2021-12

Record Appears in

College, School, or Unit > Franklin College of Arts and Sciences
Electronic Theses and Dissertations > Graduate Thesis
All Resources

System Control Number

9949421028302959

PDF

Statistics

Download Full History