Abstract
My research explored the linguistic predictiveness of untrained neural language models to better understand why their activations are predictive of neural data. My results indicate that the results published in Schrimpf et al. for the Blank2014 fMRI dataset are erroneous. The results of the linguistic analysis contradicted expectations, especially for the untrained models: the XL-Untrained model significantly outperformed the GPT2-Untrained model on fMRI prediction, but significantly underperformed it on predicting linguistic targets. Although the trained GPT-2-XL outperformed the GPT2-Untrained model on fMRI prediction, it performed similarly on next-word ngram probability prediction and underperformed the base model on part-of-speech prediction. Finally, a possible theory is proposed that reconciles the apparent contradiction in these results.