Abstract
Temporal difference learning algorithms have been used successfully to train neural networks to play backgammon at human expert level. This approach has subsequently been applied to deterministic games such as chess and Go with little success, but few have attempted to apply it to other nondeterministic games. We use temporal difference learning to train neural networks for four such games: backgammon, hypergammon, pachisi, and Parcheesi. We investigate the influence of two training variables on these networks: the source of training data (learner-versus-self or learner-versus-other game play) and its structure (a simple encoding of the board layout, a set of derived board features, or a combination of both of these). We show that this approach is viable for all four games, that self-play can provide effective training data, and that the combination of raw and derived features allows for the development of stronger players.