Abstract
Preparing an intelligent system in advance to respond optimally in every possible situation is difficult. Machine learning approaches such as Inverse Reinforcement Learning (IRL) can help learn such behavior from a limited number of demonstrations. We present a model-free IRL technique based on maximum likelihood estimation. To keep the approach model-free, we model the environment with the canonical Markov Decision Process tuple, excluding the transition function, and we define the reward function as a linear combination of a known set of features. Action values are estimated with a modified Q-learning technique called Q-Averaging, and the direction of optimization is guided by the gradient of the likelihood function with respect to the current feature weights until the unknown reward function is identified. Experimental results on a grid-world problem support our model-free IRL formulation. We further evaluate the approach on the real-world problem of freeway merging for autonomous cars, where the results are significant.
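The abstract compresses the method into a single loop: estimate action values model-free under the current reward weights, score the demonstrations under a stochastic policy, and follow the gradient of the demonstration likelihood. The Python sketch below illustrates one plausible instantiation of that loop; it is our reading, not the authors' implementation. The toy grid world, one-hot features, Boltzmann (softmax) likelihood, finite-difference gradient, and the plain Q-learning update standing in for the paper's Q-Averaging rule are all illustrative assumptions.

```python
import numpy as np

# --- Tiny 1-D grid world, used only for illustration (not from the paper) ---
N_STATES, N_ACTIONS, GAMMA = 5, 2, 0.9   # actions: 0 = left, 1 = right

def step(s, a):
    """Sample the next state; the learner never sees the transition model."""
    return max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)

def phi(s, a):
    """One-hot state-action features; the reward is assumed linear: r = w . phi."""
    f = np.zeros(N_STATES * N_ACTIONS)
    f[s * N_ACTIONS + a] = 1.0
    return f

def estimate_q(w, episodes=200, horizon=20, alpha=0.1):
    """Model-free Q estimate from sampled transitions under reward weights w.
    A plain Q-learning update with a fixed seed stands in for the paper's
    Q-Averaging rule, which is not reproduced here."""
    rng = np.random.default_rng(0)  # common random numbers keep the
                                    # likelihood deterministic in w
    q = np.zeros((N_STATES, N_ACTIONS))
    for _ in range(episodes):
        s = rng.integers(N_STATES)
        for _ in range(horizon):
            a = rng.integers(N_ACTIONS)
            s2 = step(s, a)
            q[s, a] += alpha * (w @ phi(s, a) + GAMMA * q[s2].max() - q[s, a])
            s = s2
    return q

def log_likelihood(w, demos, beta=2.0):
    """Log-likelihood of demonstrated (s, a) pairs under the Boltzmann
    policy induced by the Q estimate for the current feature weights."""
    q = estimate_q(w)
    ll = 0.0
    for s, a in demos:
        z = beta * (q[s] - q[s].max())          # stabilized softmax logits
        ll += z[a] - np.log(np.exp(z).sum())
    return ll

# Demonstrations: the expert always moves right, toward the last state.
demos = [(s, 1) for s in range(N_STATES - 1)] * 5

# Gradient ascent on the demonstration likelihood via finite differences.
w, lr, eps = np.zeros(N_STATES * N_ACTIONS), 0.5, 1e-2
for _ in range(30):
    base = log_likelihood(w, demos)
    grad = np.array([(log_likelihood(w + eps * e, demos) - base) / eps
                     for e in np.eye(len(w))])
    w += lr * grad

print("learned feature weights:", np.round(w, 2))
```

Under these assumptions the learned weights assign higher reward to the demonstrated (rightward) actions, which is the qualitative behavior the abstract describes: the likelihood gradient steers the feature weights until the reward explains the demonstrations.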