Abstract
Inverse reinforcement learning (IRL) seeks to learn the preferences of an expert agent performing a task from the expert's demonstrations. More specifically, it seeks to recover the reward function of the expert, modelled as a Markov decision process (MDP), from observations of the expert's state-action trajectories. IRL is significant because it can use an expert agent's demonstrations of real-world activities, such as driving, locomotion, and other robotic tasks, to build intelligent agents. This research provides a novel method for preference learning by developing a model-based IRL algorithm for continuous action spaces. It generalizes a previous Bayesian approach to IRL to continuous action spaces and incorporates trust region policy optimization. Action-space densities are generated for each state using a random walk, and an online transition model is used. Our method learns the reward function of an expert agent with a continuous action space and uses this learned function to complete the underlying MDP and predict an optimal policy. Experimental results on a benchmark problem domain, Object-World, and on modelling driver behavior on congested freeways offer evidence of the benefits of this approach.
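The idea of generating an action-space density per state via a random walk can be illustrated with a minimal sketch. The code below is an assumption-laden toy, not the paper's algorithm: it walks over a bounded 1-D continuous action interval with Gaussian steps and turns the visited actions into a normalized empirical density; the function names, step scale, and bounds are all hypothetical choices made for illustration.

```python
import numpy as np

def random_walk_action_samples(a0, n_steps=500, step_scale=0.1,
                               bounds=(-1.0, 1.0), rng=None):
    # Hypothetical sketch: sample candidate actions for one state by a
    # bounded random walk over a 1-D continuous action interval.
    rng = rng if rng is not None else np.random.default_rng(0)
    lo, hi = bounds
    samples = np.empty(n_steps)
    a = a0
    for t in range(n_steps):
        # Gaussian step, clipped so the walk stays inside the action bounds.
        a = float(np.clip(a + rng.normal(0.0, step_scale), lo, hi))
        samples[t] = a
    return samples

def empirical_action_density(samples, n_bins=20, bounds=(-1.0, 1.0)):
    # Histogram-based density estimate over the action interval;
    # density=True normalizes so the histogram integrates to 1.
    hist, edges = np.histogram(samples, bins=n_bins, range=bounds,
                               density=True)
    return hist, edges

samples = random_walk_action_samples(a0=0.0)
density, edges = empirical_action_density(samples)
```

In a full method, a density like this would serve only as a discretized stand-in for the continuous action space at each state; the sketch shows the mechanics, not how the densities feed into the Bayesian IRL inference.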