Abstract
Autonomous systems predominantly deploy inverse reinforcement learning (IRL) to model the task preferences of a user (often called an expert) as a reward function, by observing the user execute the task. While IRL continues to receive sustained attention, the related problem of online IRL – where observations accrue incrementally, yet the real-time demands of the application often prohibit a full rerun of an IRL method – has received far less attention. Furthermore, most of the current online learning literature assumes perfect, noise-free, fully observable training data, along with prior knowledge of the features of the task model being learned. Unfortunately, these assumptions do not hold in real-world applications. The following data imperfections and gaps in prior knowledge degrade learning accuracy: 1) some of the data in the input trajectories is missing; 2) the data is mixed with data from other sources; 3) the input data contains perception noise and the observation model is unknown; 4) the input data contains perception noise and manual engineering of system features is not possible. The research contributions from my team address these gaps. Experimental evaluation of these cases on robotic domains (navigation and manipulation) and OpenAI Gym domains showed significant performance improvements over state-of-the-art baselines.
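To make the online-IRL setting concrete, the following is a minimal sketch, not the method from this work: it assumes a linear reward R(s) = w · φ(s) over known features and a hypothetical `policy_feature_expectation` callback standing in for the learner's forward model. When each new demonstration trajectory arrives, the sketch applies one incremental feature-matching gradient step rather than rerunning a full batch IRL solve.

```python
import numpy as np

def demo_feature_expectation(trajectory, phi):
    """Average feature vector along one observed demonstration trajectory."""
    return np.mean([phi(s) for s in trajectory], axis=0)

def online_irl_step(w, trajectory, phi, policy_feature_expectation, lr=0.1):
    """One incremental reward-weight update when a new demonstration arrives.

    Assumption (illustrative, not from the abstract): a max-entropy-style
    log-likelihood gradient, i.e. demonstrated features minus the features
    expected under the learner's current policy.
    """
    mu_demo = demo_feature_expectation(trajectory, phi)
    mu_policy = policy_feature_expectation(w)
    return w + lr * (mu_demo - mu_policy)

# Toy 1-D example: states are floats, features are [s, 1].
phi = lambda s: np.array([s, 1.0])
# Hypothetical stand-in for the learner's expected features under w.
policy_fe = lambda w: np.array([0.0, 1.0])

w = np.zeros(2)
for traj in ([1.0, 2.0], [2.0, 3.0]):  # demonstrations arrive one at a time
    w = online_irl_step(w, traj, phi, policy_fe)
```

Each arriving trajectory costs one gradient step instead of a full IRL solve, which is what makes the update compatible with real-time constraints; the data imperfections listed above (missing segments, mixed sources, perception noise, unknown features) each break one of this sketch's assumptions.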