Inverse learning of robot behavior for ad-hoc teamwork

Trivedi, Maulesh

2016

Abstract

Machine Learning and Robotics present a very intriguing combination of research in Artificial Intelligence. Inverse Reinforcement Learning (IRL) algorithms have generated a great deal of interest in the AI community in recent years. However, very little research has been done on modelling agent interactions in multi-robot ad-hoc settings after learning is complete. Moreover, incorporating IRL into practical robot environments that involve online learning and high levels of uncertainty is a challenge. While decision-theoretic frameworks used for planning in these environments provide good approximations for computing an optimal policy for an agent, their model parameters are usually specified by a human designer. We describe a unique Bayesian approach to approximate unknown state transition functions. We then propose a novel multi-agent Best Response Model that plugs in the expert's reward structure learnt through Maximum Entropy Inverse Reinforcement Learning and uses the transition functions learnt through our Bayes Adaptive approach to compute an optimal best response policy for our multi-robot ad-hoc setting. We test our algorithms on a robot debris-sorting task.

Details

Record ID

15802

Record Created

2024-12-05

Title

Inverse learning of robot behavior for ad-hoc teamwork

Author

Trivedi, Maulesh

Contributor

Doshi, Prashant Advisor
Potter, Walter D. Committee Member
Rasheed, Khaled Committee Member

College or School

Franklin College of Arts and Sciences

Department

Institute for Artificial Intelligence

Date

2016

Publisher

University of Georgia

Content Type

Thesis

Language

English

Dissertation/Thesis Note

Graduate

Degree Type

Master of Science (MS)

Name of Granting Institution

University of Georgia, Summer 2016

Year Degree Granted

2016

Keywords

Inverse Reinforcement Learning; Markov Decision Process; Bayes Adaptive Markov Decision Process; Best Response Model; Dec MDP; Optimal Policy; Reward Function

Record Appears in

College, School, or Unit > Franklin College of Arts and Sciences
Electronic Theses and Dissertations > Graduate Thesis
All Resources

System Control Number

9949333891702959
