Abstract

Partially observable Markov decision processes (POMDPs) have been widely accepted as a rich framework for planning and control problems. In settings where multiple agents interact, however, POMDPs fail to model the other agents explicitly. The interactive partially observable Markov decision process (I-POMDP) is a new paradigm that extends POMDPs to multiagent settings. The I-POMDP framework models other agents explicitly, making exact solution infeasible in all but the simplest settings. This creates a need for good approximation methods: methods that can find solutions with tight error bounds in a short time. We develop a point-based method for solving finitely nested I-POMDPs approximately. The method maintains a set of belief points and forms value functions that include only the value vectors optimal at these belief points. Since I-POMDP computation depends on predicting the actions of other agents in multiagent settings, we develop an interactive generalization of point-based value iteration (PBVI) that recursively solves all models of the other agents. We present empirical results on domains from the literature and discuss the computational savings of the proposed method.
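To make the point-based idea concrete, the following is a minimal sketch of one PBVI backup for a single-agent POMDP: it computes the value vector that is optimal at a given belief point. All names here are illustrative assumptions, not the paper's implementation; the interactive generalization described above would additionally maintain and recursively solve models of the other agents inside the interactive state space.

```python
def backup(belief, alpha_vectors, states, actions, observations, T, O, R, gamma=0.95):
    """Return the alpha (value) vector that is optimal at `belief`.

    Conventions (hypothetical): T[s][a][s2] is the transition probability,
    O[a][s2][o] the observation probability, R[s][a] the immediate reward,
    and each alpha vector is a list indexed by state.
    """
    best_value, best_alpha = float("-inf"), None
    for a in actions:
        # Start from the immediate reward for taking action a in each state.
        alpha_a = [R[s][a] for s in states]
        for o in observations:
            # Pick the existing alpha vector that maximizes the expected
            # continuation value at this belief after observing o.
            best_ao = max(
                alpha_vectors,
                key=lambda al: sum(
                    belief[s] * sum(T[s][a][s2] * O[a][s2][o] * al[s2]
                                    for s2 in states)
                    for s in states),
            )
            # Fold the discounted continuation value back into alpha_a.
            for s in states:
                alpha_a[s] += gamma * sum(
                    T[s][a][s2] * O[a][s2][o] * best_ao[s2] for s2 in states)
        value = sum(belief[s] * alpha_a[s] for s in states)
        if value > best_value:
            best_value, best_alpha = value, alpha_a
    return best_alpha
```

Running this backup over every belief in the maintained set, and keeping only the resulting vectors, yields the pruned value function that gives point-based methods their computational savings.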
