
This work introduces SIPOMDPLite-net, a deep neural network (DNN) architecture for agent control in partially observable multiagent settings with sparse interactions between agents. The network represents a new method for planning in contexts modeled by the interactive partially observable Markov decision process (I-POMDP) Lite and the decentralized sparse-interaction MDP (Dec-SIMDP) frameworks, which make self-interested planning in settings shared with other agents more tractable than the well-known I-POMDP framework. The network uses fully differentiable value iteration networks to simulate the solution of the nested MDPs that I-POMDP Lite attributes to the other agent to model its behavior, avoiding the need for nondifferentiable techniques such as particle filtering that model the other agent more generally. We train SIPOMDPLite-net on a small two-agent tiger-grid problem, for which it accurately learns the underlying model and a near-optimal policy, and the trained model continues to perform well on much larger and more complex grids. As such, SIPOMDPLite-net shows good transfer capabilities and offers a lighter learning and planning approach for individual agents in multiagent settings.
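The core idea of embedding value iteration in a differentiable network is that each Bellman backup is an ordinary tensor operation, so a fixed number of backups can be unrolled as network layers and trained end to end. The sketch below illustrates that unrolling on a generic tabular MDP in NumPy; the MDP, its transition and reward tensors, and the iteration count are hypothetical stand-ins, not the paper's actual architecture or tiger-grid model.

```python
import numpy as np

def value_iteration(T, R, gamma=0.95, k=50):
    """Unroll k Bellman backups, as a value-iteration-network core would.

    T: (A, S, S) transition probabilities per action.
    R: (A, S) immediate rewards per action and state.
    Each backup is a linear map (T @ V) plus a max over actions,
    i.e., operations a differentiable network layer can realize.
    """
    S = T.shape[1]
    V = np.zeros(S)
    for _ in range(k):
        Q = R + gamma * T @ V   # (A, S): expected return of each action
        V = Q.max(axis=0)       # max over actions (a VIN's max-pooling step)
    return V, Q.argmax(axis=0)  # values and greedy policy

# Toy 2-state, 2-action MDP (illustrative numbers only)
T = np.array([[[0.9, 0.1], [0.1, 0.9]],    # action 0: mostly stay put
              [[0.5, 0.5], [0.5, 0.5]]])   # action 1: uniform move
R = np.array([[1.0, 0.0],                  # action 0 rewards by state
              [0.5, 0.5]])                 # action 1 rewards by state
V, policy = value_iteration(T, R)
```

In the network setting, the rolled-out loop becomes a stack of shared-weight layers, so gradients flow through every backup and the transition and reward tensors can be learned from data rather than specified by hand.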
