Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments

Date

June 16, 2016

Speaker

Matthew Hausknecht

Affiliation

UT Austin

Overview

As software and hardware agents begin to perform tasks of genuine interest, they will be faced with environments too complex for humans to predetermine the correct actions to take. Three characteristics shared by many complex domains are 1) high-dimensional state and action spaces, 2) partial observability, and 3) multiple learning agents. To tackle such problems, I will describe algorithms that combine deep neural network function approximation with reinforcement learning. First, I will describe using recurrent neural networks to handle partial observability in Atari games. Next, I will describe a multiagent soccer domain, Half Field Offense (HFO), and approaches for learning effective policies in its parameterized, continuous action space. I will conclude with ongoing work on multiagent learning in HFO.
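
To make the recurrent function-approximation idea concrete, the following is a minimal sketch (an illustration for this listing, not code from the talk) of a Q-network with an LSTM layer in the spirit of deep recurrent Q-learning: the recurrent state lets the Q-function integrate information across time steps when a single observation is incomplete. The linear encoder and layer sizes are assumptions standing in for the convolutional stack used on Atari frames.

```python
# Sketch of a recurrent Q-network for partially observable environments.
# Assumed names and sizes; not the speaker's implementation.
import torch
import torch.nn as nn

class RecurrentQNetwork(nn.Module):
    def __init__(self, obs_dim, num_actions, hidden_size=128):
        super().__init__()
        # Placeholder encoder; Atari agents would use convolutional layers here.
        self.encoder = nn.Linear(obs_dim, hidden_size)
        # LSTM carries a hidden state across time steps to compensate
        # for partial observability of individual observations.
        self.lstm = nn.LSTM(hidden_size, hidden_size, batch_first=True)
        self.q_head = nn.Linear(hidden_size, num_actions)

    def forward(self, obs_seq, hidden=None):
        # obs_seq: (batch, time, obs_dim); hidden carries memory between calls.
        x = torch.relu(self.encoder(obs_seq))
        x, hidden = self.lstm(x, hidden)
        return self.q_head(x), hidden
```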