Deep Generative Models for Imitation Learning and Fairness
- Jiaming (Tony) Song | Stanford University
In the first part of the talk, I will introduce Multi-agent Generative Adversarial Imitation Learning, a new framework for multi-agent imitation learning in general Markov games that builds on a generalized notion of inverse reinforcement learning. Multi-agent settings are challenging due to the existence of multiple (Nash) equilibria and non-stationary environments. Our method can be used to imitate complex behaviors in high-dimensional environments with multiple cooperative or competing agents.
In the second part of the talk, I will discuss an information-theoretically motivated objective for learning maximally expressive representations subject to fairness constraints. This objective generalizes a range of existing approaches. We introduce a dual optimization method that allows the user to explicitly control the level of fairness. Empirical evidence suggests that our proposed method can account for multiple notions of fairness and achieve higher expressiveness at a lower computational cost.
View presentation slides here: https://www.microsoft.com/en-us/research/wp-content/uploads/2018/12/Deep-Generative-Models-for-Imitation-Learning-and-Fairness-SLIDES.pdf
Greg Yang
Senior Researcher
Series: Microsoft Research Talks
Decoding the Human Brain – A Neurosurgeon’s Experience
- Dr. Pascal O. Zinn
Challenges in Evolving a Successful Database Product (SQL Server) to a Cloud Service (SQL Azure)
- Hanuma Kodavalla
- Phil Bernstein
Improving text prediction accuracy using neurophysiology
- Sophia Mehdizadeh
Tongue-Gesture Recognition in Head-Mounted Displays
- Tan Gemicioglu
DIABLo: a Deep Individual-Agnostic Binaural Localizer
- Shoken Kaneko
Audio-based Toxic Language Detection
- Midia Yousefi
From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks
- Forrest Iandola
- Sujeeth Bharadwaj
Hope Speech and Help Speech: Surfacing Positivity Amidst Hate
- Ashique Khudabukhsh
Towards Mainstream Brain-Computer Interfaces (BCIs)
- Brendan Allison
Learning Structured Models for Safe Robot Control
- Subramanian Ramamoorthy