Explore-Exploit Learning @MSR-NYC

Established: October 24, 2013

This is an umbrella project for machine learning with explore-exploit tradeoff: the trade-off between acquiring and using information. This is a mature, yet very active, research area studied in Machine Learning, Theoretical Computer Science, Operations Research, and Economics. Much of our activity focuses on “multi-armed bandits” and “contextual bandits”, relatively simple and yet very powerful models for explore-exploit tradeoff.

We are located in (or heavily collaborating with) Microsoft Research New York City. Most of us are involved in Multi-World Testing: an approach & system for contextual bandit learning.

People

People

Portrait of Robert Schapire

Robert Schapire

Partner Researcher

Portrait of Paul Mineiro

Paul Mineiro

Principal Data Scientist

Portrait of Siddhartha Sen

Siddhartha Sen

Principal Researcher

Portrait of Sarah Bird

Sarah Bird

Principal Program Manager, Responsible AI Lead, Emerging Technology and Research Strategy Lead, Azure AI

Portrait of Alex Slivkins

Alex Slivkins

Principal Researcher

Portrait of Miro Dudík

Miro Dudík

Sr Principal Researcher

Portrait of John Langford

John Langford

Partner Researcher Manager

Portrait of Markus Cozowicz

Markus Cozowicz

Senior Research Engineer

Portrait of Alekh Agarwal

Alekh Agarwal

Principal Research Manager