A Deep Learning Theory: Global minima and over-parameterization
One empirical finding in deep learning is that simple methods such as stochastic gradient descent (SGD) have a remarkable ability to fit training data. From a capacity perspective, this may not be surprising: modern neural networks are heavily over-parameterized, with far more trainable parameters than training samples, so in principle they have enough capacity to memorize the data.
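To make this finding concrete, here is a minimal sketch (using PyTorch; the architecture, width, and hyperparameters are illustrative choices, not from the original) of the phenomenon: an over-parameterized network trained with plain SGD drives training error to zero even on randomly labeled data, where there is no signal to learn at all.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

n, d, classes = 200, 32, 10          # 200 samples of dimension 32
X = torch.randn(n, d)                # random inputs
y = torch.randint(0, classes, (n,))  # random labels: nothing to "learn"

# Hidden width 2048 gives far more parameters than training samples,
# i.e. the over-parameterized regime.
model = nn.Sequential(
    nn.Linear(d, 2048), nn.ReLU(),
    nn.Linear(2048, classes),
)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

# Plain full-batch SGD on the training objective.
for step in range(2000):
    opt.zero_grad()
    loss = loss_fn(model(X), y)
    loss.backward()
    opt.step()

train_acc = (model(X).argmax(dim=1) == y).float().mean().item()
print(f"final loss {loss.item():.4f}, training accuracy {train_acc:.2f}")
# Typically reaches 100% training accuracy: SGD finds a (near-)global
# minimum of the training objective despite the labels being pure noise.
```

Capacity alone explains why such a memorizing solution *exists*; the theoretical puzzle this line of work addresses is why a simple first-order method like SGD reliably *finds* a global minimum of this highly non-convex objective.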