Relax and Randomize: A recipe for online learning algorithms

August 1, 2012
Karthik Sridharan | University of Pennsylvania

We show a principled way of deriving online learning algorithms from a minimax analysis. The framework yields algorithms for various upper bounds on the minimax value that were previously obtained only in a non-constructive fashion. It allows us to seamlessly recover most of the known methods and to derive new, efficient online learning algorithms. We present a number of new algorithms, including a family of randomized methods that use the idea of a “random playout”. Several new versions of the Follow-the-Perturbed-Leader algorithm are presented, as well as methods based on the Littlestone dimension, efficient methods for matrix completion with trace norm, and algorithms for the problems of transductive learning and prediction with static experts. Combined with a localized analysis, the framework can also be used to provide adaptive online learning algorithms that attain faster rates against more benign adversaries. Overall, through the framework we emphasize that understanding the inherent complexity of a learning problem leads to the development of algorithms.

Joint work with Alexander Rakhlin and Ohad Shamir
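
For readers unfamiliar with the Follow-the-Perturbed-Leader idea named in the abstract, the following is a minimal Python sketch of the classical version for prediction with expert advice. It is not one of the new variants presented in the talk. The function names `ftpl_choose` and `run_ftpl`, the rate parameter `eta`, and the choice of exponential perturbations are all assumptions made for illustration.

```python
import random


def ftpl_choose(cum_losses, eta, rng):
    """Return the index of the expert with the smallest perturbed cumulative loss.

    Assumption: each expert's cumulative loss is reduced by independent
    exponential noise with rate eta (mean 1/eta), a textbook choice.
    """
    perturbed = [loss - rng.expovariate(eta) for loss in cum_losses]
    return min(range(len(cum_losses)), key=perturbed.__getitem__)


def run_ftpl(loss_rounds, eta=1.0, seed=0):
    """Play Follow-the-Perturbed-Leader over a sequence of loss vectors.

    loss_rounds: list of per-round loss lists, one entry per expert.
    Returns (learner's total loss, list of chosen experts, cumulative losses).
    """
    rng = random.Random(seed)
    n_experts = len(loss_rounds[0])
    cum_losses = [0.0] * n_experts
    total_loss = 0.0
    picks = []
    for losses in loss_rounds:
        # Choose before seeing this round's losses.
        choice = ftpl_choose(cum_losses, eta, rng)
        picks.append(choice)
        total_loss += losses[choice]
        # Losses of all experts are revealed after the choice.
        for j in range(n_experts):
            cum_losses[j] += losses[j]
    return total_loss, picks, cum_losses


if __name__ == "__main__":
    # Hypothetical data: expert 0 is always right, expert 1 always wrong.
    rounds = [[0.0, 1.0]] * 20
    total, picks, cum = run_ftpl(rounds, eta=1.0, seed=0)
    print("learner loss:", total, "best expert loss:", min(cum))
```

A larger `eta` means smaller perturbations, so the learner follows the current leader more closely; a smaller `eta` randomizes more, which is what protects against an adversarial loss sequence.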

Speaker Details

Karthik Sridharan is currently a postdoctoral researcher in the Statistics Department of The Wharton School, University of Pennsylvania. He received his PhD in computer science from the Toyota Technological Institute at Chicago. His research interests include theoretical machine learning, convex optimization, empirical process theory, and game theory.

- Jeff Running

Series: Microsoft Research Talks