Publication
Stochastic One-Sided Full-Information Bandit
Video
Dashboard Mechanisms for Online Marketplaces
We present a theoretical model for design and analysis of mechanisms for online marketplaces where a bidding dashboard enables the bid-optimization of long-lived agents. We assume that a good allocation algorithm exists when given the…
Video
Time discretization invariance in Machine Learning, applications to reinforcement learning and recurrent neural networks
While computers are well equipped to deal with discrete flows of data, the real world often provides intrinsically continuous time data sequences, e.g. visual, sensory streams, time series, or state variables in continuous control environments.…