Contextual Combinatorial Cascading Bandits
PACORA
PACORA (Performance-Aware Convex Optimization for Research Allocation) is a resource allocation framework for general-purpose operating and cloud systems, which is designed to provide responsiveness guarantees to a simultaneous mix of high-throughput parallel, interactive, and real-time…
Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments
As software and hardware agents begin to perform tasks of genuine interest, they will be faced with environments too complex for humans to predetermine the correct actions to take. Three characteristics shared by many complex…
Optimal Classification with Multivariate Losses
Learning to Soar: Exploration-Exploitation Algorithms for Autonomous Soaring Flight
Soaring is the process of collecting energy from the wind during flight with an aerial platform. Animal studies have demonstrated that large birds can significantly extend their flight duration by soaring in favourable wind conditions.…