Learning to Represent Action Values as a Hypergraph on the Action Vertices
Action values are ubiquitous in reinforcement learning (RL) methods, with the sample complexity of such methods relying heavily on how fast a good estimator for action value can be learned. By viewing this problem through the lens of representation learning, good representations of both state…