Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

Publication
35th Conference on Neural Information Processing Systems