On the Theory of Reinforcement Learning with Once-per-Episode Feedback

Publication
35th Conference on Neural Information Processing Systems