Adaptive Exploration for Multi-Reward Multi-Policy Evaluation

Publication
42nd International Conference on Machine Learning