Adaptive Exploration for Multi-Reward Multi-Policy Evaluation