Reinforcement Learning

Autonomous Discovery from Data

Sequential decision making algorithms are history dependent policies. Modern sequence prediction models such as transformer architectures have made it feasible to successfully represent these objects in compact architectures. In these research works we explore how to meta-train these models from data to encode known and new sequential decision making strategies.

Neural Optimism for Genetic Perturbation Experiments

This work provides a theoretically sound framework for iteratively exploring the space of perturbations in pooled batches in order to maximize a target phenotype under an experimental budget.

Model Selection for Contextual Bandits and Reinforcement Learning

In the problem of model selection the objective is to design ways to select in online fashion the best suitable algorithm to solve a specific problem instance.

Beyond the Standard Assumptions in Reinforcement Learning

In Reinforcement Learning it is standard to assume the reward to be an additive function of per state feedback. In this work we challenge this standard assumption.