Aldo Pacchiano

Assistant Professor / Visiting Scientist

Boston University / Broad Institute of MIT and Harvard

I am an Assistant Professor at the Boston University Center for Computing and Data Sciences and a Visiting Scientist at the Schmidt Center of the Broad Institute of MIT and Harvard. I obtained my PhD at UC Berkeley, where I was advised by Peter Bartlett and Michael Jordan. My research leverages the principles of sequential decision making to design intelligent and adaptive systems. I aim to advance our statistical understanding of learning phenomena in adaptive environments and translate these insights into the design of AI systems for autonomous discovery in large-scale scientific and societal domains.

The Pacchiano Lab for Adaptive and Intelligent Algorithms (PLAIA) site can be found here.

A sample of my literary writings, including short stories and notes in both English and Spanish, can be found here.

Interests

  • LLMs and Discovery
  • Reinforcement Learning
  • AI for Science
  • Theory of Sequential Decision-Making

Education

  • PhD in Computer Science, 2021

    University of California, Berkeley

  • MEng in Computer Science, 2014

    Massachusetts Institute of Technology

  • Master of Advanced Study in Pure Mathematics, 2013

    University of Cambridge

  • Bachelor of Science in Computer Science and Theoretical Mathematics, 2012

    Massachusetts Institute of Technology

Recent Posts

Autonomous Discovery from Data

Sequential decision-making algorithms are history-dependent policies. Modern sequence prediction models such as transformer …

The Dissimilarity Dimension

For some time, the Eluder dimension has been used to provide bounds for optimistic algorithms in function approximation regimes. We …

Neural Optimism for Genetic Perturbation Experiments

This work provides a theoretically sound framework for iteratively exploring the space of perturbations in pooled batches in order to …

Model Selection for Contextual Bandits and Reinforcement Learning

In the problem of model selection, the objective is to design methods that select, in an online fashion, the most suitable algorithm to solve a …

Beyond the Standard Assumptions in Reinforcement Learning

In Reinforcement Learning it is standard to assume that the reward is an additive function of per-state feedback. In this work we …

Publications

Language Model Personalization via Reward Factorization

COLM 2025. Also presented as an oral at the 2nd Workshop on Test-Time Adaptation Putting Updates to the Test (PUT) 2025 at ICML and at …

Multiple-policy Evaluation via Density Estimation

ICML 2025. Also presented at the Foundations of Reinforcement Learning and Control – Connections and Perspectives Workshop, ICML …

A Theoretical Framework for Partially Observed Reward-States in RLHF

ICLR 2025. Also presented at the Aligning Reinforcement Learning Experimentalists and Theorists Workshop and the Workshop on Models of …

Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives

Presented at the Failure Modes of Sequential Decision-Making in Practice Workshop, RLC 2024.

Provably Sample Efficient RLHF via Active Preference Optimization

Presented at the Theoretical Foundations of Foundation Models Workshop, ICML 2024.

Experiment Planning with Function Approximation

NeurIPS 2023, also presented at the PAC-Bayes Meets Interactive Learning Workshop, ICML 2023.

Anytime Model Selection in Linear Bandits

NeurIPS 2023, also presented at the PAC-Bayes Meets Interactive Learning Workshop, ICML 2023.

Supervised Pretraining Can Learn In-Context Reinforcement Learning

NeurIPS 2023, also presented at the New Frontiers in Learning, Control, and Dynamical Systems Workshop, ICML 2023.

Transfer RL via the Undo Maps Formalism

Presented at the New Frontiers in Learning, Control, and Dynamical Systems Workshop, ICML 2023.

Meta Learning MDPs with Linear Transition Models

AISTATS 2022; also presented at the Workshop on Reinforcement Learning Theory, ICML 2021.

On the Theory of Reinforcement Learning with Once-per-Episode Feedback

NeurIPS 2021; also presented as an oral talk at the Workshop on Reinforcement Learning Theory, ICML 2021.