Aldo Pacchiano
Home
Publications
Contact
Talks
Nirjhar Das
Latest
Provably Sample Efficient RLHF via Active Preference Optimization
Cite
×