Aldo Pacchiano
Provably Sample Efficient RLHF via Active Preference Optimization
Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury
February 2024
Type: Preprint