A Theoretical Framework for Partially Observed Reward-States in RLHF

Publication
13th International Conference on Learning Representations