Post-training Large Language Models for Diverse High-Quality Responses

Publication
14th International Conference on Learning Representations