reference
The paper 'Iterative label refinement matters more than preference optimization under weak supervision' was presented at The Thirteenth International Conference on Learning Representations (ICLR).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper