reference
The paper 'Scaling laws for reward model overoptimization' was published in the International Conference on Machine Learning, pp. 10835–10866.

Authors

Sources

Referenced by nodes (1)