reference
The paper 'SFT memorizes, RL generalizes: a comparative study of foundation model post-training' was published in the Proceedings of the 42nd International Conference on Machine Learning, Vol. 267, pp. 10818–10838.
