reference
The paper 'Towards a theoretical understanding of synthetic data in llm post-training: a reverse-bottleneck perspective' provides a theoretical framework for understanding the role of synthetic data in post-training large language models.

Authors

Sources

Referenced by nodes (2)