measurement
Li et al. (2023d) found that the performance gap between real and synthetic data is smallest for low-subjectivity tasks like news classification, but significantly larger for high-subjectivity tasks like humor or sarcasm detection.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- Synthetic data concept