claim
Exposure bias in large language models is the discrepancy between the distribution of conditioning contexts seen during training, which uses ground-truth tokens via teacher forcing, and the distribution seen during inference, which uses model-generated tokens.

Authors

Sources

Referenced by nodes (2)