claim
Exposure bias in large language models is the discrepancy between the distribution of conditioning contexts seen during training, which uses ground-truth tokens via teacher forcing, and the distribution seen during inference, which uses model-generated tokens.
Authors
Sources
- Hallucination Causes: Why Language Models Fabricate Facts mbrenndoerfer.com via serper
Referenced by nodes (2)
- Large Language Models concept
- exposure bias concept