claim
The gradient signal in teacher forcing is clean because the model is evaluated against the correct answer given the correct context at every step, producing a well-defined and unambiguous learning signal.
Authors
Sources
- Hallucination Causes: Why Language Models Fabricate Facts mbrenndoerfer.com via serper
Referenced by nodes (1)
- Large Language Models concept