claim
The gradient signal in teacher forcing is clean because the model is evaluated against the correct answer given the correct context at every step, producing a well-defined and unambiguous learning signal.

Authors

Sources

Referenced by nodes (1)