procedure
During training, large language models use a technique called teacher forcing: at each step, the model conditions its next-token prediction on the ground-truth previous tokens from the training sequence rather than on its own earlier predictions, so every position can be trained in parallel against a known prefix.
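A minimal sketch of the idea, using a hypothetical helper (not any specific library's API): for a training sequence, teacher forcing produces one (ground-truth prefix, target token) pair per position, and the model never sees its own sampled outputs.

```python
def teacher_forcing_pairs(tokens):
    """Yield (input_prefix, target) pairs for next-token training.

    The prefix is always the ground-truth sequence up to position t,
    never tokens the model itself generated.
    """
    return [(tokens[:t], tokens[t]) for t in range(1, len(tokens))]

sequence = ["<bos>", "the", "cat", "sat"]
for prefix, target in teacher_forcing_pairs(sequence):
    print(prefix, "->", target)
```

At inference time, by contrast, the model must condition on its own previous outputs; the mismatch between these two regimes is often called exposure bias.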
