Fact — claim — Knowledge Tree

Early studies by Shin et al. (2020) and Deng et al. (2022) demonstrate that short discrete triggers can reliably elicit target behaviors in language models, although these prompts are often difficult for humans to interpret.

Authors

Person: Not available Organization: arXiv
A Survey on the Theory and Mechanism of Large Language Models

Sources

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv via serper

Referenced by nodes (1)

Language Model concept