claim
Kadavath et al. (2022) argue that language models can, for the most part, judge whether they know the answer to a question, as detailed in their preprint 'Language Models (Mostly) Know What They Know'.

Authors

Sources
