measurement
GPT-4 solves approximately 75% of false-belief tasks, which is comparable to the performance of a 6-year-old human, as reported by Kosinski (2024) and Strachan et al. (2024).
Authors
Sources
- A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- GPT-4 concept