reference
Yufan Wu, Yinghui He, Yilin Jia, Rada Mihalcea, Yulong Chen, and Naihao Deng developed 'Hi-ToM', a benchmark designed for evaluating higher-order theory of mind reasoning in large language models.

Authors

Sources

Referenced by nodes (1)