reference
Hainiu Xu, Runcong Zhao, Lixing Zhu, Jinhua Du, and Yulan He developed 'OpenToM', a comprehensive benchmark for evaluating the theory-of-mind reasoning capabilities of large language models, which was presented at the 62nd Annual Meeting of the Association for Computational Linguistics.
