reference
AgentBench, a framework for evaluating large language models as agents, was described by Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, et al. in the 2023 arXiv preprint 'AgentBench: Evaluating LLMs as Agents'.
