Fact — reference — Knowledge Tree

The paper 'AI hospital: Benchmarking large language models in a multi-agent medical interaction simulator' by Zhihao Fan et al. introduces a benchmark for evaluating large language models in medical interaction scenarios, published in the Proceedings of the 31st International Conference on Computational Linguistics in January 2025.

Authors

Person: Not available Organization: arXiv
A Comprehensive Benchmark and Evaluation Framework for Multi ...

Sources

A Comprehensive Benchmark and Evaluation Framework for Multi ... arxiv.org arXiv via serper

Referenced by nodes (1)

Large Language Models concept