reference
The paper 'AI hospital: Benchmarking large language models in a multi-agent medical interaction simulator' by Zhihao Fan et al. introduces a benchmark for evaluating large language models in simulated medical interaction scenarios. It was published in the Proceedings of the 31st International Conference on Computational Linguistics in January 2025.
