claim
The evaluation framework described in the paper is the first fully integrated benchmark specifically designed for evaluating multi-turn medical consultation competence in Large Language Models (LLMs).

Authors

Sources

Referenced by nodes (1)