reference
MedDialogRubrics is a benchmark and evaluation framework designed to assess the multi-turn inquiry abilities of medical Large Language Models (LLMs) by focusing on fine-grained, human-aligned evaluation of the diagnostic process rather than just single-turn QA or final diagnosis accuracy.

Authors

Sources

Referenced by nodes (2)