claim
MedDialogRubrics is a benchmark for multi-turn medical consultations in Large Language Models (LLMs) that comprises 5,200 synthetically constructed patient cases and over 60,000 fine-grained evaluation rubrics.
Authors
Sources
- A Comprehensive Benchmark and Evaluation Framework for Multi ... arxiv.org via serper
Referenced by nodes (2)
- Large Language Models concept
- MedDialogRubrics concept