reference
MediQ is a benchmark for reliable interactive clinical reasoning that evaluates question-asking capabilities in Large Language Models, as presented by Li et al. in November 2024.
Authors
Sources
- A Comprehensive Benchmark and Evaluation Framework for Multi ... arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept