measurement
Xiong et al. (2024) report experiments in which i-MedRAG outperforms standard retrieval-augmented generation (RAG) approaches on complex questions from the United States Medical Licensing Examination (USMLE) and Massive Multitask Language Understanding (MMLU) datasets.
Sources
- Medical Hallucination in Foundation Models and Their ... (www.medrxiv.org)