reference
MedHallu is a benchmark for evaluating medical hallucination detection. Derived from PubMedQA, it contains 10,000 question-answer pairs into which plausible hallucinations have been deliberately planted.
Sources
- EdinburghNLP/awesome-hallucination-detection, GitHub (github.com)
- MedHallu: Benchmark for Medical LLM Hallucination Detection (www.emergentmind.com)
Referenced by nodes (3)
- medical hallucination concept
- MedHallu concept
- PubMedQA concept