procedure
The MedHallu benchmark generates hallucinated answers through a controlled pipeline to create a dataset for binary hallucination detection.
Authors
Sources
- A Comprehensive Benchmark for Detecting Medical Hallucinations ... aclanthology.org via serper
Referenced by nodes (2)
- hallucination detection concept
- MedHallu concept