claim
Incorporating domain-specific knowledge and introducing a 'not sure' category as an answer option improves precision and F1 scores by up to 38% relative to baselines in medical hallucination detection.
Authors
Sources
- A Comprehensive Benchmark for Detecting Medical Hallucinations ... aclanthology.org via serper
Referenced by nodes (2)
- medical hallucination concept
- Domain-Specific Knowledge concept