claim
The MediHallDetector model surpasses GPT-3.5, GPT-4, and Gemini in hallucination detection performance and improves efficiency compared to manual evaluation, though it still trails human performance.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (2)
- hallucination detection concept
- Gemini concept