Fact — claim — Knowledge Tree

The MediHallDetector model surpasses GPT-3.5, GPT-4, and Gemini in hallucination detection performance and improves efficiency compared to manual evaluation, though it still trails human performance.

Authors

Person: Not available Organization: arXiv
Detecting and Evaluating Medical Hallucinations in Large Vision ...

Sources

Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org arXiv via serper

Referenced by nodes (2)

hallucination detection concept
Gemini concept