measurement
GPT-4o had a hallucination rate of 22.0% in Diagnosis Prediction, described as marginally lower than the rate observed for Gemini-2.0-flash-exp. The Gemini rate is recorded here as 2.25%, which does not fit that comparison (22.0% is well above 2.25%); the authors note a potential data discrepancy in the Gemini figures.
Authors
Sources
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (1)
- GPT-4 concept