measurement
The medical-specialized AI models PMC-Llama, MedAlpaca, AlpacaRE, and MedGemma demonstrate baseline hallucination resistance scores of 40.8%, 32.0%, 28.6%, and 52.6% respectively, which is less than half the resistance of general-purpose models.
Authors
Sources
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (1)
- hallucination resistance concept