formula
The Hallucination Pointwise Score used in the Med-HALT benchmark is calculated as the average score across samples, where each correct prediction (Pc) is awarded a positive score of +1 and each incorrect prediction (Pw) incurs a negative penalty of -0.25.
Authors
Sources
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (1)
- Med-HALT concept