measurement
The Pointwise Score used in Med-HALT (Pal et al., 2023) evaluates model performance by calculating the average score across samples, where each correct prediction is awarded a positive score (Pc = +1) and each incorrect prediction incurs a negative penalty (Pw = −0.25).
Authors
Sources
- Medical Hallucination in Foundation Models and Their ... www.medrxiv.org via serper
Referenced by nodes (1)
- Med-HALT concept