reference
The Med-HALT benchmark categorizes hallucination tests into Reasoning Hallucination Tests (RHTs), which evaluate a Large Language Model's ability to reason accurately with medical information and generate logically sound, factually correct outputs without fabrication.
Authors
Sources
- Medical Hallucination in Foundation Models and Their ... www.medrxiv.org via serper
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (2)
- hallucination concept
- Med-HALT concept