Fact — claim — Knowledge Tree

The MedHALT benchmark is limited to assessing the reasoning capabilities of Large Language Models over the medical domain in a Question Answering (QA) format.

Authors

Person: Not available Organization: Nature
A framework to assess clinical safety and hallucination rates of LLMs ...

Sources

A framework to assess clinical safety and hallucination rates of LLMs ... www.nature.com Nature via serper

Referenced by nodes (2)

Question Answering concept
Med-HALT concept