Fact — measurement — Knowledge Tree

Providing domain-specific knowledge enhances hallucination detection performance across both general-purpose and medical fine-tuned LLMs, with some general models seeing up to a 32% improvement in F1 scores.

Authors

Person: Not available Organization: Emergent Mind
MedHallu: Benchmark for Medical LLM Hallucination Detection

Sources

MedHallu: Benchmark for Medical LLM Hallucination Detection www.emergentmind.com Emergent Mind via serper

Referenced by nodes (3)