measurement
Providing domain-specific knowledge enhances hallucination detection performance across both general-purpose and medical fine-tuned LLMs, with some general models seeing up to a 32% improvement in F1 scores.
Authors
Sources
- MedHallu: Benchmark for Medical LLM Hallucination Detection www.emergentmind.com via serper
Referenced by nodes (3)
- Large Language Models concept
- hallucination detection concept
- Domain-Specific Knowledge concept