Fact — procedure — Knowledge Tree

The authors of "Survey and analysis of hallucinations in large language models" ran controlled experiments on multiple large language models (GPT-4, LLaMA 2, DeepSeek, Qwen) using standardized hallucination evaluation benchmarks: TruthfulQA, HallucinationEval, and RealToxicityPrompts.

Authors

Person: Not available
Organization: Frontiers
Survey and analysis of hallucinations in large language models

Sources

Survey and analysis of hallucinations in large language models (Frontiers, www.frontiersin.org)
