measurement
The authors of KGHaluBench evaluated 25 frontier Large Language Models using novel accuracy and hallucination metrics.
Authors
Sources
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... aclanthology.org via serper
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... arxiv.org via serper
Referenced by nodes (2)
- Large Language Models concept
- KGHaluBench concept