measurement
The KGHaluBench entity-level filter achieved its highest F1 score of 78.07% at a threshold of 0.700, with an overall agreement of 77.98%.
Authors
Sources
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... arxiv.org via serper
Referenced by nodes (2)
- KGHaluBench concept
- entity-level filtering concept