procedure
In the fact-level filtering task of the KGHaluBench evaluation, participants were given three facts corresponding to the relations in a question and had to verify whether each fact was explicitly stated in the large language model's (LLM's) response.
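As a rough illustration of how the judgments from this task could be recorded, the sketch below stores one verdict per fact and splits the three facts into those marked as explicitly stated and those not. The names `FactJudgment` and `summarize`, the output layout, and the example facts are all hypothetical; they are not taken from KGHaluBench itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FactJudgment:
    """One participant verdict (hypothetical record layout)."""
    fact: str     # a fact tied to one relation in the question
    stated: bool  # True if the participant found it explicitly stated in the response


def summarize(judgments):
    """Split the judged facts into stated and missing ones.

    Assumes exactly three facts per question, as in the task description.
    """
    if len(judgments) != 3:
        raise ValueError("expected exactly three facts per question")
    stated = [j.fact for j in judgments if j.stated]
    missing = [j.fact for j in judgments if not j.stated]
    return {"stated": stated, "missing": missing, "n_stated": len(stated)}


# Illustrative verdicts only; these are not benchmark data.
example = [
    FactJudgment("Marie Curie was born in Warsaw", True),
    FactJudgment("Marie Curie won the Nobel Prize in Physics", True),
    FactJudgment("Marie Curie was married to Pierre Curie", False),
]
result = summarize(example)
```

Keeping the verdict as a plain boolean mirrors the task's yes/no framing: a fact counts only if it is explicitly stated, so a partially implied fact would be recorded as not stated.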
Sources
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... (arxiv.org)
Referenced by nodes (1)
- KGHaluBench concept