measurement
In the KGHaluBench pipeline, the LLM Entailment filter runs Llama3.1:8B and resolves 71.39% of facts, with an average verification time of 1.91 seconds.
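As a rough illustration of how the two figures in this measurement could be computed, the sketch below derives a resolution rate and a mean verification time from per-fact records. The `FactCheck` record, the `summarize` helper, and the convention that an unresolved fact carries a verdict of `None` are all hypothetical; the source does not say how KGHaluBench stores results, or whether the average time is taken over all facts or only resolved ones (this sketch assumes all facts).

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class FactCheck:
    """Hypothetical per-fact record; not taken from the KGHaluBench code."""
    verdict: Optional[bool]  # True/False = filter reached a decision, None = unresolved
    seconds: float           # wall-clock time spent verifying this fact


def summarize(checks: List[FactCheck]) -> Tuple[float, float]:
    """Return (percent of facts resolved, mean verification time in seconds).

    Assumption: the mean is taken over every fact sent to the filter,
    resolved or not.
    """
    if not checks:
        raise ValueError("need at least one fact check")
    resolved = sum(1 for c in checks if c.verdict is not None)
    rate = 100.0 * resolved / len(checks)
    mean_seconds = sum(c.seconds for c in checks) / len(checks)
    return rate, mean_seconds


# Made-up example data, only to show the arithmetic.
example = [
    FactCheck(True, 2.0),
    FactCheck(False, 1.0),
    FactCheck(None, 3.0),
    FactCheck(True, 2.0),
]
rate, mean_seconds = summarize(example)
print(f"resolved {rate:.2f}% of facts, {mean_seconds:.2f} s on average")
```

Under this reading, a reported 71.39% would mean that roughly 71 of every 100 facts reaching the filter receive a verdict from it.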
Sources
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... (arxiv.org)
Referenced by nodes (1)
- KGHaluBench concept