procedure
The KGHaluBench automated verification pipeline detects abstentions and verifies LLM responses at both conceptual and correctness levels to identify different types of hallucinations.
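The staged checks described above can be sketched as follows. This is a minimal illustration, not the KGHaluBench implementation. The stage order, the keyword-based abstention detector, the substring matching, the `min_fact_ratio` threshold, and every name (`Verdict`, `Reference`, `verify`, `ABSTENTION_MARKERS`) are assumptions made for the example.

```python
from dataclasses import dataclass
from enum import Enum


class Verdict(Enum):
    ABSTENTION = "abstention"
    CONCEPTUAL_HALLUCINATION = "conceptual_hallucination"
    CORRECTNESS_HALLUCINATION = "correctness_hallucination"
    PASS = "pass"


# Hypothetical abstention phrases; the real detector is not described in this node.
ABSTENTION_MARKERS = ("i don't know", "i do not know", "i'm not sure", "cannot answer")


@dataclass
class Reference:
    """Hypothetical reference data derived from a knowledge-graph entity."""
    concept_terms: list[str]  # terms showing the response is about the right concept
    facts: list[str]          # facts a correct response is expected to state


def verify(response: str, ref: Reference, min_fact_ratio: float = 0.5) -> Verdict:
    text = response.lower()

    # Stage 1 (assumed to run first): detect abstention via simple phrase matching.
    if any(marker in text for marker in ABSTENTION_MARKERS):
        return Verdict.ABSTENTION

    # Stage 2: conceptual check. The response must mention at least one concept term.
    if not any(term.lower() in text for term in ref.concept_terms):
        return Verdict.CONCEPTUAL_HALLUCINATION

    # Stage 3: correctness check. Enough reference facts must appear in the response.
    matched = sum(fact.lower() in text for fact in ref.facts)
    if ref.facts and matched / len(ref.facts) < min_fact_ratio:
        return Verdict.CORRECTNESS_HALLUCINATION

    return Verdict.PASS


# Example with made-up reference data.
ref = Reference(concept_terms=["physicist"], facts=["1879", "Ulm"])
print(verify("I don't know who that is.", ref))
print(verify("Albert Einstein was a physicist born in 1879 in Ulm.", ref))
```

Returning a separate verdict per stage keeps the hallucination types distinguishable. A real pipeline would likely replace the substring matching with a stronger semantic comparison.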
Sources
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... (aclanthology.org)
- A Knowledge Graph-Based Hallucination Benchmark for Evaluating ... (arxiv.org)
Referenced by nodes (2)
- hallucination concept
- KGHaluBench concept