measurement
Using GraphEval in conjunction with state-of-the-art natural language inference (NLI) models improves balanced accuracy on various hallucination benchmarks compared to using raw NLI models alone.
Authors
Sources
- A Knowledge-Graph Based LLM Hallucination Evaluation Framework arxiv.org via serper
Referenced by nodes (2)
- natural language inference (NLI) concept
- GraphEval concept