claim
The majority of existing benchmarks for evaluating hallucination detection models focus on response-level evaluation.
Authors
Sources
- Knowledge Graphs, Large Language Models, and Hallucinations www.sciencedirect.com via serper
Referenced by nodes (2)
- hallucination detection concept
- benchmarks concept