Claim
A hallucination evaluation model can serve as a valid proxy for human judges, provided that its scores correlate highly with human raters' judgments.
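The condition in this claim can be checked numerically. Below is a minimal sketch, assuming Pearson correlation as the agreement measure; the source does not say which measure is used. The scores, labels, and the `THRESHOLD` cutoff are hypothetical values invented for illustration, not data from the leaderboard.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical data: per-summary consistency scores from an evaluation model,
# and human labels for the same summaries (1 = consistent, 0 = hallucinated).
model_scores = [0.91, 0.12, 0.78, 0.33, 0.95, 0.08]
human_labels = [1, 0, 1, 0, 1, 0]

# Assumed cutoff for "highly correlated"; the source does not specify one.
THRESHOLD = 0.8

r = pearson(model_scores, human_labels)
is_valid_proxy = r >= THRESHOLD
print(f"Pearson r = {r:.3f}; treat as proxy: {is_valid_proxy}")
```

A rank-based measure such as Spearman's correlation, or an agreement statistic on thresholded labels, could be substituted depending on how the human judgments were collected.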
Authors
Sources
- vectara/hallucination-leaderboard (GitHub, github.com)
Referenced by nodes (1)
- Hallucination Evaluation Model concept