claim
The Vectara hallucination leaderboard authors chose to evaluate hallucination rates in summarization tasks rather than attempting to determine if a response was hallucinated without a reference source, because the latter would require training a model as large or larger than the LLMs being evaluated.
Authors
Sources
- vectara/hallucination-leaderboard - GitHub github.com via serper
Referenced by nodes (2)
- Large Language Models concept
- Vectara entity