Fact — reference — Knowledge Tree

The Hallucinations Leaderboard evaluates hallucination detection using two tasks: SelfCheckGPT, which checks for self-consistency in model answers, and HaluEval, which checks for faithfulness hallucinations in QA, Dialog, and Summarisation tasks relative to a knowledge snippet.

Authors

Person: Not available Organization: Hugging Face
The Hallucinations Leaderboard, an Open Effort to Measure ...

Sources

The Hallucinations Leaderboard, an Open Effort to Measure ... huggingface.co Hugging Face via serper

Referenced by nodes (2)

Hallucination Leaderboard concept
HaluEval concept