Claim
The Hallucinations Leaderboard groups its tasks into seven categories: Closed-book Open-domain QA (NQ Open, TriviaQA, TruthfulQA), Summarisation (XSum, CNN/DM), Reading Comprehension (RACE, SQuADv2), Instruction Following (MemoTrap, IFEval), Fact-Checking (FEVER), Hallucination Detection (FaithDial, True-False, HaluEval), and Self-Consistency (SelfCheckGPT).
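The grouping in the claim can be written down as a simple lookup table. The sketch below is only an illustration of the taxonomy as stated here; the names `TASKS_BY_CATEGORY` and `category_of` are hypothetical and are not part of the leaderboard's own code or any published API.

```python
from typing import Optional

# Category -> tasks, copied from the claim above.
# TASKS_BY_CATEGORY is a hypothetical name used for illustration only.
TASKS_BY_CATEGORY = {
    "Closed-book Open-domain QA": ["NQ Open", "TriviaQA", "TruthfulQA"],
    "Summarisation": ["XSum", "CNN/DM"],
    "Reading Comprehension": ["RACE", "SQuADv2"],
    "Instruction Following": ["MemoTrap", "IFEval"],
    "Fact-Checking": ["FEVER"],
    "Hallucination Detection": ["FaithDial", "True-False", "HaluEval"],
    "Self-Consistency": ["SelfCheckGPT"],
}


def category_of(task: str) -> Optional[str]:
    """Return the category a task is listed under, or None if it is not listed.

    Matching ignores case and surrounding whitespace.
    """
    wanted = task.strip().lower()
    for category, tasks in TASKS_BY_CATEGORY.items():
        if any(wanted == name.lower() for name in tasks):
            return category
    return None
```

For example, `category_of("FEVER")` returns `"Fact-Checking"`, and a task that the claim does not mention returns `None`.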
Sources
- The Hallucinations Leaderboard, an Open Effort to Measure ... (huggingface.co)
Referenced by nodes (11)
- hallucination detection concept
- TruthfulQA concept
- fever concept
- TriviaQA concept
- summarization concept
- RACE concept
- Hallucination Leaderboard concept
- SQuAD concept
- fact-checking concept
- HaluEval concept
- NQ-Open concept