reference
The FEVER (Fact Extraction and VERification) dataset is a benchmark for assessing a model's ability to check the veracity of statements, where each instance includes a claim and a label of SUPPORTS, REFUTES, or NOT ENOUGH INFO, evaluated in a 16-shot setting.
Authors
Sources
- The Hallucinations Leaderboard, an Open Effort to Measure ... huggingface.co via serper
Referenced by nodes (1)
- fever concept