procedure
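In the TruthfulQA task, models are evaluated zero-shot in one of two multiple-choice settings: multi-class (MC1), where exactly one of the provided options is correct, or multi-label (MC2), where several of the provided options may be correct. In both settings the task is to identify the correct answer or answers among the provided options.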
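The two settings can be illustrated with a minimal sketch, assuming each option has already been assigned a log-likelihood by the model. The function names `mc1_score` and `mc2_score` are hypothetical, and the scoring rules shown (top-ranked option for MC1, normalized probability mass on the true options for MC2) are a common convention rather than something stated in the source.

```python
import math


def mc1_score(logprobs, correct_index):
    """Multi-class: 1.0 if the highest-scoring option is the single correct one."""
    best = max(range(len(logprobs)), key=lambda i: logprobs[i])
    return 1.0 if best == correct_index else 0.0


def mc2_score(logprobs, labels):
    """Multi-label: share of probability mass placed on options labeled true (1)."""
    probs = [math.exp(lp) for lp in logprobs]
    true_mass = sum(p for p, y in zip(probs, labels) if y == 1)
    return true_mass / sum(probs)


# Illustrative (made-up) option scores for one question.
example_logprobs = [math.log(0.5), math.log(0.3), math.log(0.2)]

mc1 = mc1_score(example_logprobs, correct_index=0)
mc2 = mc2_score(example_logprobs, labels=[1, 0, 1])
print(mc1, round(mc2, 3))
```

Because no examples are placed in the prompt, the only signal used here is how the model scores each candidate answer, which is what makes the setting zero-shot.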
Sources
- The Hallucinations Leaderboard, an Open Effort to Measure ... (huggingface.co)
Referenced by nodes (1)
- TruthfulQA concept