measurement
Falcon 7B yields the best results on the NQ (8-shot) dataset among models evaluated on the Hallucinations Leaderboard.

Authors

Sources

Referenced by nodes (1)