measurement
As of March 20, 2026, the openai/gpt-oss-120b model recorded a hallucination rate of 14.2%, a factual consistency rate of 85.8%, an answer rate of 99.9%, and an average summary length of 135.2 words on the vectara/hallucination-leaderboard.
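The two rates reported here sum to 100% (14.2 + 85.8), which suggests that the factual consistency rate is the complement of the hallucination rate. The sketch below records the measurement and checks that relationship. The `LeaderboardEntry` class, its field names, and the tolerance are hypothetical and chosen only for illustration; they are not part of the leaderboard's own tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LeaderboardEntry:
    """Hypothetical container for one leaderboard measurement (rates in percent)."""
    model: str
    hallucination_rate: float
    factual_consistency_rate: float
    answer_rate: float
    avg_summary_words: float

    def rates_are_complementary(self, tol: float = 0.05) -> bool:
        # Assumption: factual consistency = 100 - hallucination rate.
        # The tolerance absorbs rounding to one decimal place.
        total = self.hallucination_rate + self.factual_consistency_rate
        return abs(total - 100.0) <= tol


# Values taken from the measurement above (as of March 20, 2026).
entry = LeaderboardEntry(
    model="openai/gpt-oss-120b",
    hallucination_rate=14.2,
    factual_consistency_rate=85.8,
    answer_rate=99.9,
    avg_summary_words=135.2,
)

is_consistent = entry.rates_are_complementary()
```

If the complement assumption holds, only one of the two rates needs to be stored; the other can be derived from it.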
Sources
- vectara/hallucination-leaderboard (GitHub, github.com)
Referenced by nodes (4)
- hallucination rate concept
- answer rate concept
- factual consistency rate concept
- summary length concept