measurement
The openai/gpt-4o-2024-08-06 model achieved a hallucination rate of 9.6%, a factual consistency rate of 90.4%, an answer rate of 93.8%, and an average summary length of 86.6 words as of March 20, 2026.
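The two rates reported above sum to 100% (9.6% + 90.4%), which suggests that factual consistency is simply the complement of the hallucination rate. The source does not state this definition, so treat it as an assumption. A minimal sketch of that check, using a hypothetical helper named `complement`:

```python
# Figures copied from the measurement above (percentages).
hallucination_rate = 9.6
factual_consistency_rate = 90.4


def complement(rate_percent: float) -> float:
    """Return 100 minus the given percentage, rounded to one decimal place.

    Assumption (not stated in the source): the two rates are complements.
    """
    return round(100.0 - rate_percent, 1)


# The reported figures are consistent with the complement assumption.
is_consistent = complement(hallucination_rate) == factual_consistency_rate
```

The answer rate (93.8%) and average summary length (86.6 words) are separate measurements and are not derived from either of these two rates.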
