measurement
Evaluation of hallucinations uses the percentage of wrong answers and cases where the model knows it is wrong (Snowballed Hallucinations) as metrics, and utilizes datasets including Primality Testing, Senator Search, and Graph Connectivity.

Authors

Sources

Referenced by nodes (1)