perspective
The evaluation landscape for large language model hallucinations is fragmented, lacking a standard protocol across tasks or domains, which hinders cross-model comparison and generalization.

Authors

Sources

Referenced by nodes (1)