perspective
Evaluation of hallucinations in large language models is fragmented: no standard protocol spans tasks or domains, which hinders cross-model comparison and limits how well results generalize.
Sources
- Survey and analysis of hallucinations in large language models (www.frontiersin.org)