formula
The authors propose the Joint Attribution Score (JAS) metric to quantify prompt-model interaction effects in LLM hallucinations, defined as JAS = Cov(P, M) / (σP * σM), where σP and σM are the standard deviations of hallucination rates across all prompts and all models, respectively.
Authors
Sources
- Survey and analysis of hallucinations in large language models www.frontiersin.org via serper
Referenced by nodes (1)
- hallucination concept