measurement
LLaMA 2 exhibits high Prompt Sensitivity (PS), while DeepSeek shows high Model Variability (MV).
Authors
Sources
- Survey and analysis of hallucinations in large language models www.frontiersin.org via serper
Referenced by nodes (2)
- Prompt Sensitivity concept
- Model Variability concept