claim
The radar plot in Figure 4 of the study 'Survey and analysis of hallucinations in large language models' visualizes the comparative performance of DeepSeek, Mistral, and LLaMA 2 across five behavioral dimensions: Factuality, Coherence, Prompt Sensitivity, Model Variability, and Usability.
Authors
Sources
- Survey and analysis of hallucinations in large language models www.frontiersin.org via serper
Referenced by nodes (5)
- Prompt Sensitivity concept
- factuality concept
- coherence concept
- Survey and analysis of hallucinations in large language models: attribution to prompting strategies or model behavior concept
- Model Variability concept