claim
The Hallucinations Leaderboard team uses hierarchical clustering on datasets, metrics, and models to identify performance clusters, specifically grouping models into Mistral 7B-based models, LLaMA 2-based models, and smaller models such as BLOOM 560M and GPT-Neo.
Authors
Sources
- The Hallucinations Leaderboard, an Open Effort to Measure ... huggingface.co via serper
Referenced by nodes (2)
- LLaMA concept
- Hallucination Leaderboard concept