reference
The study selected the following open-source Large Language Models (LLMs) for evaluation: LLaMA 2 (13B) (Meta AI, 2023), a transformer-based model fine-tuned for dialogue; Mistral 7B instruct, an instruction-tuned model; DeepSeek 67B (DeepSeek AI, 2023), a multilingual model trained on code and web data; OpenChat-3.5 (Openchat Team, 2023), a community-finetuned model derived from LLaMA; and Gwen, an open-access research model emphasizing retrieval-enhanced factual generation.
Authors
Sources
- Survey and analysis of hallucinations in large language models www.frontiersin.org via serper
Referenced by nodes (2)
- Meta entity
- DeepSeek-AI entity