claim
The authors of the study did not evaluate larger closed-source models like Anthropic's Claude or OpenAI's GPT-4, noting that these systems have undergone extensive fine-tuning and may exhibit different hallucination profiles compared to the models tested.

Authors

Sources

Referenced by nodes (3)