claim
Research presented at ACL 2025 evaluates leading AI models, specifically GPT-4o, Gemini-1.5, and Llama-3.2-Vision, in scenarios where a model correctly identifies an object visually in English but hallucinates its properties when generating text in another language.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper
Referenced by nodes (1)
- GPT-4 concept