claim
The METEOR metric fails to directly reflect whether a Large Vision-Language Model's answer aligns with the ground truth, regardless of whether the answer is correct or hallucinatory.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- Large Vision-Language Models concept