claim
The ACC (accuracy) metric measures correctness directly, but it is suitable only for short, fine-grained texts. It fails to intuitively measure the output of a large vision-language model (LVLM) when that output is complex, multi-dimensional, or long.
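A minimal sketch of why this happens, assuming ACC is computed as exact-match accuracy after light normalization. The source does not specify the computation, so the function name `exact_match_accuracy` and all example strings below are hypothetical illustrations, not data from the cited paper.

```python
def exact_match_accuracy(predictions, references):
    """Fraction of predictions equal to their reference (assumed ACC definition)."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not predictions:
        raise ValueError("need at least one example")

    def normalize(text):
        # Lowercase and collapse whitespace so trivial formatting does not count as an error.
        return " ".join(text.lower().split())

    hits = sum(normalize(p) == normalize(r) for p, r in zip(predictions, references))
    return hits / len(predictions)


# Short, closed-form answers: each item is simply right or wrong, so ACC is informative.
short_acc = exact_match_accuracy(
    ["Yes", "no", "left lung"],
    ["yes", "no", "right lung"],
)

# Long free text (made-up strings): a report with one wrong detail and a report
# that is wrong throughout both fail the exact match and receive the same score.
reference = ["No pleural effusion. Heart size is normal. Mild opacity in the left lower lobe."]
mostly_right = ["No pleural effusion. Heart size is normal. Mild opacity in the right lower lobe."]
all_wrong = ["Large pleural effusion. Cardiomegaly. Lungs are clear."]

long_acc_mostly_right = exact_match_accuracy(mostly_right, reference)
long_acc_all_wrong = exact_match_accuracy(all_wrong, reference)
```

Under this assumed definition, the two long outputs are indistinguishable even though one is far closer to the reference, which is the sense in which ACC does not capture quality across multiple dimensions of a long answer.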
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- anterior cingulate cortex concept