claim
In Image-Report Generation (IRG) tasks, mere correctness does not capture an LVLM's judgment of factuality across all dimensions when the model needs to analyze image content from various dimensions.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- medical image report generation concept