procedure
In coarse-grained multi-dimension Image-Report Generation (IRG) scenarios, Large Vision-Language Model (LVLM) outputs are segmented into sentences and annotated at the sentence level.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (2)
- Large Vision-Language Models concept
- medical image report generation concept