claim
Traditional medical datasets are difficult to use for evaluating Large Vision-Language Models (LVLMs) because they contain short answers or unstructured image reports, whereas LVLM outputs are typically well-ordered long texts.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- Large Vision-Language Models concept