claim
Traditional medical datasets are difficult to use for evaluating Large Vision-Language Models (LVLMs) because they contain short answers or unstructured image reports, whereas LVLMs typically produce long, well-structured text.
