claim
LLaVA1.5-7b, LLaVA1.5-13b, and mPLUG-Owl2 exhibit higher precision on the Med-VQA task compared to other models, as reflected in their METEOR and BLEU metric scores.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- BLEU concept