Fact — measurement — Knowledge Tree

The BLIP family of models achieves an average ROUGE score of 7.35% on the Med-VQA task of the Med-HallMark benchmark.
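For readers unfamiliar with how an averaged ROUGE percentage such as this one is typically produced, the following is a minimal sketch. It assumes the ROUGE-L variant (longest common subsequence, F1 form) with simple whitespace tokenization. The source does not state which ROUGE variant or tokenizer was used, so this may differ from the benchmark's actual scoring. The function names are hypothetical.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate, reference):
    """ROUGE-L F1 between two strings (assumed variant, whitespace tokens)."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return 2 * precision * recall / (precision + recall)


def average_rouge_percent(pairs):
    """Mean ROUGE-L F1 over (candidate, reference) pairs, as a percentage."""
    scores = [rouge_l_f1(c, r) for c, r in pairs]
    return 100.0 * sum(scores) / len(scores) if scores else 0.0
```

Under these assumptions, a model answer is scored against each reference answer and the per-sample scores are averaged and reported as a percentage. A low average indicates little token-sequence overlap between the generated answers and the references.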

Authors

Person: Not available
Organization: arXiv
Detecting and Evaluating Medical Hallucinations in Large Vision ...

Sources

Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org arXiv via serper

Referenced by nodes (1)

ROUGE concept