Fact — measurement — Knowledge Tree

In ID1 and ID2 scenarios where Large Vision Language Model (LVLM) answers are entirely correct, BertScore values are 66.73% and 46.11% respectively, indicating a significant and unwarranted disparity.

Authors

Person: Not available Organization: arXiv
Detecting and Evaluating Medical Hallucinations in Large Vision ...

Sources

Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org arXiv via serper

Referenced by nodes (2)

Large Vision-Language Models concept
BERTScore concept