Fact — claim — Knowledge Tree

The BLEU metric scores zero when there are no shared n-grams or subsequences between a model's generated response and the ground truth, even if the model's answer is semantically correct.

Authors

Person: Not available Organization: arXiv
Detecting and Evaluating Medical Hallucinations in Large Vision ...

Sources

Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org arXiv via serper

Referenced by nodes (1)

BLEU concept