Relations (1)

related 1.00 — strongly supporting 1 fact

The relationship is established through the context of [1], which highlights the limitations of human involvement in the rapid development and evaluation of artificial intelligence systems.

Facts (1)

Sources
vectara/hallucination-leaderboard - GitHub github.com Vectara 1 fact
claimThe creators of the Vectara hallucination leaderboard chose to use a model-based evaluation process rather than human evaluation because human evaluation does not scale sufficiently to allow for constant updates as new APIs and models are released in the fast-moving field of AI.