Relations (1)
related 1.00 — strongly supporting 1 fact
The relationship is established through the context of [1], which highlights the limitations of human involvement in the rapid development and evaluation of artificial intelligence systems.
Facts (1)
Sources
vectara/hallucination-leaderboard - GitHub github.com 1 fact
claimThe creators of the Vectara hallucination leaderboard chose to use a model-based evaluation process rather than human evaluation because human evaluation does not scale sufficiently to allow for constant updates as new APIs and models are released in the fast-moving field of AI.