reference
Evaluation metrics for AI systems include Precision, Recall, and F1 scores calculated under cross-examination strategies such as AYS, IDK, Confidence-Based, and IC-IDK.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper