reference
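AUROC (area under the receiver operating characteristic curve) and AURAC (area under the rejection-accuracy curve) are used as evaluation metrics on custom open-domain text generation datasets, LLM-generated encyclopedic text, and PopQA.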
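As a rough illustration of the first metric, AUROC can be read as the probability that a randomly chosen positive example (here, a hallucinated output) receives a higher score than a randomly chosen negative one, with ties counted as one half. The sketch below computes it by direct pairwise comparison. The function name `auroc` and the sample scores and labels are hypothetical and are not taken from the source.

```python
def auroc(scores, labels):
    """AUROC via pairwise comparison: the fraction of (positive, negative)
    pairs in which the positive has the higher score; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical uncertainty scores from a detector (higher = more suspicious)
# and made-up ground-truth labels (1 = hallucinated, 0 = correct).
uncertainty = [0.9, 0.8, 0.35, 0.6, 0.1, 0.2]
is_hallucination = [1, 1, 1, 0, 0, 0]

print(auroc(uncertainty, is_hallucination))
```

With this toy data, 8 of the 9 positive/negative pairs are ordered correctly, so the result is 8/9. The pairwise loop is quadratic in the number of examples; for larger evaluations a rank-based computation or a library routine would be the usual choice.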
Sources
- EdinburghNLP/awesome-hallucination-detection (GitHub)
Referenced by nodes (1)
- AUROC concept