claim
Prometheus 2 functions as an 'LLM-as-a-judge' that has been fine-tuned to align with human ratings of LLM responses.
Authors
Sources
- Real-Time Evaluation Models for RAG: Who Detects Hallucinations ... cleanlab.ai via serper
Referenced by nodes (1)
- LLM-as-a-judge concept