claim
The 'LLM-as-a-Judge' (LLM-Judges) paradigm, in which a powerful large language model scores or ranks the outputs of other models, has become a widely used method for evaluating open-ended generation (Gu et al., 2025).
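The scoring and ranking described in this claim can be sketched as follows. This is a minimal illustration, not a method taken from the cited survey: the `JudgeFn` interface, the 1 to 10 scale, the prompt wording, and the `toy_judge` stand-in are all assumptions made for the example. A real setup would replace `toy_judge` with a call to an actual judge model.

```python
import re
from typing import Callable, Dict, List

# Hypothetical judge interface (an assumption for this sketch):
# takes a prompt string and returns the judge model's raw text reply.
JudgeFn = Callable[[str], str]


def build_prompt(question: str, answer: str) -> str:
    """Build a pointwise scoring prompt; the wording is illustrative only."""
    return (
        "Rate the answer to the question on a scale of 1 to 10.\n"
        f"Question: {question}\n"
        f"Answer: {answer}\n"
        "Reply with the number only."
    )


def score_answer(judge: JudgeFn, question: str, answer: str) -> int:
    """Ask the judge for a score and parse the first integer in its reply."""
    reply = judge(build_prompt(question, answer))
    match = re.search(r"\d+", reply)
    if match is None:
        raise ValueError(f"judge reply contains no score: {reply!r}")
    # Clamp to the assumed 1..10 scale in case the judge strays outside it.
    return max(1, min(10, int(match.group())))


def rank_answers(judge: JudgeFn, question: str, answers: Dict[str, str]) -> List[str]:
    """Score each candidate answer and return the names, best first."""
    scores = {name: score_answer(judge, question, text) for name, text in answers.items()}
    return sorted(scores, key=scores.get, reverse=True)


def toy_judge(prompt: str) -> str:
    """Stand-in for a real LLM call: scores by answer word count, for demonstration only."""
    answer = prompt.split("Answer: ", 1)[1].split("\n", 1)[0]
    return str(min(10, len(answer.split())))


if __name__ == "__main__":
    candidates = {
        "model_a": "4",
        "model_b": "The answer is 4 because 2 plus 2 equals 4",
    }
    print(rank_answers(toy_judge, "What is 2+2?", candidates))
```

The toy judge rewards longer answers, which is not a meaningful quality signal; it is there only so the sketch runs without an external model.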
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)
Referenced by nodes (1)
- LLM-as-a-judge concept