claim
The 'LLM-as-a-Judge' (LLM-Judges) paradigm, which leverages a powerful large language model to score or rank the outputs of other models, has become a widespread method for evaluating open-ended generation (Gu et al., 2025).
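
As a rough illustration of what "score or rank the outputs of other models" can look like in practice, here is a minimal sketch. It is not taken from Gu et al. (2025). The names `JudgeFn`, `score_output`, `rank_outputs`, and `toy_judge` are hypothetical, the prompt wording and 1 to 10 scale are assumptions, and the judge is modeled as a plain callable so that no particular model API is implied.

```python
from typing import Callable, Sequence

# Hypothetical judge interface: takes a prompt, returns the judge model's raw text reply.
JudgeFn = Callable[[str], str]


def score_output(judge: JudgeFn, task: str, answer: str, scale: int = 10) -> int:
    """Ask the judge for an integer score in [1, scale]; raise if the reply is unusable."""
    prompt = (
        f"Task: {task}\n"
        f"Answer: {answer}\n"
        f"Rate the answer from 1 to {scale}. Reply with the number only."
    )
    reply = judge(prompt).strip()
    score = int(reply)  # raises ValueError if the reply is not a bare integer
    if not 1 <= score <= scale:
        raise ValueError(f"score {score} outside 1..{scale}")
    return score


def rank_outputs(judge: JudgeFn, task: str, answers: Sequence[str]) -> list[str]:
    """Return the answers ordered from highest to lowest judge score."""
    scored = [(score_output(judge, task, a), a) for a in answers]
    return [a for _, a in sorted(scored, key=lambda pair: pair[0], reverse=True)]


def toy_judge(prompt: str) -> str:
    """Stand-in judge for demonstration only: scores by answer length, clamped to 1..10."""
    answer_line = next(
        line for line in prompt.splitlines() if line.startswith("Answer: ")
    )
    return str(min(10, max(1, len(answer_line) - len("Answer: "))))
```

In real use, `toy_judge` would be replaced by a call to an actual judge model, and the strict integer parsing would likely need to tolerate more free-form replies.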
