claim
Some research suggests that systematic biases in LLM-Judges can be partially mitigated through robust prompting with detailed scoring rubrics (Gao et al., 2025).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (1)
- LLM-as-a-judge concept