perspective
The LLM-as-a-Judge paradigm relies on the assumptions that large language models can serve as valid human proxies, are capable evaluators, are scalable, and are cost-effective, all of which are being theoretically challenged (Dorner et al., 2025).

Authors

Sources

Referenced by nodes (2)