reference
Aman Singh Thakur et al. (2025) evaluated alignment and vulnerabilities in LLMs-as-judges in their paper 'Judging the judges: Evaluating alignment and vulnerabilities in llms-as-judges'.
Authors
Sources
- Re-evaluating Hallucination Detection in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- LLM-as-a-judge concept