reference
The research paper 'On Robustness and Reliability of Benchmark-Based Evaluation of LLMs' was published as an arXiv preprint (arXiv:2509.04013).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)