procedure
The evaluation of Large Language Models' performance in the study involved randomized tests with 1000 iterations for each sample, during which the query was rephrased while keeping instructions unchanged.

Authors

Sources

Referenced by nodes (1)