claim
HellaSwag is a benchmark for evaluating commonsense reasoning in natural language by testing a model's ability to complete sentences coherently and sensibly.

Authors

Sources

Referenced by nodes (2)