claim
HellaSwag is a benchmark for evaluating commonsense reasoning in natural language: given a short context, a model must choose the most plausible continuation from four candidate endings.
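The completion task in this claim can be sketched as a multiple-choice scoring loop: each candidate ending is scored against the context, the highest-scoring ending is taken as the prediction, and accuracy is the fraction of items predicted correctly. This is a minimal sketch under stated assumptions, not the benchmark's official evaluation code. The names `pick_ending`, `accuracy`, `overlap_score`, and `ITEMS` are hypothetical, the two items are invented for illustration and are not drawn from the dataset, and the word-overlap scorer is a toy stand-in for a language model's plausibility score.

```python
from typing import Callable, Sequence

Scorer = Callable[[str, str], float]


def _tokens(text: str) -> list[str]:
    # Lowercase, split on whitespace, and strip trailing punctuation.
    return [w.strip(".,!?") for w in text.lower().split()]


def overlap_score(context: str, ending: str) -> float:
    # Toy scorer (assumption): number of ending words that also occur in the
    # context. A real evaluation would use a model's score for the ending.
    ctx = set(_tokens(context))
    return float(sum(w in ctx for w in _tokens(ending)))


def pick_ending(context: str, endings: Sequence[str], score: Scorer) -> int:
    # Return the index of the ending with the highest score.
    scores = [score(context, e) for e in endings]
    return max(range(len(endings)), key=scores.__getitem__)


def accuracy(items: Sequence[dict], score: Scorer) -> float:
    # Fraction of items where the predicted ending matches the gold label.
    correct = sum(
        pick_ending(it["context"], it["endings"], score) == it["label"]
        for it in items
    )
    return correct / len(items)


# Invented examples in a four-ending format (not real HellaSwag items).
ITEMS = [
    {
        "context": "She filled the kettle and put it on the stove.",
        "endings": [
            "She waited for the kettle to boil.",
            "She painted a bicycle purple.",
            "A dog drove away.",
            "Clouds sang loudly.",
        ],
        "label": 0,
    },
    {
        "context": "He opened the umbrella when the rain started.",
        "endings": [
            "He ate lunch.",
            "A dog drove away.",
            "He stayed dry under the umbrella.",
            "Clouds sang loudly.",
        ],
        "label": 2,
    },
]

if __name__ == "__main__":
    print(accuracy(ITEMS, overlap_score))
```

Passing the scorer as a function keeps the selection and accuracy logic independent of any particular model, so the toy scorer can be swapped for a model-based one without changing the rest of the sketch.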
Authors
Sources
- A survey on augmenting knowledge graphs (KGs) with large ... (link.springer.com)
Referenced by nodes (2)
- natural language concept
- common-sense reasoning concept