measurement
The hybrid fact-checking pipeline developed by Kolli et al. achieves an F1 score of 0.93 on the FEVER benchmark for the Supported/Refuted split without requiring task-specific fine-tuning.

Authors

Sources

Referenced by nodes (1)