measurement
The proposed framework achieves state-of-the-art performance on the GRBench dataset, improving by at least 26.5% over Chain-of-Thought (CoT) baselines.
