Fact — claim — Knowledge Tree

Huang et al. (2024b) observed a 'cliff-like decline' in GPT-4's performance on medium-to-hard problems when tested on novel competition problems released after its training data cut-off, suggesting reliance on memorization rather than algorithmic reasoning.

Authors

Person: Not available Organization: arXiv
A Survey on the Theory and Mechanism of Large Language Models

Sources

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv via serper

Referenced by nodes (2)

GPT-4 concept
memorization concept