claim
Large Language Models struggle to determine correct answers for temporal reasoning tasks (such as finding the earliest or latest time to adjust a plan) when all data is fed into the model within a single prompt, even when provided with background knowledge.

Authors

Sources

Referenced by nodes (1)