Fact — reference — Knowledge Tree

The paper 'CoT-space: a theoretical framework for internal slow-thinking via reinforcement learning' proposes a framework for internal reasoning (slow-thinking) in models using reinforcement learning.

Authors

Person: Not available Organization: arXiv
A Survey on the Theory and Mechanism of Large Language Models

Sources

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv via serper

Referenced by nodes (1)

reinforcement learning concept