claim
Cao et al. (2024) introduced a method for enhancing reinforcement learning by utilizing dense rewards derived from a language model critic.
Authors
Sources
- A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- reinforcement learning concept