reference
Nathan Lambert, Thomas Krendl Gilbert, and Tom Zick published 'The history and risks of reinforcement learning and human feedback' in 2023.
Authors
Sources
- A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- reinforcement learning concept