claim
Behavioral psychology concepts, including conditioning, reinforcement schedules, and reward design, are commonly utilized during the post-training and Reinforcement Learning from Human Feedback (RLHF) stages to guide Large Language Model alignment with human preferences.
Authors
Sources
- A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org via serper
Referenced by nodes (2)
- Reinforcement learning from human feedback (RLHF) concept
- conditioning concept