Fact — claim — Knowledge Tree

Behavioral psychology concepts, including conditioning, reinforcement schedules, and reward design, are commonly utilized during the post-training and Reinforcement Learning from Human Feedback (RLHF) stages to guide Large Language Model alignment with human preferences.

Authors

Person: Not available Organization: arXiv
A Survey of Incorporating Psychological Theories in LLMs - arXiv

Sources

A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org arXiv via serper

Referenced by nodes (2)

Reinforcement learning from human feedback (RLHF) concept
conditioning concept