claim
Thorndike's Law of Effect holds that behaviors followed by satisfying outcomes are more likely to recur. The same principle is reflected in reinforcement learning from human feedback (RLHF), where models adapt toward outputs that humans prefer (Lambert et al., 2023).
Sources
- A Survey of Incorporating Psychological Theories in LLMs (arXiv, arxiv.org)
Referenced by nodes (1)
- RLHF concept