claim
Reinforcement schedules in LLMs, such as variable ratio or interval rewards, may unintentionally condition users to engage compulsively, which creates a risk of manipulative design.
Authors
Sources
- A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept