Claim
Reinforcement schedules in LLM interactions, such as variable-ratio or interval rewards, may unintentionally condition users to engage compulsively, creating a risk of manipulative design.

Authors

Sources

Referenced by nodes (1)