reference
Sriyash Poddar et al. introduced 'Personalizing reinforcement learning from human feedback with variational preference learning' in 2024, a method for personalizing reinforcement learning models.
Authors
Sources
- A Survey of Incorporating Psychological Theories in LLMs - arXiv arxiv.org via serper
Referenced by nodes (1)
- reinforcement learning concept