reference
In 2024, Sriyash Poddar et al. introduced 'Personalizing reinforcement learning from human feedback with variational preference learning', a method that uses variational preference learning to personalize reinforcement learning from human feedback (RLHF).
