reference
Paul F. Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, and Dario Amodei published 'Deep reinforcement learning from human preferences' in Advances in Neural Information Processing Systems in 2017.
Sources
- Unlocking the Potential of Generative AI through Neuro-Symbolic ... (arxiv.org)
- A Survey of Incorporating Psychological Theories in LLMs (arxiv.org)