reference
The paper 'Unintentional unalignment: likelihood displacement in direct preference optimization' was presented at the Thirteenth International Conference on Learning Representations (ICLR 2025) and is cited in Section 3.2.3 of 'A Survey on the Theory and Mechanism of Large Language Models'.

