Fact — reference — Knowledge Tree

The paper 'Inform: mitigating reward hacking in rlhf via information-theoretic reward modeling' (Advances in Neural Information Processing Systems 37, pp. 134387–134429) is cited in section 4.2.2 of 'A Survey on the Theory and Mechanism of Large Language Models'.

Authors

Person: Not available Organization: arXiv
A Survey on the Theory and Mechanism of Large Language Models

Sources

A Survey on the Theory and Mechanism of Large Language Models arxiv.org arXiv via serper

Referenced by nodes (1)

Advances in Neural Information Processing Systems entity