claim
Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, and Goran Radanovic study data corruption robustness for reinforcement learning with human feedback (RLHF) in an offline setting.

Authors

Sources

Referenced by nodes (1)