claim
Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, and Goran Radanovic study robustness to data corruption in offline reinforcement learning with human feedback (RLHF).
Authors
Sources
- Track: Poster Session 3, AISTATS 2026 (virtual.aistats.org)