claim
Reinforcement learning from knowledge feedback (RLKF) achieves superior factuality in AI models compared to decoding strategies or supervised fine-tuning.
Authors
Sources
- Medical Hallucination in Foundation Models and Their ... www.medrxiv.org via serper
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org via serper
Referenced by nodes (2)
- factuality concept
- supervised fine-tuning concept