claim
Prompts are used to augment labeled data with reasoning chains for supervised fine-tuning (SFT) or in SFT initialization steps before reinforcement learning (RL).
Authors
Sources
- Detecting hallucinations with LLM-as-a-judge: Prompt ... - Datadog www.datadoghq.com via serper
Referenced by nodes (3)
- reinforcement learning concept
- supervised fine-tuning concept
- prompts concept