procedure
Fine-tuning models with guardrails involves training the model to refuse to answer when uncertain and including rejection examples in the dataset, such as 'I’m not sure about that. Let me check the source before answering.'

Authors

Sources

Referenced by nodes (2)