claim
The DiSafety (Meade et al. 2023) and SafeTexT (Levy et al. 2022) datasets are designed to instill safe behavior in language models (LMs) and large language models (LLMs) through supervised learning.
Sources
- Building Trustworthy NeuroSymbolic AI Systems (arXiv, arxiv.org)
Referenced by nodes (3)
- Large Language Models concept
- Language Model concept
- supervised learning concept