claim
The datasets DiaSafety (Meade et al. 2023) and SafeText (Levy et al. 2022) are designed to induce safety in language models and large language models through supervised learning.
