Perez et al. (2022) used one language model to red-team another: the red-team model generated the adversarial test cases automatically, so the study could determine whether the target model produced harmful text without humans writing the test cases.
Sources
- Building Trustworthy NeuroSymbolic AI Systems (arXiv, arxiv.org)
Referenced by nodes (1)
- Language Model concept