reference
Perez et al. (2022) proposed a method for red teaming language models by using other language models to generate adversarial prompts.
Authors
Sources
- Building Trustworthy NeuroSymbolic AI Systems - arXiv arxiv.org via serper
Referenced by nodes (1)
- Language Model concept