reference
Perez et al. (2022) proposed a method for red teaming language models by using other language models to generate adversarial prompts.

Authors

Sources

Referenced by nodes (1)