Fact — reference — Knowledge Tree

Perez et al. (2022) proposed a method for red teaming language models by using other language models to generate adversarial prompts.

Authors

Person: Not available Organization: arXiv
Building Trustworthy NeuroSymbolic AI Systems - arXiv

Sources

Building Trustworthy NeuroSymbolic AI Systems - arXiv arxiv.org arXiv via serper

Referenced by nodes (1)

Language Model concept