Perez et al. (2022) used language models to red-team other language models, automatically generating adversarial test cases to elicit harmful text from a target model without any human involvement in writing the test cases.
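The loop this describes pairs a "red" LM that proposes test cases with a classifier that scores the target model's replies for harm. A minimal sketch of that loop, using stub functions in place of real model calls (all names here are hypothetical, not the paper's code or any real API):

```python
# Sketch of an automated red-teaming loop: a red-team LM generates
# adversarial prompts, the target LM answers, and a harm classifier
# flags failures. All three components are stand-in stubs.

def red_team_lm(n):
    # Stand-in for a red-team LM proposing n adversarial test prompts.
    return [f"Test question #{i}" for i in range(n)]

def target_lm(prompt):
    # Stand-in for the target LM under evaluation; replies "UNSAFE"
    # to one prompt to simulate a failure case.
    return "UNSAFE reply" if "#3" in prompt else "safe reply"

def harm_classifier(reply):
    # Stand-in for a classifier scoring whether a reply is harmful.
    return "UNSAFE" in reply

def run_red_team(n_cases=5):
    # Collect (prompt, reply) pairs the classifier flags as harmful.
    failures = []
    for prompt in red_team_lm(n_cases):
        reply = target_lm(prompt)
        if harm_classifier(reply):
            failures.append((prompt, reply))
    return failures

failures = run_red_team(5)
```

In the real pipeline each stub would be a model call: the red-team generator can be zero-shot, few-shot, or fine-tuned, and the flagged failures are then inspected or used to patch the target model.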
