claim
GPT 3.5 is fragile in both safety and instruction following: paraphrased versions of an initial query can disrupt the model's performance.
Authors
Sources
- Building Trustworthy NeuroSymbolic AI Systems (arXiv, arxiv.org)
Referenced by nodes (1)
- GPT 3.5 concept