reference
The paper 'Revisiting jailbreaking for large language models: a representation engineering perspective' was published in the Proceedings of the 31st International Conference on Computational Linguistics, pp. 3158–3178.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (2)
- Large Language Models concept
- Association for Computational Linguistics entity