reference
Comprehensive surveys on LLM safety and trustworthiness include: safety (Shi et al., 2024), trustworthiness (Huang et al., 2024a; d; Liu et al., 2023c), fairness (Li et al., 2023b; Gallegos et al., 2024; Chu et al., 2024), and privacy (Yao et al., 2024b; Yan et al., 2024; Das et al., 2025).
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (3)
- privacy concept
- trustworthiness concept
- fairness concept