reference
He et al. (2024a) introduced a unified theoretical framework for watermarking Large Language Models that jointly optimizes the watermarking scheme and its detector. The framework reveals a fundamental trade-off between watermark detectability, measured by the detector's Type-II error, and distortion of the generated text.
Sources
- A Survey on the Theory and Mechanism of Large Language Models (arxiv.org)
Referenced by nodes (2)
- Large Language Models concept
- watermarking concept