reference
Sparse autoencoder and attention-mapping approaches are techniques used to identify specific combinations of neural activations that correlate with LLM hallucinations.
Authors
Sources
- Detecting hallucinations with LLM-as-a-judge: Prompt ... - Datadog www.datadoghq.com via serper
Referenced by nodes (1)
- LLM hallucinations in medicine concept