reference
Sparse autoencoders and attention mapping are techniques for identifying specific combinations of neural activations that correlate with LLM hallucinations.
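
As a rough illustration of the sparse-autoencoder side of this idea, the sketch below encodes a batch of activation vectors into non-negative codes, scores them with a reconstruction-plus-L1 loss, and correlates each code dimension with a binary hallucination label. Everything here is an assumption made for illustration: the function names, the L1 coefficient, the untrained random weights, and the synthetic data are hypothetical and do not come from the source, and no training loop is shown.

```python
import numpy as np


def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """Encode activations into non-negative codes, then reconstruct them."""
    codes = np.maximum(0.0, x @ W_enc + b_enc)  # ReLU keeps codes non-negative
    recon = codes @ W_dec + b_dec
    return codes, recon


def sae_loss(x, codes, recon, l1_coeff=1e-3):
    """Reconstruction error plus an L1 penalty that encourages sparse codes.

    l1_coeff is an illustrative value, not a recommended setting.
    """
    mse = np.mean(np.sum((x - recon) ** 2, axis=1))
    l1 = np.mean(np.sum(np.abs(codes), axis=1))
    return mse + l1_coeff * l1


def feature_label_correlation(codes, labels):
    """Pearson correlation between each code dimension and a binary label."""
    y = labels.astype(float)
    c = codes - codes.mean(axis=0)
    y = y - y.mean()
    denom = np.sqrt((c ** 2).sum(axis=0) * (y ** 2).sum())
    # Dimensions that never vary get correlation 0 instead of NaN.
    return np.divide(c.T @ y, denom, out=np.zeros(codes.shape[1]), where=denom > 0)


# Synthetic stand-in data (hypothetical): 200 activation vectors of width 16,
# with a signal planted in one input dimension for the labelled examples.
rng = np.random.default_rng(0)
n, d, k = 200, 16, 32
labels = rng.integers(0, 2, size=n)
x = rng.normal(size=(n, d))
x[:, 0] += 3.0 * labels

# Untrained random weights; a real autoencoder would be fitted by minimising sae_loss.
W_enc = rng.normal(scale=0.1, size=(d, k))
W_dec = rng.normal(scale=0.1, size=(k, d))
codes, recon = sae_forward(x, W_enc, np.zeros(k), W_dec, np.zeros(d))

loss = sae_loss(x, codes, recon)
corr = feature_label_correlation(codes, labels)
top_features = np.argsort(-np.abs(corr))[:5]  # code dimensions most associated with the label
```

A correlation of this kind only flags candidate features; it does not show that a feature causes the behaviour.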
