Hallucinations can be categorized into four attribution types based on Prompt Sensitivity (PS) and Model Variation (MV) scores: Prompt-dominant (high PS, low MV), Model-dominant (low PS, high MV), Mixed-origin (high PS, high MV), and Unclassified/noise (low PS, low MV).
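The four-way split can be sketched as a simple threshold rule. This is a minimal illustration, not a definitive implementation: the source does not say how "high" and "low" are separated, so the function name `attribute_hallucination`, the 0.5 cutoffs, the assumption that scores lie in [0, 1], and the choice to treat a score equal to the cutoff as "high" are all hypothetical.

```python
def attribute_hallucination(ps: float, mv: float,
                            ps_threshold: float = 0.5,
                            mv_threshold: float = 0.5) -> str:
    """Map Prompt Sensitivity (PS) and Model Variation (MV) scores to an attribution type.

    The thresholds are assumed for illustration; the source gives no cutoff values.
    A score equal to its threshold is treated as "high" (also an assumption).
    """
    high_ps = ps >= ps_threshold
    high_mv = mv >= mv_threshold

    if high_ps and not high_mv:
        return "Prompt-dominant"      # high PS, low MV
    if not high_ps and high_mv:
        return "Model-dominant"       # low PS, high MV
    if high_ps and high_mv:
        return "Mixed-origin"         # high PS, high MV
    return "Unclassified/noise"       # low PS, low MV


if __name__ == "__main__":
    # Hypothetical score pairs, one per quadrant.
    for ps, mv in [(0.8, 0.2), (0.1, 0.9), (0.7, 0.7), (0.2, 0.3)]:
        print(ps, mv, attribute_hallucination(ps, mv))
```

In practice the cutoffs would need to be calibrated on the scores actually observed, since a fixed 0.5 may not separate the quadrants meaningfully.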