reference
HaluCheck is a family of 1B–3B parameter LLM detectors aligned via Direct Preference Optimization (DPO) using synthetic hallucinated negatives ranked by grounding difficulty via the MiniCheck method.

Authors

Sources

Referenced by nodes (1)