reference
HaluCheck is a family of 1B–3B parameter LLM detectors aligned via Direct Preference Optimization (DPO) using synthetic hallucinated negatives ranked by grounding difficulty via the MiniCheck method.
Authors
Sources
- EdinburghNLP/awesome-hallucination-detection - GitHub github.com via serper