Fact — reference — Knowledge Tree

HaluCheck is a family of 1B–3B parameter LLM detectors aligned via Direct Preference Optimization (DPO) using synthetic hallucinated negatives ranked by grounding difficulty via the MiniCheck method.

Authors

Person: Not available Organization: GitHub
EdinburghNLP/awesome-hallucination-detection - GitHub

Sources

EdinburghNLP/awesome-hallucination-detection - GitHub github.com GitHub via serper

Referenced by nodes (1)

Direct Preference Optimization (DPO) concept