claim
A study found that the Trustworthy Language Model (TLM) detects incorrect responses more effectively than LLM-as-a-judge or token probability (logprobs) techniques across all major LLM models.

Authors

Sources

Referenced by nodes (2)