claim
Evaluation approaches for large language models are evolving to include natural language inference-based scoring, fact-checking pipelines, and LLM-as-a-judge methodologies, as noted by Liu et al. (2023).

Authors

Sources

Referenced by nodes (2)