claim
LLM-Mini-CEX supports multi-turn interactions, includes key points rubrics, and is expert-validated.
Authors
Sources
- A Comprehensive Benchmark and Evaluation Framework for Multi ... arxiv.org via serper
Referenced by nodes (1)
- multi-turn conversations concept