reference
Liao et al. (2024) developed an automatic interactive evaluation method for Large Language Models that utilizes a state-aware patient simulator.
Authors
Sources
- A Comprehensive Benchmark and Evaluation Framework for Multi ... arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept