claim
Inter-rater reliability in the study was moderate but sufficient to support the identification of systematic biases and error modalities in the language models' clinical reasoning and text generation.
Authors
Sources
- Medical Hallucination in Foundation Models and Their ... www.medrxiv.org
- Medical Hallucination in Foundation Models and Their Impact on ... www.medrxiv.org
Referenced by nodes (2)
- Language Model concept
- text generation concept