measurement
In a performance comparison on the PRIMATE dataset, the knowledge-powered CREST framework showed an improvement of 6% in PHQ-9 answerability and 21% in BLEURT scores compared to GPT-3.5.
Authors
Sources
- Building Trustworthy NeuroSymbolic AI Systems - arXiv arxiv.org via serper
Referenced by nodes (3)
- CREST framework concept
- GPT 3.5 concept
- PHQ-9 concept