Claim
DoorDash uses an LLM Judge to monitor chatbot performance by assessing five metrics: retrieval correctness, response accuracy, grammar and language accuracy, coherence to context, and relevance to the Dasher's request.
Authors
Sources
- 10 RAG examples and use cases from real companies, Evidently AI (www.evidentlyai.com)
Referenced by nodes (1)
- LLM-as-a-judge concept