claim
DoorDash uses an LLM Judge to monitor chatbot performance by assessing five metrics: retrieval correctness, response accuracy, grammar and language accuracy, coherence to context, and relevance to the Dasher's request.
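
As an illustration of how a five-metric judge of this kind could be wired up, here is a minimal Python sketch. It is not DoorDash's implementation: the snake_case metric keys, the pass/fail scale, the prompt wording, and the `call_llm` callable are all assumptions made for the example. Only the five metric names come from the claim.

```python
import json
from typing import Callable, Dict, List

# The five metrics named in the claim; the snake_case keys are illustrative.
METRICS = [
    "retrieval_correctness",
    "response_accuracy",
    "grammar_and_language_accuracy",
    "coherence_to_context",
    "relevance_to_request",
]


def build_judge_prompt(request: str, retrieved: str, response: str) -> str:
    """Ask the judge for one verdict per metric, returned as JSON.

    The pass/fail (boolean) scale is an assumption, not from the source.
    """
    metric_list = ", ".join(METRICS)
    return (
        "You are grading a support chatbot reply.\n"
        f"Dasher request: {request}\n"
        f"Retrieved context: {retrieved}\n"
        f"Chatbot response: {response}\n"
        f"Return a JSON object with a boolean for each of: {metric_list}."
    )


def judge_response(
    call_llm: Callable[[str], str],  # hypothetical: any function mapping prompt -> text
    request: str,
    retrieved: str,
    response: str,
) -> Dict[str, bool]:
    """Run the judge once; metrics missing from its reply count as failed."""
    raw = call_llm(build_judge_prompt(request, retrieved, response))
    verdicts = json.loads(raw)
    return {m: bool(verdicts.get(m, False)) for m in METRICS}


def pass_rate(results: List[Dict[str, bool]]) -> Dict[str, float]:
    """Fraction of judged replies that passed each metric (for monitoring)."""
    if not results:
        return {m: 0.0 for m in METRICS}
    return {m: sum(r[m] for r in results) / len(results) for m in METRICS}
```

Passing the model call in as a plain function keeps the sketch independent of any particular LLM client, and aggregating per-metric pass rates over many judged replies is one simple way to turn individual verdicts into a monitoring signal.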
