procedure
Using Datadog's LLM-as-a-Judge involves three steps: (1) defining evaluation prompts that encode application-specific quality standards, (2) supplying your own LLM API key so that evaluations run against your preferred model provider, and (3) automating these evaluations across production traces within LLM Observability to monitor model quality under real-world conditions.
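The three steps can be pictured with a minimal, provider-agnostic sketch. This is not Datadog's API: the names `EVAL_PROMPT`, `Trace`, `Verdict`, `evaluate_traces`, and `pass_rate` are hypothetical, and the `judge` callable stands in for whatever model provider call your own API key authorizes. It only illustrates the shape of the workflow: a prompt that states the quality standard, a model call that returns a verdict, and a loop that applies the check to every trace.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

# Step 1 (assumed wording): an evaluation prompt that encodes the quality standard.
EVAL_PROMPT = (
    "You are grading an assistant's answer.\n"
    "Question: {question}\n"
    "Answer: {answer}\n"
    "Reply with PASS if the answer is on-topic and helpful, otherwise FAIL."
)


@dataclass
class Trace:
    """A simplified stand-in for one production request/response pair."""
    question: str
    answer: str


@dataclass
class Verdict:
    trace: Trace
    passed: bool


def build_prompt(trace: Trace) -> str:
    """Fill the evaluation prompt with the content of one trace."""
    return EVAL_PROMPT.format(question=trace.question, answer=trace.answer)


def evaluate_traces(
    traces: Iterable[Trace], judge: Callable[[str], str]
) -> List[Verdict]:
    """Steps 2 and 3: send each filled prompt to the judge and record the verdict.

    `judge` is a hypothetical callable that takes a prompt and returns the
    model's text reply; in practice it would wrap a provider SDK call made
    with your own API key.
    """
    verdicts = []
    for trace in traces:
        reply = judge(build_prompt(trace))
        verdicts.append(Verdict(trace, reply.strip().upper().startswith("PASS")))
    return verdicts


def pass_rate(verdicts: List[Verdict]) -> float:
    """Fraction of traces the judge marked PASS (0.0 when there are none)."""
    if not verdicts:
        return 0.0
    return sum(v.passed for v in verdicts) / len(verdicts)
```

Passing the judge in as a callable keeps the sketch independent of any particular provider, and lets the loop be exercised with a stub function before a real model is wired in.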
