reference
OpenAI Evals is an open-source framework created by OpenAI for systematically evaluating model outputs by defining tests that check outputs against known correct answers or style guidelines.
Authors
Sources
- LLM Observability: How to Monitor AI When It Thinks in Tokens | TTMS ttms.com via serper
Referenced by nodes (1)
- OpenAI entity