← Back to Leaderboard
Developer ToolsTOOL
OpenAI Evals
https://github.com/openai/evalsAbout
OpenAI's framework for evaluating LLMs and LLM systems, with an open-source registry of benchmarks the community can extend.
Tags
llmevaluationbenchmarksopenaitesting
Tech Stack
Python
Comments
No comments yet.