VIBE
Developer Tools | Open Source | Tool | 19d ago | 18.6k

About

OpenAI's framework for evaluating LLMs and LLM systems, with an open-source registry of benchmarks the community can extend.

Tags

llm, evaluation, benchmarks, openai, testing

Tech Stack

Python
