← Back to Leaderboard
AI ToolsTOOL
Anthropic Evals
https://github.com/anthropics/evalsAbout
Public evaluation suite from Anthropic. Reference tasks and frameworks for benchmarking Claude and other models.
Tags
anthropicevalsbenchmarksllmclaude
Comments
No comments yet.