Index / tool

Anthropic Evals

https://github.com/anthropics/evals

Visit github.com

Category: AI Tools
Pricing: Open Source
Type: TOOL
Builder: anthropics
GitHub: 415 stars
Added: May 26, 2026

About

Public evaluation suite from Anthropic. Reference tasks and frameworks for benchmarking Claude and other models.

Why it made the leaderboard

Anthropic's public evaluation suite — reference tasks and frameworks for benchmarking Claude and other models, instead of inventing an eval harness from scratch.

Comments (0)

No comments yet

Indexed by a proprietary survey. Corrections welcome.

About

Why it made the leaderboard

Tags

Comments (0)