← Back to Leaderboard
AI AgentsTOOL
SkillsBench
www.skillsbench.aiAbout
The first benchmark framework for evaluating AI agent skills across 84 diverse tasks and 7 models. It measures how well AI agents perform when equipped with domain-specific skills versus without them, providing structured evaluation across multiple abstraction layers.
Tags
aibenchmarkevaluationagentsskillsperformancetestingframework
Comments
No comments yet.