SkillsBench
AI Agents · Open Source · Tool · by vibeleaderboard.ai · 1mo ago

About

SkillsBench is presented as the first benchmark framework for evaluating AI agent skills, covering 84 diverse tasks and 7 models. It measures how well AI agents perform when equipped with domain-specific skills versus without them, providing structured evaluation across multiple abstraction layers.
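
To make the with-skills versus without-skills comparison concrete, here is a minimal sketch of how such a difference could be computed per model. It is not SkillsBench's actual code or scoring method: the `TaskResult` record, the `pass_rate` and `skill_uplift` helpers, the pass/fail scoring, and the sample data are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskResult:
    """One run of one task by one model (hypothetical record layout)."""
    task_id: str
    model: str
    with_skills: bool  # True if the agent was given domain-specific skills
    passed: bool       # assumed binary pass/fail outcome


def pass_rate(results, model, with_skills):
    """Fraction of runs that passed for a model in one condition."""
    selected = [
        r for r in results
        if r.model == model and r.with_skills == with_skills
    ]
    if not selected:
        raise ValueError(f"no results for {model!r}, with_skills={with_skills}")
    return sum(r.passed for r in selected) / len(selected)


def skill_uplift(results, model):
    """Pass rate with skills minus pass rate without skills."""
    return pass_rate(results, model, True) - pass_rate(results, model, False)


# Made-up illustrative data, not real benchmark results.
results = [
    TaskResult("task-1", "model-a", with_skills=True, passed=True),
    TaskResult("task-2", "model-a", with_skills=True, passed=True),
    TaskResult("task-1", "model-a", with_skills=False, passed=True),
    TaskResult("task-2", "model-a", with_skills=False, passed=False),
]

print(skill_uplift(results, "model-a"))
```

A positive value would mean the model did better with skills on the sampled tasks; a real evaluation would also need to account for task difficulty and run-to-run variance.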

Tags

ai, benchmark, evaluation, agents, skills, performance, testing, framework
