AI Agents
About
Stanford IRIS Lab's agent scaffolding that hit 76.4% on Terminal-Bench 2.0 with Claude Opus 4.6.
Tags
agent, terminal-bench, claude, benchmark, scaffolding
Tech Stack
Python