VIBE
← Back to Leaderboard
BullshitBench
AI ToolsTOOL
AI ToolsOpen SourceTOOLby vibeleaderboard.ai1mo ago1.4k

About

A benchmark tool that tests whether AI models can detect and challenge nonsensical prompts instead of confidently answering invalid questions. It evaluates models across multiple domains using 100 carefully crafted nonsense questions.

Tags

aibenchmarkevaluationtestingmachine-learningmodel-performancenonsense-detection

Promo Video

Comments

No comments yet.