VIBE
← Back to Leaderboard
Developer ToolsTOOL
Developer ToolsOpen SourceTOOL3h ago37

About

A specialized benchmarking tool for evaluating Large Language Models on Apple's MLX framework knowledge and coding tasks. It includes 441 questions across different categories and difficulty levels, supports multiple LLM providers (local and cloud), and generates detailed performance reports.

Tags

mlxbenchmarkllmapplemachine-learningclievaluation

Tech Stack

Python

Comments

No comments yet.