← Back to Leaderboard
Developer ToolsTOOL
About
A specialized benchmarking tool for evaluating Large Language Models on Apple's MLX framework knowledge and coding tasks. It includes 441 questions across different categories and difficulty levels, supports multiple LLM providers (local and cloud), and generates detailed performance reports.
Tags
mlxbenchmarkllmapplemachine-learningclievaluation
Tech Stack
Python
Comments
No comments yet.