← Back to Leaderboard
AI ToolsAPP
About
A library that runs 70B-parameter LLM inference on a single 4GB GPU through aggressive layer-by-layer memory management.
Tech Stack
Python
Comments
No comments yet.
About
A library that runs 70B-parameter LLM inference on a single 4GB GPU through aggressive layer-by-layer memory management.
Tech Stack
No comments yet.