← Back to Leaderboard
AI ToolsTOOL
About
A lightweight 64M parameter GPT model that can be trained from scratch in just 2 hours on a single RTX 3090. Provides complete training pipeline including pretraining, SFT, LoRA, RLHF, and tool use capabilities.
Tags
llmgptpytorchtrainingopen-sourcelightweighteducational
Tech Stack
Python
Comments
No comments yet.