VIBE
← Back to Leaderboard
AI ToolsTOOL
AI ToolsOpen SourceTOOL1mo ago51.5k

About

A lightweight 64M parameter GPT model that can be trained from scratch in just 2 hours on a single RTX 3090. Provides complete training pipeline including pretraining, SFT, LoRA, RLHF, and tool use capabilities.

Tags

llmgptpytorchtrainingopen-sourcelightweighteducational

Tech Stack

Python

Comments

No comments yet.