VIBE
Education · Open Source · Tool · 19d ago · 8.1k

About

Train a 65M-parameter vision-language model (VLM) from scratch in just 2 hours. A readable, didactic implementation for learning VLM internals.

Tags

vlm, vision-language, training, education, from-scratch

Tech Stack

Python
