CLIP
AI Tools · Open Source · Tool · 19d ago · 33.7k

About

OpenAI's vision-language model that predicts the most relevant text snippet for a given image, trained on 400M image-text pairs from the web.
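
As a rough illustration of what "predicting the most relevant text snippet" can mean in practice, the sketch below ranks candidate snippets by cosine similarity to an image embedding and turns the scores into probabilities with a softmax. The embeddings are made-up toy vectors, not outputs of the actual model, and the function name `rank_snippets` and the `scale` value are assumptions chosen for this example; in the real model, image and text encoders would produce the vectors.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def rank_snippets(image_emb, text_embs, scale=100.0):
    """Return one probability per text embedding for a single image embedding.

    `scale` sharpens the softmax; the value used here is illustrative only.
    """
    img = normalize(image_emb)
    logits = []
    for text_emb in text_embs:
        txt = normalize(text_emb)
        logits.append(scale * sum(a * b for a, b in zip(img, txt)))
    # Numerically stable softmax over the similarity scores.
    top = max(logits)
    exps = [math.exp(value - top) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Toy data (hypothetical): three candidate snippets and hand-written embeddings.
snippets = ["a photo of a dog", "a photo of a cat", "a diagram"]
image_emb = [0.9, 0.1, 0.2]
text_embs = [[1.0, 0.0, 0.1], [0.1, 1.0, 0.0], [0.0, 0.2, 1.0]]

probs = rank_snippets(image_emb, text_embs)
best = snippets[max(range(len(probs)), key=probs.__getitem__)]
print(best)
```

With these toy vectors the first snippet wins because its embedding points in nearly the same direction as the image embedding; swapping in real encoder outputs would leave the ranking step unchanged.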

Tags

clip, vision, multimodal, openai, embeddings

Tech Stack

Python
