← Back to Leaderboard
AI ToolsTOOL
About
OpenAI's vision-language model that predicts the most relevant text snippet for a given image, trained on 400M image-text pairs from the web.
Tags
clipvisionmultimodalopenaiembeddings
Tech Stack
Python
Comments
No comments yet.