← Back to Leaderboard
AI ToolsTOOL
About
Microsoft's screen-parsing model that turns UI screenshots into structured element data — the perception layer for pure-vision GUI agents.
Tags
gui-agentvisionscreen-parsingocrmicrosoft
Tech Stack
Python
Comments
No comments yet.