Computer Use
OSIA uses a 6-tier deterministic hierarchy for computer control, prioritizing APIs and accessibility over vision-based approaches.
Control Hierarchy
1. Deep Links & URL Schemes: steam://, spotify:, direct APIs
2. DOM / Playwright: precise web element interaction via CDP
3. Accessibility APIs: UIA (Windows), AT-SPI (Linux), AX (macOS)
4. Keyboard Navigation: Tab/Enter + UIA focus reading
5. Zoom Vision (2-pass): refined screenshot analysis
6. Simple Vision: last resort, full screenshot analysis
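
The tiered ordering above can be read as a fallback chain: try the most precise method first and move down only when it is unavailable. The following is a minimal sketch of that idea, not OSIA's actual implementation. The `Tier` class, `run_with_fallback`, and the two stand-in handlers are hypothetical names introduced for illustration, and the handlers only return strings rather than controlling anything.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple


@dataclass
class Tier:
    name: str
    # Hypothetical handler: returns a result string, or None if this tier
    # cannot handle the task.
    attempt: Callable[[str], Optional[str]]


def run_with_fallback(task: str, tiers: Sequence[Tier]) -> Tuple[str, str]:
    """Try tiers in priority order; return (tier name, result) of the first success."""
    for tier in tiers:
        try:
            result = tier.attempt(task)
        except Exception:
            # A failing tier is treated as "not applicable"; fall through.
            result = None
        if result is not None:
            return tier.name, result
    raise RuntimeError(f"no tier could handle task: {task!r}")


# Stand-in handlers for illustration only (assumptions, not real integrations).
def deep_link(task: str) -> Optional[str]:
    return "opened via URL scheme" if "spotify" in task else None


def simple_vision(task: str) -> Optional[str]:
    return "located target in full screenshot"


# Only two of the six tiers are shown; the rest would slot in between.
TIERS = [Tier("deep_link", deep_link), Tier("simple_vision", simple_vision)]
```

With these stand-ins, `run_with_fallback("play a song on spotify", TIERS)` is handled by the `deep_link` tier, while a task that mentions no deep-linkable app falls through to `simple_vision`. Because the order of the list is fixed, the same task always reaches the same tier first, which is one plausible reading of "deterministic" here.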
Deterministic Verification
OSIA never claims success without proof. It verifies each action by reading app manifests, checking process states, enumerating windows, and inspecting the DOM. Screenshots are stripped from the conversation history after each turn to prevent context ballooning.
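
As a rough illustration of both ideas, the sketch below pairs an action with an independent check and removes image parts from a message history. Everything here is an assumption made for the example: the `Outcome` type, the `act_and_verify` and `strip_screenshots` helpers, and the message shape (a list of dicts whose `content` is a list of typed parts) are hypothetical, and a file-existence check stands in for the manifest, process, window, and DOM checks described above.

```python
import os
import tempfile
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional


@dataclass
class Outcome:
    success: bool
    evidence: str


def act_and_verify(action: Callable[[], None],
                   check: Callable[[], Optional[str]]) -> Outcome:
    """Run an action, then report success only if an independent check finds evidence."""
    action()
    evidence = check()
    if evidence is None:
        return Outcome(False, "no evidence found")
    return Outcome(True, evidence)


def strip_screenshots(history: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return a copy of the history without image parts; drop messages left empty."""
    cleaned = []
    for message in history:
        parts = [p for p in message["content"] if p.get("type") != "image"]
        if parts:
            cleaned.append({**message, "content": parts})
    return cleaned


# Example: verify a file write by checking the filesystem afterwards.
target = os.path.join(tempfile.mkdtemp(), "note.txt")


def write_note() -> None:
    with open(target, "w", encoding="utf-8") as fh:
        fh.write("hello")


def note_exists() -> Optional[str]:
    return f"file exists: {target}" if os.path.isfile(target) else None


outcome = act_and_verify(write_note, note_exists)
```

The design point is that `check` reads state from a source other than the action's own return value, so a silently failed action cannot be reported as a success. `strip_screenshots` returns a new list and leaves the original history untouched, so a caller can decide when to swap it in.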
48+ Native Tools
File management
System control
Web scraping
API calls
Scheduling
Browser automation
App launching
Window management