Tools
Desktop Automation
Control the desktop via X11 tools (xdotool, wmctrl, xdg-open) for URLs, apps, keys, typing, and clicks.
Executes desktop actions through X11 tooling.
- Path: workers/desktop-automation/
- API (port 8095): /open/url, /launch/app, /window/activate, /key, /type, /click, /intent/run
Examples
- Open our app: intent text “open our app” → opens http://localhost:3000
- Open specific URL: POST /open/url {"url":"https://example.com"}
- Focus window: POST /window/activate {"name_contains":"Firefox"}
- Type text: POST /type {"text":"hello"}
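The requests above could be sent from a script along the following lines. This is a minimal sketch, not the worker's official client: it assumes the worker is reachable at `localhost` on the documented port 8095 and accepts JSON request bodies with a `Content-Type: application/json` header. The helper names `build_request` and `call` are hypothetical, and the response format is not documented here, so the body is returned as raw text.

```python
import json
import urllib.request

# Assumption: the worker runs on this machine; only the port (8095) is documented.
BASE_URL = "http://localhost:8095"


def build_request(path: str, payload: dict, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a JSON POST request for one of the worker's endpoints (hypothetical helper)."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + path,
        data=body,
        # Assumption: the API expects JSON bodies.
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call(path: str, payload: dict, timeout: float = 5.0) -> str:
    """Send the request and return the raw response body as text (hypothetical helper)."""
    with urllib.request.urlopen(build_request(path, payload), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

With the worker running, `call("/open/url", {"url": "https://example.com"})`, `call("/window/activate", {"name_contains": "Firefox"})`, and `call("/type", {"text": "hello"})` would correspond to the three POST examples listed above. Keeping request construction separate from sending makes the payloads easy to inspect without touching the desktop.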
Environment
DISPLAY (X11), DEFAULT_APP_URL, DRY_RUN
Wayland Note
Limited support on pure Wayland; use X11/XWayland for full control.
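A quick way to tell which case applies before relying on the worker is to inspect the session's environment variables. The sketch below is a heuristic, not part of the worker: `session_type` is a hypothetical helper, and it assumes the standard `DISPLAY`, `WAYLAND_DISPLAY`, and `XDG_SESSION_TYPE` variables reflect the running session. A Wayland session that also has `DISPLAY` set is treated as having XWayland available, which is usually but not always true.

```python
import os


def session_type(env=None) -> str:
    """Classify the session as 'x11', 'xwayland', 'wayland', or 'none' (heuristic)."""
    env = os.environ if env is None else env
    has_x = bool(env.get("DISPLAY"))
    has_wayland = bool(env.get("WAYLAND_DISPLAY")) or env.get("XDG_SESSION_TYPE") == "wayland"
    if has_wayland and has_x:
        # Wayland session with an X display set: X11 tools likely reach XWayland windows only.
        return "xwayland"
    if has_wayland:
        # Pure Wayland: expect limited support from X11 tooling.
        return "wayland"
    if has_x:
        return "x11"
    return "none"
```

Calling `session_type()` with no arguments reads the current process environment; passing a dict makes the check easy to exercise in isolation.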