Web + Desktop Automation

v1.0.0

Use when the user wants browser automation, web scraping, form filling, clicking, or desktop GUI automation, including mixed workflows that move between web...

0· 254· 1 versions· 0 current· 0 all-time· Updated 12h ago· MIT-0

by@lurui808

Security Scans

VirusTotalBenign ClawScanBenign Static analysisBenign

Install

openclaw skills install web-desktop-automation

Web + Desktop Automation

Use this skill when a task may involve:

Opening or controlling websites
Reading or extracting page content
Filling forms, clicking buttons, logging in
Downloading or uploading files
Controlling desktop apps with mouse/keyboard
Combining browser steps with local app steps

Core rule

Prefer the simplest reliable path:

If the task can be done in the browser, use browser automation.
If the task needs local apps or OS-level interaction, use desktop automation.
If both are needed, split the job into clear phases and verify after each phase.

Execution strategy

1) Classify the task

Decide which of these applies:

Browser only
Desktop only
Mixed browser + desktop

2) Browser automation

Use browser automation for:

Navigation
Search
Page reading
Form filling
Clicking controls
File upload/download
Logged-in web workflows

Prefer stable selectors and explicit waits. Avoid brittle coordinate-based clicking when browser selectors exist.

3) Desktop automation

Use desktop automation for:

Native apps
Window switching
Copy/paste between apps
File manager operations
UI flows outside the browser

Prefer application/window-aware methods when available. Use image-based or coordinate-based control only when necessary.

4) Mixed workflows

Break the task into phases:

Browser phase
Desktop phase
Browser phase again if needed

After each phase, verify the result before continuing.

Recovery rules

If a step fails:

Re-check the current UI state
Re-locate the target element or window
Try a more stable selector or a different interaction method
If the task risks loss of data or irreversible action, stop and ask the user

Best practices

Prefer deterministic steps over guessing
Avoid rapid blind retries
Capture key state when tasks are long or fragile
Keep flows small and modular
Use scripts for repeated actions
Use scripts/browser_runner.py for Playwright browser automation templates
Use scripts/desktop_runner.py for PyAutoGUI desktop automation templates
Use scripts/mixed_orchestrator.py for browser + desktop handoffs
Put browser-specific patterns in references/browser-workflows.md
Put desktop-specific patterns in references/desktop-workflows.md
Put mixed-flow orchestration examples in references/mixed-flows.md
Put dependency and installation notes in references/dependencies.md
Put a realistic browser-download → desktop-edit → browser-upload flow in references/mixed-example.md
See requirements.txt for a minimal install set
Put dependency and installation notes in references/dependencies.md
Put a realistic browser-download → desktop-edit → browser-upload flow in references/mixed-example.md
Put dependency and installation notes in references/dependencies.md

Version tags

latestvk970m2gmw6kj28w6vyfe9rc5gh83ed4e