Install
openclaw skills install mano-cua
Desktop GUI automation (computer use) via VLA models. Use when the user describes a task in natural language that requires visual screen interaction and no API or CLI exists for the target app. Supports fully on-device local mode and cloud mode.
Requires the mano-cua binary (v1.1.0+ recommended for full feature support).
macOS / Linux (Homebrew):
brew install Mininglamp-AI/tap/mano-cua
# Update to latest version
brew upgrade Mininglamp-AI/tap/mano-cua
Windows:
Download the latest mano-cua-windows.zip from GitHub Releases, extract it, and add the folder to your PATH.
# Run a task
mano-cua run "your task description"
# Run with options (minimize UI panel and set max steps)
mano-cua run "task" --minimize --max-steps 10
# Open a URL in the browser before starting the task
mano-cua run "task" --url "https://example.com"
# Open an app before starting the task
mano-cua run "task" --app "Notes"
# Run in local mode (on-device inference, macOS Apple Silicon only)
mano-cua run "task" --local
# Stop the current running task
mano-cua stop
Run mano-cua --help or mano-cua <command> --help for full flags and options.
Note: Only one task can run at a time per device. To start a new task, first stop the current one with mano-cua stop.
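One way to respect the one-task-per-device limit is to stop whatever may be running before each new task. The following is a minimal sketch, assuming that mano-cua stop is harmless when nothing is running (not confirmed here). The function name run_fresh and the MANO_CUA_BIN variable are hypothetical, introduced only for illustration.

```shell
# Hypothetical wrapper: stop any running task, then start a new one.
# MANO_CUA_BIN is an illustrative override so the wrapper can be dry-run
# (for example MANO_CUA_BIN=echo); it is not a mano-cua setting.
MANO_CUA_BIN="${MANO_CUA_BIN:-mano-cua}"

run_fresh() {
    if [ "$#" -lt 1 ]; then
        echo "usage: run_fresh \"task description\" [mano-cua run flags...]" >&2
        return 2
    fi
    # Assumption: a failing or no-op stop should not block the new task.
    "$MANO_CUA_BIN" stop || true
    # All arguments are passed through unchanged to "mano-cua run".
    "$MANO_CUA_BIN" run "$@"
}
```

For example, run_fresh "Create a new note titled hello world" --app "Notes" would issue mano-cua stop followed by the corresponding mano-cua run command.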
--app vs --url: Use one or the other, not both. --app launches a desktop application by name. --url opens a URL in the default browser. Both bring the target to the foreground before the agent starts.
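When commands are assembled by a script, the one-or-the-other rule can be checked before the CLI is invoked. This is a rough sketch; check_target_flags is a hypothetical helper, and it only recognizes the space-separated flag form shown in this document.

```shell
# Hypothetical pre-flight check: reject an argument list that contains
# both --app and --url. Returns 0 if the list is acceptable, 1 otherwise.
check_target_flags() {
    has_app=0
    has_url=0
    for arg in "$@"; do
        case "$arg" in
            --app) has_app=1 ;;
            --url) has_url=1 ;;
        esac
    done
    if [ "$has_app" -eq 1 ] && [ "$has_url" -eq 1 ]; then
        echo "error: use --app or --url, not both" >&2
        return 1
    fi
    return 0
}
```

A script could then guard each call with something like check_target_flags "$@" && mano-cua run "$@". Note that a task description consisting of exactly the text --app or --url would also be counted as a flag by this simple scan.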
Troubleshooting: If tasks fail unexpectedly or features described below are unavailable, ensure your CLI is up to date: brew upgrade Mininglamp-AI/tap/mano-cua.
mano-cua config --list # Show all settings
mano-cua config --set max-steps 50 # Set default max steps
mano-cua config --set minimize true # Always start with UI panel minimized
mano-cua config --set disable-bash true # Disable shell tool in cloud mode
Local mode runs Mano-P entirely on-device via MLX. No data leaves the machine. Requires macOS with Apple Silicon (M1+). The local model is lightweight (4B), so clarify the user's instruction and add context that the model may not infer on its own. Use --app or --url to set the starting context.
Tips for local mode:
- Vague instructions need specifics: "look up AI news" → "Search for artificial intelligence news and open the first result" with --url "https://www.google.com"
- Tasks requiring domain knowledge need context: "adjust screen brightness" → "Adjust screen brightness to 50% in System Settings > Display" with --app "System Settings"
Setup:
mano-cua check
mano-cua install-sdk
mano-cua install-model
# Optional: use a custom Python environment or model path if dependencies or weights are already available locally
mano-cua config --set python-path /path/to/.venv/bin/python
mano-cua config --set default-model-path /path/to/model-weights
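Because local mode only works on macOS with Apple Silicon, a setup script may want to check the platform before running the three setup commands. The sketch below is one possible guard. The function names and the DRY_RUN variable are hypothetical, and it assumes each mano-cua setup command exits non-zero on failure.

```shell
# Returns 0 only for macOS on Apple Silicon, the platform local mode requires.
# Arguments default to the live values from uname and can be overridden for testing.
is_apple_silicon_mac() {
    os="${1:-$(uname -s)}"
    arch="${2:-$(uname -m)}"
    [ "$os" = "Darwin" ] && [ "$arch" = "arm64" ]
}

# Hypothetical setup wrapper. With DRY_RUN=1 it prints the commands
# instead of running them.
setup_local_mode() {
    if ! is_apple_silicon_mac "$@"; then
        echo "local mode requires macOS on Apple Silicon; use cloud mode instead" >&2
        return 1
    fi
    for step in check install-sdk install-model; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "mano-cua $step"
        else
            # Assumption: a non-zero exit status means the step failed.
            mano-cua "$step" || return 1
        fi
    done
}
```

One caveat: a shell running under Rosetta reports x86_64 from uname -m even on Apple Silicon hardware, so this guard would reject that case.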
Model format: Local mode expects MLX w8a16 quantized weights for optimal performance. If your model is in fp16, convert it first:
python -m mlx_vlm.convert --hf-path /path/to/fp16-model --mlx-path /path/to/output-w8a16 -q --q-bits 8 --dtype float16
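If the conversion is part of a repeatable setup script, it can be skipped when the output directory already exists. This is only a sketch: convert_if_needed and the PYTHON override are hypothetical names, and an existing but incomplete output directory would be skipped as well.

```shell
# Hypothetical helper: convert fp16 weights to MLX w8a16 only if the
# output directory is absent.
convert_if_needed() {
    src="${1:-}"
    dst="${2:-}"
    if [ -z "$src" ] || [ -z "$dst" ]; then
        echo "usage: convert_if_needed /path/to/fp16-model /path/to/output-w8a16" >&2
        return 2
    fi
    if [ -d "$dst" ]; then
        echo "skip: $dst already exists"
        return 0
    fi
    # Same conversion command as above; PYTHON is an illustrative override
    # (for example a virtual-environment interpreter), not a mano-cua setting.
    "${PYTHON:-python}" -m mlx_vlm.convert --hf-path "$src" --mlx-path "$dst" -q --q-bits 8 --dtype float16
}
```

After a successful conversion, the output path can be registered with mano-cua config --set default-model-path as shown in the setup commands.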
Run:
mano-cua run "Search for openai on Google and open the first result" --local --url "https://www.google.com"
mano-cua run "Search for iphone on Xiaohongshu and open the first post" --local --url "https://www.xiaohongshu.com" --minimize --max-steps 15
mano-cua run "Create a new note titled hello world" --local --app "Notes"
# Local mode (recommended for privacy — all inference on-device, no data leaves the machine)
mano-cua run "Search for openai on Google and open the first result" --local --url "https://www.google.com" --minimize
mano-cua run "Create a new note titled hello world" --local --app "Notes"
# Cloud mode
mano-cua run "Open Notes and create a new note titled Meeting Summary"
mano-cua run "Search for AI news in the browser and show the first result" --minimize --max-steps 20
# Cloud mode with --app or --url
mano-cua run "Create a calendar event for Friday 20:00 named Team Meeting" --app "Microsoft Outlook"
mano-cua run "Compare available plans for the AeroAPI" --url "https://www.flightaware.com/"
# Cloud mode — shell tool used for applicable steps to reduce time and improve accuracy (v1.1.0+)
mano-cua run "Create a file called report.txt on the Desktop with the content 'Q2 revenue summary', then mark it with a red tag in Finder"
# Stop the current task (use before starting a new one)
mano-cua stop
At each step, the current screen state is analyzed by a hybrid vision model to decide the next action. The agent performs bounded GUI actions (click, type, scroll, drag) only within the user-specified task scope, visible foreground target, and configured step/session limits. In cloud mode, when certain steps can be accomplished via shell, a shell tool will be invoked to perform the action rather than GUI operations to reduce steps and improve accuracy (requires v1.1.0+). For sensitive or irreversible actions, the agent pauses and prompts the user for explicit confirmation before proceeding.
Hybrid vision model:
The system automatically selects the appropriate model based on task complexity.
In local mode (--local), a local Mano-P model runs on-device via MLX. No network calls for inference. Local mode is only supported on macOS with Apple Silicon.
Structural capability boundaries (what the tool cannot do):
- […] mano-cua config --set disable-bash true.
A small UI panel is displayed in the top-right corner of the screen to track and manage the current session status.
- […] --max-steps, preventing runaway execution.
- […] mano-cua stop.
- Local mode (--local) runs inference entirely on-device with zero network calls; no data ever leaves the machine.
- When --app or --url is specified, the agent's interaction is focused on that specific application or webpage.
- […] task_model.py) for easy auditing.
- […] ~/.myapp_device_id); no secrets are transmitted or stored remotely.
macOS is the primary and most tested platform. Windows adaptation has been completed with full support for GUI automation in cloud mode. Local mode (on-device inference) is only available on macOS with Apple Silicon. Linux support is functional but less tested, so minor issues are expected.