macOS Desktop Control Enhanced

Prompts

macOS Desktop Control Enhanced provides system-wide desktop automation on macOS, including screenshot capture, process management, clipboard operations, system information, application control (open/close), and direct mouse, keyboard, and screen interaction. Use this skill when you need to programmatically control or query the macOS desktop environment.

Install

openclaw skills install macos-desktop-control-enhanced

When to Use This Skill

  • You need to capture the screen or a region programmatically.
  • You need to query or control running processes.
  • You need to read or modify the clipboard content.
  • You need to retrieve system information (e.g., battery level, OS version).
  • You need to open, focus, or close applications by bundle ID.
  • You need to move the mouse, click, drag, or simulate keyboard input.
  • You need to integrate macOS desktop automation into larger workflows (e.g., testing, data collection, accessibility).

Overview

macOS Desktop Control Enhanced offers the following core capabilities:

1. Screenshot

  • screenshot([options]) – Capture the entire screen or a specified region; returns image path.

2. Process Management

  • get_front_process() – Returns information about the currently frontmost process.
  • kill_process(pid) – Terminates the process with the given PID.
  • launch_app(bundle_id) – Launches an application identified by its bundle ID.

3. Clipboard

  • get_clipboard() – Retrieves current clipboard text.
  • set_clipboard(text) – Overwrites the clipboard with the given text.

4. System Information

  • get_system_info() – Returns a dictionary with macOS version, battery level, and other relevant system metrics.

5. Application Control

  • focus_app(bundle_id) – Brings the specified application to the foreground.
  • terminate_app(bundle_id) – Force‑closes the application with the given bundle ID.

6. Mouse Control

  • move_mouse(x, y) – Moves the cursor to the given screen coordinates.
  • click(x, y, button='left') – Performs a mouse click at the given coordinates.
  • drag(x1, y1, x2, y2, button='left') – Drags from (x1, y1) to (x2, y2).

7. Keyboard Control

  • type_text(text) – Types the given string using the system keyboard.
  • press_key(key) – Sends a single keyboard key event.

Structure

This skill follows a Capabilities‑Based structure, grouping functionality by core features. Each capability section lists the available functions, their purpose, and brief usage notes.

References

  • references/api_reference.md – Complete function signatures, parameter details, and return values.
  • scripts/example_mouse.py – Sample Python script demonstrating mouse movement and clicking.
  • assets/example_asset.png – Example asset showing a captured screenshot.

License

MIT‑0 (Free to use, modify, and redistribute. No attribution required.)