Skip to content

Windows UI Automation

Verified

Automate Windows GUI interactions (mouse, keyboard, windows) using PowerShell. Use when the user needs to simulate user input on the desktop, such as moving the cursor, clicking buttons, typing text in non-web apps, or managing window states.

3,730 downloads
$ Add to .claude/skills/

About This Skill

# Windows UI Automation

Control the Windows desktop environment programmatically.

Core Capabilities

  • Mouse: Move, click (left/right/double), drag.
  • Keyboard: Send text, press special keys (Enter, Tab, Alt, etc.).
  • Windows: Find, focus, minimize/maximize, and screenshot windows.

Usage Guide

Mouse Control

Use the provided PowerShell script `mouse_control.ps1.txt`:

```powershell # Move to X, Y powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action move -X 500 -Y 500

# Click at current position powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action click

# Right click powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action rightclick ```

Keyboard Control

Use `keyboard_control.ps1.txt`:

```powershell # Type text powershell -File skills/windows-ui-automation/keyboard_control.ps1.txt -Text "Hello World"

# Press Enter powershell -File skills/windows-ui-automation/keyboard_control.ps1.txt -Key "{ENTER}" ```

Window Management

To focus a window by title: ```powershell $wshell = New-Object -ComObject WScript.Shell; $wshell.AppActivate("Notepad") ```

Best Practices

  1. Safety: Always move the mouse slowly or include delays between actions.
  2. Verification: Take a screenshot before and after complex UI actions to verify state.
  3. Coordinates: Remember that coordinates (0,0) are at the top-left of the primary monitor.

Use Cases

  • Automate Windows GUI interactions including mouse clicks and keyboard input
  • Control Windows applications programmatically for testing and workflow automation
  • Simulate user interactions across Windows desktop applications
  • Build automated testing workflows for Windows-native software
  • Create PowerShell-based automation scripts for repetitive GUI tasks

Pros & Cons

Pros

  • +PowerShell-based — no additional software installation required on Windows
  • +Covers mouse, keyboard, and window management automation
  • +Useful for automating legacy Windows applications without APIs

Cons

  • -Windows only — no cross-platform support
  • -GUI automation is inherently fragile — breaks when UI layouts change

FAQ

What does Windows UI Automation do?
Automate Windows GUI interactions (mouse, keyboard, windows) using PowerShell. Use when the user needs to simulate user input on the desktop, such as moving the cursor, clicking buttons, typing text in non-web apps, or managing window states.
What platforms support Windows UI Automation?
Windows UI Automation is available on Claude Code, OpenClaw.
What are the use cases for Windows UI Automation?
Automate Windows GUI interactions including mouse clicks and keyboard input. Control Windows applications programmatically for testing and workflow automation. Simulate user interactions across Windows desktop applications.

100+ free AI tools

Writing, PDF, image, and developer tools — all in your browser.

Next Step

Use the skill detail page to evaluate fit and install steps. For a direct browser workflow, move into a focused tool route instead of staying in broader support surfaces.