Windows UI Automation
VerifiedAutomate Windows GUI interactions (mouse, keyboard, windows) using PowerShell. Use when the user needs to simulate user input on the desktop, such as moving the cursor, clicking buttons, typing text in non-web apps, or managing window states.
$ Add to .claude/skills/ About This Skill
# Windows UI Automation
Control the Windows desktop environment programmatically.
Core Capabilities
- Mouse: Move, click (left/right/double), drag.
- Keyboard: Send text, press special keys (Enter, Tab, Alt, etc.).
- Windows: Find, focus, minimize/maximize, and screenshot windows.
Usage Guide
Mouse Control
Use the provided PowerShell script `mouse_control.ps1.txt`:
```powershell # Move to X, Y powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action move -X 500 -Y 500
# Click at current position powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action click
# Right click powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action rightclick ```
Keyboard Control
Use `keyboard_control.ps1.txt`:
```powershell # Type text powershell -File skills/windows-ui-automation/keyboard_control.ps1.txt -Text "Hello World"
# Press Enter powershell -File skills/windows-ui-automation/keyboard_control.ps1.txt -Key "{ENTER}" ```
Window Management
To focus a window by title: ```powershell $wshell = New-Object -ComObject WScript.Shell; $wshell.AppActivate("Notepad") ```
Best Practices
- Safety: Always move the mouse slowly or include delays between actions.
- Verification: Take a screenshot before and after complex UI actions to verify state.
- Coordinates: Remember that coordinates (0,0) are at the top-left of the primary monitor.
Use Cases
- Automate Windows GUI interactions including mouse clicks and keyboard input
- Control Windows applications programmatically for testing and workflow automation
- Simulate user interactions across Windows desktop applications
- Build automated testing workflows for Windows-native software
- Create PowerShell-based automation scripts for repetitive GUI tasks
Pros & Cons
Pros
- +PowerShell-based — no additional software installation required on Windows
- +Covers mouse, keyboard, and window management automation
- +Useful for automating legacy Windows applications without APIs
Cons
- -Windows only — no cross-platform support
- -GUI automation is inherently fragile — breaks when UI layouts change
FAQ
What does Windows UI Automation do?
What platforms support Windows UI Automation?
What are the use cases for Windows UI Automation?
100+ free AI tools
Writing, PDF, image, and developer tools — all in your browser.
Next Step
Use the skill detail page to evaluate fit and install steps. For a direct browser workflow, move into a focused tool route instead of staying in broader support surfaces.