Skip to content

Stagehand Browser CLI

Verified

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.

2,323 downloads
$ Add to .claude/skills/

About This Skill

# Browser Automation

Automate browser interactions using Stagehand CLI with Claude.

First: Environment Selection (Local vs Remote)

  • The skill automatically selects between local and remote browser environments:
  • If Browserbase API keys exist (BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID in .env file): Uses remote Browserbase environment
  • If no Browserbase API keys: Falls back to local Chrome browser
  • No user prompting: The selection happens automatically based on available configuration

Setup (First Time Only)

Check `setup.json` in this directory. If `setupComplete: false`:

```bash npm install # Install dependencies npm link # Create global 'browser' command ```

Commands

All commands work identically in both modes:

```bash browser navigate <url> # Go to URL browser act "<action>" # Natural language action browser extract "<instruction>" ['{}'] # Extract data (optional schema) browser observe "<query>" # Discover elements browser screenshot # Take screenshot browser close # Close browser ```

Quick Example

```bash browser navigate https://example.com browser act "click the Sign In button" browser extract "get the page title" browser close ```

Mode Comparison

| Feature | Local | Browserbase | |---------|-------|-------------| | Speed | Faster | Slightly slower | | Setup | Chrome required | API key required | | Stealth mode | No | Yes | | Proxy/CAPTCHA | No | Yes | | Best for | Development | Production/scraping |

Best Practices

  1. Always navigate first before interacting
  2. View screenshots after each command to verify
  3. Be specific in action descriptions
  4. Close browser when done

Troubleshooting

  • Chrome not found: Install Chrome or use Browserbase mode
  • Action fails: Use `browser observe` to discover available elements
  • Browserbase fails: Verify API key and project ID are set

For detailed examples, see EXAMPLES.md. For API reference, see REFERENCE.md.

Use Cases

  • Automate web browsing, form filling, and button clicking using natural language commands
  • Extract structured data from websites without writing CSS selectors or XPath
  • Take screenshots of web pages for visual documentation or testing
  • Navigate multi-step web workflows like login, search, and checkout processes
  • Discover interactive elements on a page using the observe command

Pros & Cons

Pros

  • +Natural language interface eliminates the need to write browser automation code
  • +Dual mode support — local Chrome for development, Browserbase for production scraping
  • +Simple CLI commands make it easy to integrate into scripts and automation pipelines

Cons

  • -Requires Node.js setup and npm link for global CLI access
  • -Local mode lacks stealth features — may be blocked by anti-bot protections

FAQ

What does Stagehand Browser CLI do?
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.
What platforms support Stagehand Browser CLI?
Stagehand Browser CLI is available on Claude Code, OpenClaw.
What are the use cases for Stagehand Browser CLI?
Automate web browsing, form filling, and button clicking using natural language commands. Extract structured data from websites without writing CSS selectors or XPath. Take screenshots of web pages for visual documentation or testing.

100+ free AI tools

Writing, PDF, image, and developer tools — all in your browser.