Stagehand Browser CLI
VerifiedAutomate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.
$ Add to .claude/skills/ About This Skill
# Browser Automation
Automate browser interactions using Stagehand CLI with Claude.
First: Environment Selection (Local vs Remote)
- The skill automatically selects between local and remote browser environments:
- If Browserbase API keys exist (BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID in .env file): Uses remote Browserbase environment
- If no Browserbase API keys: Falls back to local Chrome browser
- No user prompting: The selection happens automatically based on available configuration
Setup (First Time Only)
Check `setup.json` in this directory. If `setupComplete: false`:
```bash npm install # Install dependencies npm link # Create global 'browser' command ```
Commands
All commands work identically in both modes:
```bash browser navigate <url> # Go to URL browser act "<action>" # Natural language action browser extract "<instruction>" ['{}'] # Extract data (optional schema) browser observe "<query>" # Discover elements browser screenshot # Take screenshot browser close # Close browser ```
Quick Example
```bash browser navigate https://example.com browser act "click the Sign In button" browser extract "get the page title" browser close ```
Mode Comparison
| Feature | Local | Browserbase | |---------|-------|-------------| | Speed | Faster | Slightly slower | | Setup | Chrome required | API key required | | Stealth mode | No | Yes | | Proxy/CAPTCHA | No | Yes | | Best for | Development | Production/scraping |
Best Practices
- Always navigate first before interacting
- View screenshots after each command to verify
- Be specific in action descriptions
- Close browser when done
Troubleshooting
- Chrome not found: Install Chrome or use Browserbase mode
- Action fails: Use `browser observe` to discover available elements
- Browserbase fails: Verify API key and project ID are set
For detailed examples, see EXAMPLES.md. For API reference, see REFERENCE.md.
Use Cases
- Automate web browsing, form filling, and button clicking using natural language commands
- Extract structured data from websites without writing CSS selectors or XPath
- Take screenshots of web pages for visual documentation or testing
- Navigate multi-step web workflows like login, search, and checkout processes
- Discover interactive elements on a page using the observe command
Pros & Cons
Pros
- +Natural language interface eliminates the need to write browser automation code
- +Dual mode support — local Chrome for development, Browserbase for production scraping
- +Simple CLI commands make it easy to integrate into scripts and automation pipelines
Cons
- -Requires Node.js setup and npm link for global CLI access
- -Local mode lacks stealth features — may be blocked by anti-bot protections
FAQ
What does Stagehand Browser CLI do?
What platforms support Stagehand Browser CLI?
What are the use cases for Stagehand Browser CLI?
100+ free AI tools
Writing, PDF, image, and developer tools — all in your browser.