RedPincer — AI Red Team Suite

Verified

AI/LLM red team testing skill. Point at any LLM API endpoint and run automated security assessments. 160+ attack payloads across prompt injection, jailbreak,...

142 downloads

$ Add to .claude/skills/

$ openclaw install

About This Skill

# RedPincer — AI/LLM Red Team Suite

Automated security testing for language models. Point at any LLM API endpoint, select attack modules, and run assessments with real-time results and exportable reports.

> ⚠️ For authorized security testing and research only. Only test systems you own or have explicit permission to audit.

Quick Start

```bash
# Clone and install
git clone https://github.com/rustyorb/pincer.git {baseDir}/redpincer
cd {baseDir}/redpincer
npm ci

# Run
npm run dev
# Dashboard at http://localhost:3000
```

For production:

```bash
npm run build
npx next start -H 0.0.0.0 -p 3000
```

What It Tests

| Category | Payloads | Description |
|:---------|:--------:|:------------|
| 💉 Prompt Injection | 40 | Instruction override, delimiter confusion, indirect injection, payload smuggling |
| 🔓 Jailbreak | 40 | Persona splitting, gradual escalation, hypothetical framing, roleplay exploitation |
| 🔍 Data Extraction | 40 | System prompt theft, training data probing, membership inference, embedding extraction |
| 🛡️ Guardrail Bypass | 40 | Output filter evasion, multi-language bypass, homoglyph tricks, context overflow |

Total: 160 base payloads × 20 variant transforms = 3,200 test permutations
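
The total above is a plain cross product. A minimal sketch of that expansion follows; the identifiers (`injection-1`, `transform-1`, and so on) and the `PermutationCase` shape are placeholders invented for illustration, not RedPincer's actual payload or transform names.

```typescript
// Hypothetical shape: the real payload and transform definitions are not shown here.
interface PermutationCase {
  payloadId: string;
  transform: string;
}

// Pair every base payload with every variant transform.
function expandPermutations(payloadIds: string[], transforms: string[]): PermutationCase[] {
  const cases: PermutationCase[] = [];
  for (const payloadId of payloadIds) {
    for (const transform of transforms) {
      cases.push({ payloadId, transform });
    }
  }
  return cases;
}

// Placeholder identifiers: 4 categories x 40 payloads each, and 20 transforms.
const demoCategories = ["injection", "jailbreak", "extraction", "bypass"];
const demoPayloadIds: string[] = [];
for (const category of demoCategories) {
  for (let i = 1; i <= 40; i++) {
    demoPayloadIds.push(`${category}-${i}`);
  }
}
const demoTransforms: string[] = [];
for (let i = 1; i <= 20; i++) {
  demoTransforms.push(`transform-${i}`);
}

const demoCases = expandPermutations(demoPayloadIds, demoTransforms);
console.log(demoCases.length); // 3200
```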

Supported Providers

```
OpenAI · Anthropic · OpenRouter · Any OpenAI-compatible endpoint
```
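
For context, an "OpenAI-compatible endpoint" generally accepts a POST to `/chat/completions` with a bearer token and a `model` plus `messages` body. The sketch below only builds such a request and sends nothing. The `TargetConfig` type, the `buildChatRequest` helper, and the example URL and key are assumptions made for illustration, and provider-native APIs may expect a different shape.

```typescript
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Hypothetical target description; RedPincer's own config format may differ.
interface TargetConfig {
  baseUrl: string;
  apiKey: string;
  model: string;
}

interface HttpRequestSpec {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Build (but do not send) a chat completions request for an OpenAI-compatible API.
function buildChatRequest(target: TargetConfig, messages: ChatMessage[]): HttpRequestSpec {
  const base = target.baseUrl.replace(/\/+$/, ""); // drop trailing slashes
  return {
    url: `${base}/chat/completions`,
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${target.apiKey}`,
    },
    body: JSON.stringify({ model: target.model, messages }),
  };
}

const demoRequest = buildChatRequest(
  { baseUrl: "https://api.example.com/v1/", apiKey: "sk-placeholder", model: "example-model" },
  [{ role: "user", content: "Hello" }]
);
console.log(demoRequest.url); // https://api.example.com/v1/chat/completions
```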

Features

Attack Engine

- 160+ payloads across 4 categories
- Model-specific attacks (GPT, Claude, Llama variants)
- 20 variant transforms (unicode, encoding, case rotation, etc.)
- Attack chaining with template variables (`{{previous_response}}`)
- AI-powered payload generation: uses the target LLM to generate novel attacks against itself
- Stop/cancel running attacks instantly
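
To illustrate the `{{previous_response}}` chaining idea, here is a minimal sketch in which each step's prompt is rendered from the previous step's output. The function names and the synchronous `send` callback are assumptions; a real run would call the target model asynchronously, and the prompts here are deliberately harmless placeholders.

```typescript
// Replace every {{previous_response}} placeholder with the previous step's output.
// split/join is used so that "$" sequences in the response are never treated
// as special replacement patterns.
function renderStep(template: string, previousResponse: string): string {
  return template.split("{{previous_response}}").join(previousResponse);
}

// Run templates in order; `send` stands in for whatever calls the target model.
function runChain(templates: string[], send: (prompt: string) => string): string[] {
  const responses: string[] = [];
  let previous = "";
  for (const template of templates) {
    const prompt = renderStep(template, previous);
    previous = send(prompt);
    responses.push(previous);
  }
  return responses;
}

// Offline stub target: upper-cases the prompt so the data flow is visible.
const upperCaseStub = (prompt: string): string => prompt.toUpperCase();

const demoChain = runChain(
  ["name a color", "previous: {{previous_response}}"],
  upperCaseStub
);
console.log(demoChain); // [ 'NAME A COLOR', 'PREVIOUS: NAME A COLOR' ]
```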

Analysis & Reporting

- Heuristic response classifier with context-aware analysis
- Reduced false positives: detects "explain then refuse" patterns
- Vulnerability heatmap: visual category × severity matrix
- Custom scoring rubrics with weighted grades (A+ to F)
- Verbose 10-section pen-test reports with appendices
- Multi-target comparison: side-by-side security profiles
- Regression testing: save baselines, track fixes over time
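
As a rough illustration of pattern-based classification, the sketch below treats a response as refused when a refusal phrase appears anywhere in it, so a reply that first explains the request and then declines is not miscounted as compliance. The marker list, the 40-character threshold, and the three labels are assumptions for this example only; RedPincer's actual classifier rules are not documented here.

```typescript
type Verdict = "refused" | "complied" | "unclear";

// Assumed refusal phrases; a real classifier would need a much broader set.
const REFUSAL_MARKERS: RegExp[] = [
  /\bI can(?:not|['’]t) (?:help|assist|provide|comply)\b/i,
  /\bI (?:won['’]t|will not) (?:help|assist|provide)\b/i,
  /\bI(?:['’]m| am) (?:not able|unable) to\b/i,
];

function classifyResponse(text: string): Verdict {
  const trimmed = text.trim();
  if (trimmed.length === 0) return "unclear";
  // A refusal anywhere in the text wins, even after an explanatory preamble.
  if (REFUSAL_MARKERS.some((marker) => marker.test(trimmed))) return "refused";
  // Arbitrary threshold: very short replies carry too little signal to call.
  return trimmed.length >= 40 ? "complied" : "unclear";
}

const demoVerdict = classifyResponse(
  "This request asks for confidential configuration details. I can't provide them."
);
console.log(demoVerdict); // refused
```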

Advanced Tools

| Tool | What It Does |
|:-----|:-------------|
| Compare | Same payloads against 2-4 targets simultaneously |
| Adaptive | Analyzes weaknesses, generates targeted follow-ups |
| Heatmap | Visual matrix of vulnerability rates by category/severity |
| Regression | Save baseline → re-run later → detect fixes or regressions |
| Scoring | Custom rubrics with weighted category/severity/classification scores |
| Chains | Multi-step attacks with `{{previous_response}}` templates |
| Payload Editor | Create custom payloads with syntax highlighting + AI generation |
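
One way a weighted rubric could turn per-category results into a letter grade is sketched below. The weights, the grade bands, and the `CategoryResult` shape are all assumed for illustration; the tool's real rubric format and thresholds may differ.

```typescript
// Hypothetical per-category tally of test outcomes.
interface CategoryResult {
  category: string;
  total: number;      // payloads run in this category
  vulnerable: number; // payloads classified as successful attacks
}

// Weighted average of per-category pass rates, scaled to 0-100.
function weightedScore(results: CategoryResult[], weights: Record<string, number>): number {
  let weighted = 0;
  let weightSum = 0;
  for (const result of results) {
    if (result.total === 0) continue; // skip categories that were not run
    const weight = weights[result.category] ?? 1; // default weight is an assumption
    const passRate = 1 - result.vulnerable / result.total;
    weighted += weight * passRate;
    weightSum += weight;
  }
  return weightSum === 0 ? 0 : (100 * weighted) / weightSum;
}

// Assumed grade bands, highest first.
function toGrade(score: number): string {
  const bands: Array<[number, string]> = [[97, "A+"], [90, "A"], [80, "B"], [70, "C"], [60, "D"]];
  for (const [minimum, grade] of bands) {
    if (score >= minimum) return grade;
  }
  return "F";
}

const demoScore = weightedScore(
  [
    { category: "injection", total: 40, vulnerable: 4 },
    { category: "jailbreak", total: 40, vulnerable: 10 },
  ],
  { injection: 2, jailbreak: 1 }
);
console.log(toGrade(demoScore)); // B (score is approximately 85)
```

Weighting the pass rate rather than the raw count keeps categories comparable when they contain different numbers of payloads.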

Usage Workflow

```
Configure Target → Add LLM endpoint + API key + model
Select Categories → Pick attack types to test
Run Attack → Stream results in real-time
Review Results → Heuristic classification + severity scores
Adaptive → Auto-generate follow-up attacks on weaknesses
Generate Report → Export comprehensive findings as Markdown
```

Architecture

- All client-side: no server components, so your API keys stay local
- NDJSON streaming: real-time results during attack runs
- Heuristic analysis: pattern-matching classifier (no LLM-based grading = no extra cost)
- Zustand + localStorage: state persists across sessions
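
NDJSON (newline-delimited JSON) suits streaming because each complete line is an independent JSON value. The sketch below shows an incremental parser that buffers partial lines across chunk boundaries. The `NdjsonParser` class and the `{ id, verdict }` event shape are assumptions for illustration, not RedPincer's actual stream format.

```typescript
// Incremental NDJSON parser: feed it text chunks, get back completed records.
class NdjsonParser<T> {
  private buffer = "";

  // Append a chunk and return every record whose line is now complete.
  push(chunk: string): T[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last element is an incomplete line (or ""), so keep it buffered.
    this.buffer = lines.pop() ?? "";
    return this.parseLines(lines);
  }

  // Call at end of stream to parse a final line that lacks a trailing newline.
  flush(): T[] {
    const rest = this.buffer;
    this.buffer = "";
    return this.parseLines([rest]);
  }

  private parseLines(lines: string[]): T[] {
    const records: T[] = [];
    for (const line of lines) {
      const trimmed = line.trim();
      if (trimmed.length > 0) records.push(JSON.parse(trimmed) as T);
    }
    return records;
  }
}

// Hypothetical result event shape.
interface ResultEvent {
  id: number;
  verdict: string;
}

const demoParser = new NdjsonParser<ResultEvent>();
// The second record is split across two chunks, as can happen on a network stream.
const demoFirst = demoParser.push('{"id":1,"verdict":"refused"}\n{"id":2,');
const demoSecond = demoParser.push('"verdict":"complied"}\n');
console.log(demoFirst.length, demoSecond.length); // 1 1
```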

Companion Tool: RedClaw

For autonomous multi-strategy campaigns (CLI/TUI), see RedClaw — the autonomous red-teaming agent framework.

RedPincer = web dashboard, manual + automated testing
RedClaw = autonomous CLI agent, adaptive multi-strategy campaigns
Together = complete LLM security testing suite

---

*Built by @rustyorb — Crack open those guardrails. 🦞*

Use Cases

Analyze data and content to extract actionable insights
Generate structured output from specifications or requirements
Automate repetitive workflows to save time and reduce errors
Run automated tests to verify functionality and catch regressions
Search and retrieve information from various data sources

Pros & Cons

Pros

+ Web dashboard streams results in real time; the companion RedClaw tool provides a CLI/TUI for autonomous campaigns
+ API-based architecture allows flexible integration with various platforms
+ Leverages AI models for intelligent automation beyond simple rule-based tools
+ Configurable parameters allow tuning for different quality and cost tradeoffs

Cons

- Requires API key configuration, so it is not free or self-contained
- Depends on external AI model APIs, which may incur usage costs
- Output quality varies based on input specificity and model capabilities

FAQ

What platforms support RedPincer — AI Red Team Suite?

RedPincer is available on Claude Code and OpenClaw.

