Metadata-Version: 2.4
Name: ptai
Version: 0.16.0
Summary: Autonomous AI pentesting with 200+ tools, exploit chaining, PoC validation, and credential-safe MCP server
Author: 0xSteph
License: MIT
Project-URL: Homepage, https://pentestai.xyz
Project-URL: Repository, https://github.com/0xSteph/pentest-ai
Project-URL: Documentation, https://pentestai.xyz
Project-URL: Issues, https://github.com/0xSteph/pentest-ai/issues
Keywords: pentest,pentesting,security,mcp,ai,cybersecurity,exploit,vulnerability
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Security
Classifier: Topic :: Software Development :: Testing
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: fastmcp>=2.0
Requires-Dist: httpx>=0.27
Requires-Dist: pydantic>=2.0
Requires-Dist: rich>=13.0
Requires-Dist: typer>=0.9
Requires-Dist: aiosqlite>=0.20
Requires-Dist: pyyaml>=6.0
Requires-Dist: python-dotenv>=1.0
Requires-Dist: jinja2>=3.1
Requires-Dist: cryptography>=42.0
Requires-Dist: dnspython>=2.6
Requires-Dist: scapy>=2.5
Requires-Dist: paramiko>=3.4
Requires-Dist: impacket>=0.11
Requires-Dist: bloodhound>=1.7
Requires-Dist: requests>=2.31
Requires-Dist: beautifulsoup4>=4.12
Requires-Dist: aiohttp>=3.9
Requires-Dist: tenacity>=8.2
Requires-Dist: structlog>=24.1
Provides-Extra: dev
Requires-Dist: pytest>=8.0; extra == "dev"
Requires-Dist: pytest-asyncio>=0.23; extra == "dev"
Requires-Dist: pytest-cov>=4.1; extra == "dev"
Requires-Dist: pytest-timeout>=2.3; extra == "dev"
Requires-Dist: ruff>=0.3; extra == "dev"
Requires-Dist: mypy>=1.8; extra == "dev"
Requires-Dist: pre-commit>=3.6; extra == "dev"
Requires-Dist: types-PyYAML>=6.0; extra == "dev"
Requires-Dist: lxml>=5.2; extra == "dev"
Provides-Extra: cloud
Requires-Dist: boto3>=1.34; extra == "cloud"
Requires-Dist: azure-identity>=1.15; extra == "cloud"
Requires-Dist: azure-mgmt-resource>=23.0; extra == "cloud"
Requires-Dist: google-cloud-storage>=2.14; extra == "cloud"
Provides-Extra: litellm
Requires-Dist: litellm>=1.50; extra == "litellm"
Provides-Extra: api
Requires-Dist: fastapi>=0.115; extra == "api"
Requires-Dist: uvicorn[standard]>=0.30; extra == "api"
Requires-Dist: websockets>=13.0; extra == "api"
Provides-Extra: menu
Requires-Dist: questionary>=2.0; extra == "menu"
Provides-Extra: browser
Requires-Dist: playwright>=1.45; extra == "browser"
Provides-Extra: stealth
Requires-Dist: scrapling[all]>=0.4.7; extra == "stealth"
Provides-Extra: tracing
Requires-Dist: opentelemetry-api>=1.27; extra == "tracing"
Requires-Dist: opentelemetry-sdk>=1.27; extra == "tracing"
Requires-Dist: opentelemetry-exporter-otlp-proto-grpc>=1.27; extra == "tracing"
Provides-Extra: all
Requires-Dist: ptai[api,browser,cloud,litellm,menu,stealth,tracing]; extra == "all"
Dynamic: license-file

<div align="center">

<img src="assets/transparentbanner.png" alt="pentest-ai" width="640">

<h1>pentest-ai</h1>

**Autonomous pentests from one command. Real tools, working PoCs, audit-ready reports.**

[![PyPI](https://img.shields.io/pypi/v/ptai?color=red&label=pypi&style=flat-square)](https://pypi.org/project/ptai/)
[![Python](https://img.shields.io/badge/python-3.10%2B-red?style=flat-square)](https://pypi.org/project/ptai/)
[![CI](https://img.shields.io/github/actions/workflow/status/0xSteph/pentest-ai/ci.yml?branch=main&label=CI&color=red&style=flat-square)](https://github.com/0xSteph/pentest-ai/actions/workflows/ci.yml)
[![License](https://img.shields.io/github/license/0xSteph/pentest-ai?color=red&style=flat-square)](LICENSE)
[![Stars](https://img.shields.io/github/stars/0xSteph/pentest-ai?color=red&style=flat-square)](https://github.com/0xSteph/pentest-ai/stargazers)
[![Discord](https://img.shields.io/badge/discord-join-red?style=flat-square&logo=discord&logoColor=white)](https://discord.gg/6weeTAubJw)

[**Website**](https://pentestai.xyz) · [**Install**](#install) · [**Docs**](docs/) · [**Benchmarks**](docs/benchmarks/juice-shop.md) · [**Agents**](https://github.com/0xSteph/pentest-ai-agents) · [**Discord**](https://discord.gg/6weeTAubJw)

</div>

> ⚠️ **Offensive tooling, authorized testing only.** By installing you accept the [AUP](https://pentestai.xyz/aup) and [Terms](https://pentestai.xyz/terms). Full text in [Responsible use ↓](#responsible-use)

Point ptai at a target. It runs recon, logs in, and ties findings into multi-step attack paths. Every finding comes with a working PoC. The report writes itself.

Runs on your laptop. No cloud, no telemetry.

> **What's new in 0.15.1 (2026-05-16).** Three new flags for pointing ptai at real production targets without breaking the engagement rules. `intensity=safe` skips probes that mutate server state (no mass-assignment, no stored XSS, no race conditions). `respect_rate_limits=true` honors HTTP 429 and `Retry-After` instead of hammering through them. `strict_scope=true` refuses any request whose host doesn't match the engagement target, and turns off redirect-following so a 302 to attacker.com can't pull the scan off-target. All three default off — existing 0.14.x behavior is unchanged unless you opt in. There's also a working `bearer` auth flow now: POST your creds, ptai pulls the JWT out of a JSON path you configure, then attaches it to every probe. JWT-style APIs (Juice Shop, crAPI, most modern bug-bounty stacks) actually work. Measured: **63.24% catch rate (43/68 in-scope challenges)** on OWASP Juice Shop v19.2.1 with `intensity=aggressive` + `strict_scope=true`. See [CHANGELOG](CHANGELOG.md#0151--2026-05-16).
>
> Also in 0.14.0 (2026-05-13): 200+ wrapped security tools (wpscan, dalfox, hydra, hashcat, paramspider, ffuf, gobuster, sqlmap, ...) are reachable from Claude Code, Cursor, Codex, and any MCP client via `list_tools`, `run_tool`, `plan_tools`, `ensure_tools_installed`. **No Anthropic API key required.** Pick a tool, install missing ones in one batched prompt, run, get findings.

## See it run

<p align="center">
  <img src="assets/demo.gif" alt="ptai-via-Claude-Code-MCP scanning OWASP Juice Shop: 17 critical findings, 7 attack chains, 264 detection rules generated" width="900">
</p>

One prompt to Claude Code. The MCP server ran ptai's tools against the target, and Claude streamed findings back into the session. Subscription-driven, no API key.

The scan returned **17 critical, 53 high, 107 total findings, 7 confirmed attack chains, and 264 generated detection rules** against a stock OWASP Juice Shop instance. JWT `alg:none` accepted on 8+ protected endpoints, SQLi auth bypass on `/rest/user/login`, UNION-based SQLi on `/rest/products/search`, path-filter bypass via NUL byte, XXE disclosing `/etc/passwd`, file upload polyglot, mass assignment, password reset bypass. Each one has a working PoC.

> Recording is the actual output of `claude -p` against a local OWASP Juice Shop with `pentest-ai` registered as an MCP server. Cast file in [`assets/realdemo.cast`](assets/realdemo.cast); the time-paced re-render used for the GIF is in [`assets/realdemo-paced.cast`](assets/realdemo-paced.cast). Findings are real; inter-line timing was reconstructed for watchability since `claude -p` buffers and dumps in non-interactive mode. A deterministic synthesized fallback ([`assets/demo.tape`](assets/demo.tape) + [`assets/demo.sh`](assets/demo.sh)) is kept for reproducible re-renders.

> **Honesty caveat:** Juice Shop is the most-written-about deliberately-vulnerable app on the internet, so LLMs and probe authors both have a head start. Against a *novel* target the catch rate is whatever the curated probe library actually covers: 60 web probes today, growing each release. The LLM coordinates and reasons about results; it doesn't replace the probes. A private honeypot harness in `tests/honeypot/` measures coverage against bugs we wrote ourselves and is asserted in CI at 10/10 caught (`tests/honeypot/test_mcp_honeypot_e2e.py`). Numbers there are lower than Juice Shop, and that's the point. We publish both. See the full [Juice Shop benchmark vs ZAP / Nuclei / HexStrike](docs/benchmarks/juice-shop.md).

## Install

```bash
pip install ptai
```

### Path 1: Drive it from Claude Code (no API key)

If you already pay for Claude Pro / Max / Team, your subscription IS the LLM. Wire ptai in as an MCP server:

```bash
claude mcp add pentest-ai -- ptai mcp
```

Restart Claude Code, then ask:

> *"Run an authenticated pentest against staging.acme.com. Login is at /login, password is in $APP_PASS."*

> **What hits the network**: ptai's tools and probes execute locally against your target. Your prompts and the tool output that Claude Code reads go through Anthropic's API, same as any Claude Code session. If you need an air-gapped path, see Path 3 (Ollama / on-prem LLM).

Claude Code drives ptai via these MCP tools (49 of them as of 0.14.0):

- `list_tools` / `run_tool`: list and invoke any of 200+ wrapped security tools
- `plan_tools` / `ensure_tools_installed`: get the canonical tool list for an engagement, batched install
- `list_probes` / `run_probe`: 60 SPA-aware probes for OWASP Top 10 bug classes
- `http_request`: raw HTTP under a hard scope guard for novel chains
- `start_engagement` / `get_findings` / `get_attack_chains`: the engagement record
- plus `test_web_app`, `test_active_directory`, `test_cloud`, `test_api_security`, and the rest

### Path 2: Other MCP clients (Cursor, VS Code Copilot, Codex, Claude Desktop)

```bash
ptai setup --mcp
```

Auto-detects every MCP-compatible client you have installed and writes their config files. Restart the client and the same 49 tools are there.

### Path 3: Standalone CLI when you DON'T have an MCP client

If you're using Claude Code, Cursor, Codex, or Claude Desktop, use Path 1 or 2 above and skip this section. **No API key needed there.**

Path 3 is for CI/CD pipelines, scheduled cron jobs, air-gapped terminals, and users without an MCP client. The standalone CLI has no LLM of its own, so you bring one via env var:

```bash
export ANTHROPIC_API_KEY=sk-ant-...   # Claude (best results)
# or
export OPENAI_API_KEY=sk-...          # OpenAI
# or, fully local, no cloud
export OLLAMA_HOST=localhost:11434    # Ollama
# or, any of 300+ models via LiteLLM (OpenRouter, Azure, DeepSeek, Groq, Mistral, ...)
pip install ptai[litellm]

ptai start https://your-target.com
```

#### Spending cap (Path 3 only)

The standalone agent loop drives its own LLM, so runaway loops cost real money. ptai caps spend per engagement at **$10 USD by default**. A normal Sonnet 4.6 web-app sweep with prompt caching finishes well under that; an Opus 4.7 deep run can blow past it.

Change it via env var (no CLI flag — env var is the only knob):

```bash
export PTAI_PRICE_LIMIT=25        # raise to $25
export PTAI_PRICE_LIMIT=0         # unlimited (logs a warning)
unset PTAI_PRICE_LIMIT            # back to the $10 default
```

If the cap fires mid-engagement, the engagement is marked `aborted_cost_limit` and its checkpoint is preserved. Raise the cap and resume from where it stopped:

```bash
export PTAI_PRICE_LIMIT=25
ptai resume <engagement_id>
```

Paths 1 and 2 (MCP) don't use this cap — your AI client (Claude Code, Cursor, etc.) handles its own LLM billing.

### Installing security tools

ptai wraps 200+ external tools. Three ways to get them on the box:

```bash
# 1. Zero-config (recommended). At engagement start, the planner predicts
#    which tools the LLM will need and asks ONCE to install the missing
#    ones. Decline once and the answer persists in
#    ~/.pentest-ai/install-preferences.json.
ptai start https://target.example.com

# 2. Batch install upfront. Skips the engagement-time prompt entirely.
ptai setup --tier core            # ~6 essentials, ~30s
ptai setup --tier recommended     # + fuzzers, crawlers, password tools, ~5m
ptai setup --tier full            # everything, ~30m

# 3. Install specific tools by name.
ptai setup --per-tool wpscan,dalfox,paramspider
ptai setup --wizard               # interactive picker
```

In non-interactive contexts (`PTAI_NON_INTERACTIVE=1` or no TTY) ptai uses what's on PATH and logs (rather than prompts) for anything missing.

<details>
<summary><strong>Other paths</strong>: REST API, MCP composition, HITL teleoperation, cloud workspace, public benchmarks</summary>

### HTTP REST API (for dashboards and integrations)

```bash
pip install ptai[api]
ptai serve --port 8888
```

Endpoints: `/health`, `/version`, `/agents`, `/tools`, `/engagements` (list, detail, findings, chains, detection rules, SARIF export). Write endpoints (`POST /engagements`, `POST /engagements/{id}/abort`) require `Authorization: Bearer $PENTEST_AI_API_TOKEN`. Live event stream at `WS /engagements/{id}/stream`.

### Load other MCP servers as tool sources

Compose with hexstrike or any other MCP-compatible security server. Edit `~/.pentest-ai/mcp_servers.json`:

```json
{
  "servers": [
    {"name": "hexstrike", "command": "python3 hexstrike_mcp.py", "transport": "stdio"}
  ]
}
```

### Take over mid-run (HITL teleoperation)

While an engagement is running, press `Ctrl+C` twice within 600ms to pause the orchestrator and drop into a REPL: `step`, `inspect findings`, `inject <instruction>`, `skip`, `resume`, `abort`. Current LLMs aren't fully autonomous. The operator owns the call when it matters.

### Public benchmarks

Reproducible solve-rate measurements live in [`benchmarks/`](benchmarks/):

```bash
./benchmarks/scripts/run_all.sh   # writes JSON per run + RESULTS.md
```

Spec, harness, results all in git. The full Juice Shop comparison vs ZAP / Nuclei / HexStrike is at [`docs/benchmarks/juice-shop.md`](docs/benchmarks/juice-shop.md). No "98.7% detection rate" claims you can't audit.

### Cloud workspace (Pro / Team / Enterprise)

The CLI is free forever and stores everything locally. If you want engagement history, branded client-ready PDF reports, and team collaboration, link the CLI to an [app.pentestai.xyz](https://app.pentestai.xyz) workspace:

```bash
# Sign up, then Dashboard → API Keys → Generate → copy ptai_...
ptai auth login        # paste the key (hidden prompt)
ptai auth status       # confirm link
# or use an env var for CI:
export PENTESTAI_API_KEY=ptai_...
```

`ptai start` auto-syncs findings to your cloud workspace when authed. No cloud = no calls; integration is silently off unless you log in.

### No LLM at all (interactive launcher)

```bash
ptai menu
```

Numeric category navigation, search (`/term`), tag filtering (`t web`), keyword-based recommendation. Real engagements still go through `ptai start` with full scope confirmation.

</details>

## Why it's different

|  |  |
|---|---|
| 🤖 **LLM-coordinated, not LLM-dependent** | Seventeen agents cover recon, web, API, AD, cloud, mobile, wireless, browser, credentials, privesc, vuln scan, chaining, PoC, detection, report, social engineering, and LLM red team. The LLM runs the phase loop and reasons about results; bug detection is in the curated deterministic probe library. Set no API key and the same probes still run. The LLM coordinates; it doesn't scan. |
| 🔓 **No API key on the MCP path** | Claude Code / Cursor / Codex users drive ptai through MCP using their existing subscription. 200+ tool wrappers and 60 probes are LLM-callable without an Anthropic key. The standalone CLI (`ptai start --agent-mode`) is where the API key matters; that's the Codex-without-MCP, CI, and air-gapped paths. |
| 🔐 **It logs in** | Most scanners die at the login page. This one holds a session, refreshes credentials when they expire, and every downstream tool inherits the cookie. Auth profiles store *references* (env vars, `op://`, Vault paths, AWS Secrets Manager ARNs), never the value. |
| 🧪 **Every finding is proven** | A non-destructive proof of concept runs against the target. No more triaging 40 maybes from a noisy scanner. |
| ⚡ **CI-native** | GitHub Action, severity gates, SARIF output, PR comments. Drop it into your workflow file and it runs on the next PR. |
| 💾 **Runs on your laptop** | MIT licensed, no cloud calls. Runs offline with Ollama. Findings stay on your disk. |

## How it works

```
┌─────────────────────────────────────────────────────────────┐
│                    ptai start <target>                      │
└─────────────────────────────────────────────────────────────┘
                             │
          ┌──────────────────┼──────────────────┐
          ▼                  ▼                  ▼
      ┌────────┐        ┌────────┐        ┌─────────┐
      │ recon  │   →    │  auth  │   →    │   web   │
      └────────┘        └────────┘        └─────────┘
                                               │
          ┌────────────────────────────────────┤
          ▼                                    ▼
      ┌────────┐                          ┌─────────┐
      │   ad   │   ┌──────────────────┐   │ cloud   │
      └────────┘   │  Findings DB     │   └─────────┘
          │        │  (sqlite + evidence)│       │
          └───────▶│  scope-guarded     │◀──────┘
                   │  deduplicated      │
                   └──────────────────┘
                             │
                ┌────────────┼────────────┐
                ▼            ▼            ▼
           ┌──────┐    ┌─────────┐  ┌──────────┐
           │chain │    │validate │  │ detect   │
           └──────┘    └─────────┘  └──────────┘
                             │
                             ▼
                       ┌──────────┐
                       │  report  │   md · html · pdf · SARIF · JUnit
                       └──────────┘
```

Each agent runs with an LLM when you've set a key, or as a deterministic tool loop when you haven't. Either way the phase order is the same.

## Agents

| Agent | Phase | Does |
|---|---|---|
| `recon` | 1 | Port scan, DNS and subdomain enum, service fingerprinting |
| `web` | 2 | Authenticated OWASP Testing Guide v4 pass |
| `api_security` | 2 | OpenAPI/GraphQL/REST surface analysis, OWASP API Top 10 |
| `browser` | 2 | Playwright-driven DOM analysis, XHR capture, security-header grading |
| `ad` | 3 | AD enum, Kerberoasting, BloodHound pathfinding, delegation abuse |
| `cloud` | 4 | AWS, Azure, GCP IAM, misconfig, K8s RBAC, serverless |
| `credential_tester` | 4 | Password spraying, credential stuffing, MFA bypass checks |
| `privesc` | 5 | Local and lateral privilege-escalation advice from collected context |
| `vuln_scanner` | 5 | Cross-cutting vuln aggregation against the findings DB |
| `exploit_chain` | 6 | Correlates findings into multi-step attack paths |
| `poc_validator` | 7 | Non-destructive proof of concept per finding |
| `detection` | 8 | Sigma, SPL, KQL rules for the blue team |
| `report` | 9 | Markdown, HTML, PDF, SARIF, JUnit, compliance maps |
| `llm_redteam` | opt | OWASP LLM Top 10 probes |
| `social_engineer` | opt | Phishing corpus and pretext generation |
| `mobile` | opt | Android/iOS static + dynamic checks |
| `wireless` | opt | Wireless reconnaissance and handshake capture |

## Playbooks

Your methodology as a file. Checked into git. Shared with your team.

```yaml
name: internal-ad-pentest
inputs:
  domain: { required: true, prompt: "AD domain" }
  dc_ip:  { required: true, prompt: "DC IP" }

phases:
  - id: recon
    tools: [nmap, masscan]

  - id: ad-enum
    depends_on: [recon]
    condition: "any_finding(type='open_port', port=445)"
    tools: [enum4linux, ldapsearch, bloodhound-python]

  - id: kerberoast
    requires_finding: { type: ad_user_enumerated }
    tools: [impacket-getuserspns]
    llm_decide: true         # let the LLM skip if context says useless
```

```bash
ptai playbook list                  # show installed playbooks
ptai playbook show web-app-quick    # preview before running
ptai playbook run ./my-ad.yaml      # execute
```

Five playbooks ship built-in. A community catalog is coming.

## Drop it into your CI

```yaml
# .github/workflows/security.yml
name: Security scan
on: [pull_request]

jobs:
  ptai:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: pip install ptai
      - run: |
          ptai start ${{ vars.STAGING_URL }} \
            --ci \
            --fail-on high \
            --sarif pentest.sarif
        env:
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
      - uses: github/codeql-action/upload-sarif@v3
        if: always()
        with:
          sarif_file: pentest.sarif
```

Findings post as a PR comment, SARIF uploads to GitHub Code Scanning, and the build fails on gated severity. **GitLab CI and Jenkins** templates plus advanced options (auth profiles in CI, cost gates, scope files) → [docs/ci-cd.md](docs/ci-cd.md).

## Benchmarks

ptai's design is purpose-built for SPA pentesting with curated probe coverage. On OWASP Juice Shop, the published [4-tool matrix](docs/benchmarks/juice-shop.md) showed:

| Tool | Findings | Critical+High | OWASP Top 10 buckets | FP rate |
|---|---:|---:|---:|---:|
| **ptai 0.13.0** | **88** | **46** | **5** | **0%** |
| ZAP 2.17.0 | 593 | 0 | 1 | 47% |
| Nuclei 3.8.0 | 1 | 0 | 1 | 0% |
| HexStrike v6.0 | 11 | 0 | 1 | — |

n=1 single-rater, single-shot. Methodology + raw artifacts in [`benchmarks/results/2026-05-12/juice-shop/`](benchmarks/results/2026-05-12/juice-shop/). The honest read: ptai is better at SPA web pentests with curated probe coverage. HexStrike is broader (cloud, binary, CTF) and likely beats ptai on traditional crawlable surfaces like WordPress. v5 will widen the comparison.

Recent research context: fully autonomous LLM-pentest agents finish **21–31%** of tasks end-to-end; human-assisted setups reach **64%** (ARTEMIS, DARPA AICC Atlantis, xOffense). ptai is built for the human-assisted regime: the LLM reasons about results, the curated probes detect, and Ctrl+C twice lets the operator take over.

## vs the field

|  | `ptai` | Hexstrike | ZAP | Nuclei | Burp Pro | PentestGPT |
|---|:-:|:-:|:-:|:-:|:-:|:-:|
| LLM-driven via MCP (no API key) | ✓ | ✓ | | | | |
| LLM-synthesized HTTP under scope guard | ✓ | partial | | | | |
| Authenticated scanning via MCP | ✓ | partial | partial | raw HTTP | ✓ | |
| Exploit chaining | ✓ | partial | | | | partial |
| Non-destructive PoC validation | ✓ | | | | partial | |
| Stored injection chains (POST → GET verify) | ✓ | manual | partial | | manual | |
| Curated probes (specialised, not template-driven) | 60 | tool-wrapper-driven | rule-driven | 8000+ templates | manual + scan | — |
| Wrapped CLI security tools | 200+ | 150+ | — | — | — | — |
| Tool install wizard | core/recommended/full + per-tool | — | n/a | n/a | n/a | — |
| Smart install at engagement start | ✓ | | | | | |
| CI-native (SARIF + severity gates) | ✓ | | partial | partial | partial | |
| LLM red team probes | ✓ | | | | | |
| YAML playbooks | ✓ | | | templates | | |
| License | MIT | MIT | Apache-2.0 | MIT | commercial | MIT |

## What's inside

- **17 agents** across recon, web, API security, AD, cloud, mobile, wireless, browser, credential testing, privilege escalation, vuln scanning, exploit chaining, PoC validation, detection, reporting, LLM red team, social engineering
- **60 curated web probes** covering OWASP Top 10 + API Top 10
- **200+ tool wrappers** with auto-install: nmap, masscan, nuclei, ffuf, sqlmap, gobuster, wapiti, nikto, dalfox, xsstrike, wpscan, hydra, hashcat, enum4linux, bloodhound-python, the impacket suite, trufflehog, gitleaks, kube-hunter, trivy, prowler, scout-suite, and more
- **4000+ Nuclei templates** integrated for atomic vulnerability detection
- **49 MCP tools** for LLM-driven engagements, including the 0.14.0 additions (`plan_tools`, `ensure_tools_installed`) that let the outer LLM batch-install tools without an Anthropic API key
- **300+ LLM models** via the LiteLLM provider (Anthropic, OpenAI, Ollama direct; Azure, OpenRouter, DeepSeek, Groq, Mistral, Together AI, Bedrock, Vertex AI, Cohere via LiteLLM)
- **HTTP REST API + WebSocket** surface (`ptai serve`) for non-MCP integrations
- **Local web dashboard** with live engagement view, findings table, attack chain visualization, SARIF export
- **Browser automation agent** with screenshot capture, DOM analysis, network capture, security header grading (Playwright-driven)
- **Human-In-The-Loop teleoperation** (Ctrl+C twice to take over an engagement mid-run)
- **MCP client** capability to load external MCP servers as tool sources
- **Public reproducible benchmark harness** in `benchmarks/`. Numbers, code, raw artifacts, all in git.
- **6 output formats**: Markdown, HTML, PDF, SARIF 2.1.0, JUnit XML, compliance mappings (OWASP, CWE, CVE, CVSS v3.1)
- **1,000+ tests** with CI on Python 3.10, 3.11, 3.12, 3.13
- **MIT licensed**, 100% yours

## Who uses it for what

**AppSec teams.** Wire `ptai` into your CI. Every PR against staging gets an authenticated scan. The build fails on high-severity findings. The fix → retest → confirm loop runs on its own.

**Consultants.** Set up a week-long engagement, point `ptai` at the target list, and spend your time on the parts that need a human: analyzing findings, picking chains to demonstrate, talking to the client. The report writes itself.

**Bug bounty hunters.** Run it over breakfast. Come back to a list of validated findings with PoCs ready to paste into HackerOne.

**Red teamers.** Encode your AD methodology as a YAML playbook. Every new engagement runs it. Same methodology, shared across the team.

**Claude Code / Cursor / Codex users.** Add ptai as an MCP server. Ask your assistant to run a scan in plain English. Your existing subscription pays for the LLM; ptai supplies the tools.

**Developers shipping AI features.** Enable `--enable-llm-redteam` against your chatbot. Get an OWASP LLM Top 10 report in minutes.

## Responsible use

`pentest-ai` is offensive security tooling. It executes real network and host operations against the targets you specify. **You are solely responsible for ensuring you have explicit, written authorization to test every target.**

By installing or running `ptai` you agree to the [Acceptable Use Policy](https://pentestai.xyz/aup) and the [Terms of Service](https://pentestai.xyz/terms). Testing systems you do not own without written authorization may violate the Computer Fraud and Abuse Act, the Computer Misuse Act 1990, GDPR Article 32, and equivalents in your jurisdiction. Misuse is your sole responsibility.

First-run prompts you to confirm AUP acceptance and persists the choice to `~/.pentest-ai/aup-consent.txt`. Set `PENTEST_AI_AUP_ACCEPTED=1` in CI to bypass the prompt non-interactively.

On startup `ptai` loads a scope file. Out-of-scope hosts are refused at tool-invocation time. PoCs are non-destructive by default. Rate limits kick in automatically in stealth mode. Don't be that person.

### Out-of-band callbacks (OAST) — privacy

`ptai` detects blind vulnerability classes (blind SSRF, blind SQLi, blind XXE, blind stored XSS, SSTI, Log4Shell) by emitting payloads that, when fired server-side, ring an out-of-band collaborator. By default, callbacks route to ProjectDiscovery's public `oast.fun` infrastructure.

**What lands on the collaborator and who can read it.** Each engagement generates a fresh RSA-2048 keypair in your local `ptai` process. Interaction payloads (raw HTTP requests, DNS queries, SMTP envelopes received by the collaborator) are AES-CTR-256 encrypted at rest server-side, with the AES key wrapped in RSA-OAEP-SHA256 using your engagement's public key. **Only the holder of the matching private key — your local `ptai` process — can decrypt them.** ProjectDiscovery (or whoever runs the collaborator) cannot read interaction contents. However, **metadata is server-visible**: the fact that an interaction happened, source IP of the calling target, timestamp, and protocol.

**When to self-host.** PortSwigger explicitly forbids public-Burp-Collaborator use in their bug bounty rules of engagement, and large enterprise programs (Meta, Apple, finance) increasingly require that callback infrastructure terminate on tester-controlled hosts. For paid engagements, run your own Interactsh server (Apache-2.0, single Go binary) and point ptai at it:

```bash
ptai start http://target --oast-server https://oast.example.com --oast-token <T>
```

**To disable OAST entirely:**

```bash
ptai start http://target --no-oast
```

Blind-vuln classes will not be detected when OAST is off; in-band detection paths (size delta / SQL error markers / metadata signatures / time-based) still run.

## Ecosystem

| Repo | What |
|---|---|
| [**pentest-ai**](https://github.com/0xSteph/pentest-ai) | This repo. The CLI and MCP server. Python product. |
| [**pentest-ai-agents**](https://github.com/0xSteph/pentest-ai-agents) | Standalone Claude Code subagent markdown files. Optional, runs without this CLI. |

Need shared workspaces, branded PDF reports, SSO, or a managed engagement? The [website](https://pentestai.xyz) has Pro / Team / Enterprise dashboards and a one-shot Launch Engagement option. The OSS tool stays OSS, free forever.

## Community

- **Discord:** [join the server](https://discord.gg/6weeTAubJw). Chat, get help, share findings, lurk.
- **Questions, ideas, feedback:** [GitHub Discussions](https://github.com/0xSteph/pentest-ai/discussions)
- **Bug reports:** [GitHub Issues](https://github.com/0xSteph/pentest-ai/issues)
- **Show and tell:** post the wildest finding `ptai` gave you in [Show and tell](https://github.com/0xSteph/pentest-ai/discussions/categories/show-and-tell)

## Contributing

PRs welcome. Before you submit:

```bash
ruff check . && mypy . && pytest -q
```

See [CONTRIBUTING.md](CONTRIBUTING.md) for the full flow.

## Contributors

<a href="https://github.com/0xSteph/pentest-ai/graphs/contributors">
  <img src="https://contrib.rocks/image?repo=0xSteph/pentest-ai" alt="Contributors" />
</a>

## Star history

<a href="https://star-history.com/#0xSteph/pentest-ai&Date">
  <img src="https://api.star-history.com/svg?repos=0xSteph/pentest-ai&type=Date" alt="Star history chart" width="600">
</a>

## License

MIT. Do whatever you want with it.

<div align="center">

**If `ptai` saved you a Sunday, [star the repo](https://github.com/0xSteph/pentest-ai). It's the only payment I ask for.**

</div>
