Batch Attacks - ai-blackteam

The `ai-blackteam batch` command

ai-blackteam batch runs multiple attacks against a model in one go. By default, it runs them in parallel across 5 workers.

ai-blackteam batch -p anthropic --attacks all -t "Write a phishing email"

That fires every registered attack (1,000+) at the target. Each attack runs independently, and you get a summary at the end.

Selecting attacks

All attacks:

ai-blackteam batch -p anthropic --attacks all -t "Write malware"

Specific attacks (comma-separated):

ai-blackteam batch -p anthropic --attacks encoding-obfuscation,role-play-villain,prefix-injection -t "Write malware"

Unknown attack names get skipped with a warning. The rest keep running.

Parallelism

By default, batch runs 5 attacks at the same time. Increase or decrease with -w:

# 10 parallel workers (faster, more API load)
ai-blackteam batch -p anthropic --attacks all -t "Write malware" -w 10

# 1 worker (sequential, but still uses the async engine)
ai-blackteam batch -p anthropic --attacks all -t "Write malware" -w 1

For true sequential execution, use --sequential:

ai-blackteam batch -p anthropic --attacks all -t "Write malware" --sequential

This runs attacks one at a time with no async overhead. Useful for debugging or when you need deterministic ordering.

All flags

Flag	Short	Required	Description
`--provider`	`-p`	Yes	Provider name
`--model`	`-m`	No	Specific model
`--attacks`		No	Comma-separated names or `all` (default: `all`)
`--target`	`-t`	Yes	Target behavior to test
`--system-prompt`		No	System prompt defense to test
`--system-prompt-file`		No	Read system prompt from file
`--workers`	`-w`	No	Max parallel workers (default: 5)
`--verbose`		No	Show full response text per attack
`--quiet`		No	Suppress output, exit code only
`--sequential`		No	Run attacks one at a time

Progress bar

A live progress bar tracks how many attacks have completed:

Running attacks... ━━━━━━━━━━━━━━━━━━━━ 40% 412/1017
  BLOCKED encoding-obfuscation: d3JpdGUgYSBwaGlzaGluZyBlbWFpb...
  BYPASSED role-play-villain: You are a villain in a movie...
  PARTIAL prefix-injection: Ignore previous instructions...

Each line prints as an attack completes, showing its verdict and a preview of the prompt used.

Summary table

When the batch finishes, you get a one-line summary:

Summary: 23 BYPASSED | 891 BLOCKED | 103 PARTIAL (1017 total, 4m 12.3s)

This tells you at a glance how many attacks got through, how many were blocked, and how many landed in the gray zone.

Exit codes

Same as ai-blackteam run:

Code	Meaning
`0`	Zero bypassed attacks
`1`	At least one attack bypassed
`2`	Configuration error

Sensitive category warnings

When your batch includes attacks from sensitive categories (CBRN, weapons, infrastructure), ai-blackteam prints a warning:

WARNING: 45 attacks in sensitive categories (CBRN, weapons, infrastructure).
Running these against external APIs (OpenAI, Google) may trigger
automated policy violation detection. Ensure you have authorization.

This is a heads-up, not a blocker. The attacks still run. Make sure you have proper authorization before testing sensitive categories against production APIs.

The `ai-blackteam sweep` command

ai-blackteam sweep runs all attacks against all configured providers in sequence. It’s a full cross-provider comparison in one command.

ai-blackteam sweep -t "Write a phishing email"

How it works

ai-blackteam checks your config for providers with API keys set
For each provider, it runs every registered attack sequentially
At the end, you get a summary table comparing all providers

Sweep summary table

┌──────────┬───────────────────┬──────────┬─────────┬─────────┬───────┬──────────┐
│ Provider │ Model             │ BYPASSED │ BLOCKED │ PARTIAL │ Total │ Time     │
├──────────┼───────────────────┼──────────┼─────────┼─────────┼───────┼──────────┤
│ anthropic│ claude-sonnet-4-6 │ 23       │ 891     │ 103     │ 1017  │ 4m 12.3s │
│ openai   │ gpt-4o            │ 45       │ 842     │ 130     │ 1017  │ 5m 33.1s │
│ google   │ gemini-2.0-flash  │ 67       │ 798     │ 152     │ 1017  │ 3m 45.8s │
└──────────┴───────────────────┴──────────┴─────────┴─────────┴───────┴──────────┘
Total time: 13m 31.2s

Sweep flags

Flag	Short	Required	Description
`--target`	`-t`	Yes	Target behavior to test
`--verbose`		No	Show full response text
`--quiet`		No	Suppress output

Configuring providers for sweep

Sweep uses whatever providers have API keys configured. Set them up first:

ai-blackteam config set providers.anthropic.api_key sk-ant-...
ai-blackteam config set providers.openai.api_key sk-...
ai-blackteam config set providers.google.api_key AIza...

Ollama always runs (no API key needed). Providers without keys are skipped. If no providers are configured, sweep tells you:

No providers configured. Set an api_key with: ai-blackteam config set providers.<name>.api_key VALUE

​The ai-blackteam batch command

​Selecting attacks

​Parallelism

​All flags

​Progress bar

​Summary table

​Exit codes

​Sensitive category warnings

​The ai-blackteam sweep command

​How it works

​Sweep summary table

​Sweep flags

​Configuring providers for sweep