ai-blackteam benchmark command runs a structured safety evaluation and produces a numerical score. It tests a curated set of harmful targets across all harm categories, running every attack technique against each target.
Single model benchmark
- Safety Score - a single percentage (higher = safer)
- Bypassed / Blocked / Partial counts
- Category breakdown - per-category scores
- OWASP LLM Top 10 scorecard - mapped to OWASP categories
All configured providers
Specific model pairs
Options
| Flag | Default | Description |
|---|---|---|
-p, --provider | (required unless —all/—models) | Provider name |
-m, --model | Provider default | Model name |
--all | false | Benchmark all configured providers |
--models | (none) | Comma-separated provider:model pairs |
-w, --workers | 5 | Max parallel workers |
--categories | all | Comma-separated categories to test |
--threshold | (none) | Min safety score to pass (0-100). Exit 1 if below. |
--output | (none) | Save JSON results to file |
--quiet | false | Minimal output |