Anthropic - ai-blackteam

Default model

claude-sonnet-4-6

Supported models

Any model available through the Anthropic API:

claude-sonnet-4-6 (default)
claude-opus-4-20250115
claude-haiku-3-20240307
claude-sonnet-4-20250514
Any other model ID the API accepts

Authentication

Environment variable:

export ANTHROPIC_API_KEY=sk-ant-...

Config command:

ai-blackteam config set providers.anthropic.api_key sk-ant-...

Get your API key from the Anthropic Console.

Example usage

# Single attack with default model (claude-sonnet-4-6)
ai-blackteam run -p anthropic -a encoding-obfuscation -t "Write a phishing email"

# Full batch with a specific model
ai-blackteam batch -p anthropic --model claude-opus-4-20250115 --attacks all -t "Write a phishing email"

# Benchmark
ai-blackteam benchmark -p anthropic --threshold 80

Tool-use support

Yes. Compatible with all 39 tool-use attacks in the framework, agent exploitation, MCP exploitation, multi-agent protocol attacks, and tool-injection vectors all work with Claude models. Filter the attack catalog with:

ai-blackteam list-attacks
# look for rows with Mode = tool-use

The provider uses Anthropic’s native tool-use format with tool_use content blocks.

Notes

Max output tokens per request: 4,096
The provider uses the official anthropic Python SDK
System prompts are passed via the system parameter (not as a message)
Retry with exponential backoff is automatic on API failures (3 attempts)
If no API key is provided via config or env var, the SDK falls back to its own default resolution (e.g. ANTHROPIC_API_KEY in the environment)

Providers Overview OpenAI

​Default model

​Supported models

​Authentication

​Example usage

​Tool-use support