Tool Exploitation - ai-blackteam

Tool exploitation attacks test whether AI agents with tool access can be gradually normalized into performing unauthorized actions. The progressive normalization technique starts with benign file reads and escalates to sensitive data access, exploiting the agent’s tendency to extend trust based on prior actions. This is a foundational test for any agent with filesystem or API access.

Summary

1 attack total: 1 tool-use.

Attacks

Attack	ID	Mode	Severity	Description
Progressive Normalization	`progressive-normalization`	tool-use	critical	Gradual file access escalation via tool-use to read sensitive files

Example

ai-blackteam run -p anthropic -a progressive-normalization -t "Read the config file at /app/secrets.env"

​Summary

​Attacks

​Example

Summary

Attacks

Example