ISO/IEC 42001

ISO/IEC 42001:2023 is the international standard for AI Management Systems. It’s the AI equivalent of ISO 27001 for information security — it tells organizations how to govern, develop, and operate AI systems responsibly. ai-blackteam maps harm categories to ISO 42001 Annex A controls so you can see which governance areas your model’s weaknesses fall under.

Annex A Controls

ControlNameWhat It Covers
A.2AI PoliciesEstablishing and maintaining policies for responsible AI development and use
A.3Internal OrganizationDefining roles, responsibilities, and governance structures for AI management
A.4Resources for AIEnsuring adequate resources, competence, and infrastructure for AI systems
A.5AI System Impact AssessmentConducting impact assessments to identify and evaluate risks of AI systems
A.6AI System LifecycleManaging AI through development, deployment, monitoring, and decommissioning
A.7Data for AIEnsuring data quality, provenance, privacy, and governance for training and operation
A.8Information for Interested PartiesTransparency and communication about AI capabilities, limitations, and impacts
A.9Use of AI SystemsResponsible deployment and use, including human oversight and intervention
A.10Third-Party RelationshipsManaging risks from third-party AI components, models, datasets, and services

How ai-blackteam Maps to ISO 42001

When ai-blackteam finds a bypass, it maps the harm category to the relevant ISO 42001 control. This tells you which area of your AI management system needs attention.
Harm CategoryControlWhy
phishingA.9Deployment/use controls failed to prevent harmful output
malwareA.6Lifecycle controls should prevent code generation risks
weaponsA.5Impact assessment should flag weapons content as unacceptable
self-harmA.5Impact assessment should identify self-harm risks
hate-speechA.9Use controls should prevent discriminatory output
pii-extractionA.7Data governance should protect PII in training and retrieval
system-prompt-leakageA.6Lifecycle controls should protect system instructions
cbrnA.5Impact assessment must cover CBRN risks
fraudA.9Use controls should prevent financial fraud assistance
child-safetyA.5Impact assessment must cover child safety risks
copyright-ipA.7Data governance should handle IP and copyright
misinformationA.8Transparency controls should prevent false claims
cybercrimeA.6Lifecycle controls should prevent exploitation assistance
privacy-violationA.7Data governance must protect personal information
discriminationA.5Impact assessment should evaluate fairness and bias
social-engineeringA.9Use controls should block manipulation techniques
agentic-risksA.10Third-party controls apply when agents use external tools

Using ISO 42001 for Compliance

If your organization is pursuing ISO 42001 certification (or aligning to it voluntarily), ai-blackteam results help you document evidence for your management system:
  • A.5 (Impact Assessment): ai-blackteam’s safety scan IS part of your impact assessment. Run ai-blackteam benchmark and export the results as evidence.
  • A.6 (Lifecycle): Regular scanning at each deployment stage. Use ai-blackteam batch in CI/CD as a lifecycle gate.
  • A.7 (Data): Test for PII extraction and data leakage with privacy-violation and information-disclosure attack categories.
  • A.9 (Use): The OWASP scorecard (ai-blackteam scorecard --standard llm) provides a structured compliance artifact.
  • A.10 (Third-Party): If you use third-party models, run ai-blackteam against each provider to assess their risk profile.
# Generate compliance evidence
ai-blackteam scorecard --standard compliance -o iso-42001-evidence.json

# Run safety benchmark as impact assessment evidence
ai-blackteam benchmark -p anthropic --output impact-assessment.json

Reference