EvaluateSummaryV3 format, letting you import safety scan results into Promptfoo’s dashboard and compare them alongside your existing evaluations.
Exporting to Promptfoo
The EvaluateSummaryV3 schema
The export produces a JSON file matching Promptfoo’s expected structure.
Per-result structure
Each result entry includes:

| Field | Description |
|---|---|
| `provider.id` | Provider and model (e.g., `anthropic:claude-sonnet-4-6`) |
| `prompt.raw` | The actual attack prompt sent |
| `prompt.label` | Attack technique name |
| `response.output` | Model’s full response text |
| `response.tokenUsage` | Token counts for the request |
| `success` | `true` if BLOCKED, `false` if BYPASSED |
| `score` | 1.0 (BLOCKED), 0.5 (PARTIAL), 0.0 (BYPASSED) |
| `gradingResult` | Detailed evaluation with metadata |
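
To make the field table concrete, here is a minimal Python sketch that assembles one result entry. It assumes the dotted field names map to nested JSON objects. `build_result` is a hypothetical helper, not part of the exporter, and the technique name, prompt text, and `tokenUsage` key are placeholders. The table does not say how `success` is set for a PARTIAL verdict, so the sketch assumes that only BLOCKED counts as success. `gradingResult` is left out to keep the example short.

```python
import json

# Verdict-to-score mapping taken from the field table.
SCORES = {"BLOCKED": 1.0, "PARTIAL": 0.5, "BYPASSED": 0.0}


def build_result(provider_id, technique, attack_prompt, output, token_usage, verdict):
    """Assemble one result entry (hypothetical helper, for illustration only)."""
    return {
        "provider": {"id": provider_id},
        "prompt": {"raw": attack_prompt, "label": technique},
        "response": {"output": output, "tokenUsage": token_usage},
        # Assumption: only BLOCKED counts as success; the table is silent on PARTIAL.
        "success": verdict == "BLOCKED",
        "score": SCORES[verdict],
    }


entry = build_result(
    provider_id="anthropic:claude-sonnet-4-6",
    technique="example-technique",         # placeholder technique name
    attack_prompt="<attack prompt text>",  # placeholder prompt
    output="<model response text>",        # placeholder response
    token_usage={"total": 0},              # placeholder; key name is assumed
    verdict="PARTIAL",
)
print(json.dumps(entry, indent=2))
```

An unknown verdict raises a `KeyError` rather than silently defaulting to a score, which seems the safer choice for safety scan data.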
gradingResult.metadata includes:
- `pluginId`: the ai-blackteam attack technique ID
- `strategyId`: single-turn or multi-turn
- `severity`: critical, high, medium, or low
- `tags`: OWASP LLM Top 10 tags (e.g., `owasp:llm01`)
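
As a rough illustration of how these metadata fields might be used after export, the Python sketch below builds a metadata object and groups unsuccessful results by OWASP tag. `make_metadata` and `failures_by_tag` are hypothetical helpers, and the plugin IDs in the sample data are invented placeholders. The exact string forms of `strategyId` and `severity` are assumed to match the wording in the list above.

```python
from collections import defaultdict

# Allowed values as listed above; the exact string forms are an assumption.
STRATEGIES = {"single-turn", "multi-turn"}
SEVERITIES = {"critical", "high", "medium", "low"}


def make_metadata(plugin_id, strategy_id, severity, tags):
    """Build a gradingResult.metadata object (hypothetical helper)."""
    if strategy_id not in STRATEGIES:
        raise ValueError(f"unknown strategyId: {strategy_id}")
    if severity not in SEVERITIES:
        raise ValueError(f"unknown severity: {severity}")
    return {
        "pluginId": plugin_id,
        "strategyId": strategy_id,
        "severity": severity,
        "tags": list(tags),
    }


def failures_by_tag(results):
    """Group the pluginIds of unsuccessful results by OWASP tag."""
    grouped = defaultdict(list)
    for result in results:
        if result["success"]:
            continue
        meta = result["gradingResult"]["metadata"]
        for tag in meta["tags"]:
            grouped[tag].append(meta["pluginId"])
    return dict(grouped)


# Sample data with invented plugin IDs, for illustration only.
results = [
    {"success": False, "gradingResult": {"metadata": make_metadata(
        "example-injection", "single-turn", "high", ["owasp:llm01"])}},
    {"success": True, "gradingResult": {"metadata": make_metadata(
        "example-roleplay", "multi-turn", "medium", ["owasp:llm01"])}},
]
print(failures_by_tag(results))
```

Grouping by tag is only one possible view; the same loop could key on `severity` or `strategyId` instead.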