OWASP Agentic AI Top 10 Evidence Cross-walk¶

Non-normative. This document is informative only. Nothing here changes TRACE v0.1 schema fields, wire formats, required claims, or conformance requirements. References to "TRACE" mean the TRACE v0.1 Trust Record as defined in spec/trace-v0.1.md.

Purpose¶

OWASP's Agentic AI Top 10 (ASI01–ASI10) names the principal risk categories for autonomous AI agents. TRACE is an evidence and attestation layer: it does not prevent these risks, but it produces verifiable, hardware-rooted records that support audit, incident review, and governance verification after or during execution.

This page answers a concrete question for each OWASP risk: which TRACE Trust Record claims carry relevant evidence, what that evidence actually proves, and where TRACE v0.1 leaves a gap that a future profile would need to close.

Field names are exact TRACE v0.1 claim names as specified in §3.1 of the spec. Sub-field references such as policy.bundle_hash, runtime.platform, and appraisal.status refer to the nested fields shown in the §3.2 wire-format example.

Mapping table¶

OWASP risk	Relevant TRACE claims	Evidence TRACE can provide	Limitations / out of scope	Future TRACE profile need
ASI01: Agent Goal Hijack	`policy`, `tool_transcript`, `data_class`, `runtime`, `transparency`	`policy.bundle_hash` shows which policy was bound at execution time; `policy.enforcement_mode` shows whether it ran in `enforce` or `silent`; `tool_transcript.hash` and `tool_transcript.call_count` show the scope of tool activity; `data_class` shows the sensitivity of data processed; `transparency` anchors the record to an append-only log for independent audit.	Does not capture the agent's internal goal representation or reasoning chain. Does not itself detect or block prompt injection. A hijacked goal that stays within recorded tool boundaries leaves no additional fingerprint in v0.1 claims.	MCP/A2A profile fields for intent binding, goal hash, prompt-to-plan linkage, and per-decision policy outcome records.
ASI02: Tool Misuse and Exploitation	`tool_transcript`, `policy`, `data_class`, `runtime`, `transparency`	`tool_transcript.hash` binds the transcript of tool invocations crossing a protocol boundary (MCP, A2A) into the signed record; `tool_transcript.call_count` bounds the observed call volume; `policy.bundle_hash` and `policy.enforcement_mode` show what rules were in force; `data_class` shows the sensitivity classification applied at the call layer; `runtime.platform` and `runtime.measurement` bind the execution environment.	`tool_transcript` covers calls crossing an instrumented protocol boundary. Functions embedded inside the deployed binary are bound only by `build_provenance` and `model`, not by per-invocation transcript records. Does not prove each call was semantically appropriate without per-call policy decision records.	MCP profile for per-call tool identity, parameters, egress data class, and allow/deny decision outcome per invocation.
ASI03: Identity and Privilege Abuse	`subject`, `cnf`, `signature`, `runtime`, `policy`, `appraisal`	`subject` carries the SPIFFE SVID binding workload identity to a TEE-held key; `cnf` binds the signing key to the TEE via the hardware measurement in `runtime`; `appraisal.status` records the verifier's judgment of the evidence against policy; `policy.bundle_hash` shows what authorization rules were bound. Together these allow a verifier to confirm that the claimed identity corresponded to a genuine, measured workload at record issuance time.	Does not replace IAM or RBAC systems. Does not prove that external permissions granted to the workload were correctly scoped. Does not cover lateral movement or privilege escalation that occurred at infrastructure layers outside the TEE boundary.	Delegation profile fields for scoped credentials, audience restriction, purpose binding, and per-action authorization evidence.
ASI04: Agentic Supply Chain Vulnerabilities	`build_provenance`, `model`, `runtime`, `transparency`, `appraisal`	`build_provenance.slsa_level`, `build_provenance.builder`, and `build_provenance.digest` bind build-time provenance to the running workload; `model.weights_digest` and `model.model_id` bind the model artifact; `runtime.measurement` and `runtime.rim_uri` bind the runtime image to vendor-published Reference Integrity Manifests; `transparency` provides an append-only anchor for the full record.	Does not enumerate every transitive dependency. Does not detect malicious code that was present at build time but is not reflected in measurement divergence. Does not prove model behavior is safe -- only that the identified artifact was the one that executed.	Richer AIBOM/SBOM linkage in the `model` sub-claim; vendor platform annexes specifying per-silicon supply-chain appraisal criteria.
ASI05: Unexpected Code Execution	`runtime`, `build_provenance`, `tool_transcript`, `policy`, `appraisal`	`runtime.measurement` shows what workload and container digest were loaded; `build_provenance.digest` and `build_provenance.slsa_level` bind the build artifact; `tool_transcript.hash` captures tool invocations at protocol boundaries; `appraisal.status` records whether the verifier accepted the runtime evidence against current RIMs.	Does not prevent code execution. Does not prove no code ran outside the recorded `tool_transcript` scope: inline code execution embedded in the workload binary is not captured at invocation granularity by v0.1 claims.	Tool and code-execution profile: sandbox identity, command execution transcript hash, filesystem and network boundary evidence per invocation.
ASI06: Memory and Context Poisoning	`model`, `data_class`, `tool_transcript`, `build_provenance`, `transparency`	`model.model_id` and `model.weights_digest` identify the model artifact; `data_class` records the sensitivity classification applied to inputs and outputs; `tool_transcript.hash` captures context-retrieval calls that crossed an instrumented boundary; `build_provenance.digest` binds the build artifact that includes any vector-store or RAG client code.	TRACE v0.1 does not bind memory snapshots, RAG corpus roots, embedding-store contents, or poisoning scan results. Context provenance -- where retrieved chunks originated and whether they were validated -- is not a v0.1 claim.	Memory and RAG profile: context source hashes, corpus Merkle roots, poisoning scan status, namespace and tenant boundary evidence.
ASI07: Insecure Inter-Agent Communication	`subject`, `cnf`, `signature`, `runtime`, `tool_transcript`, `transparency`	`subject` and `cnf` establish and bind workload identity for each participating agent; `signature` (or the enveloping-signature equivalent) proves record integrity; `runtime.measurement` binds the TEE that produced the record; `tool_transcript.hash` anchors the inter-agent call transcript; `transparency` anchors the record in an append-only log accessible to any verifier.	TRACE v0.1 does not define a normative A2A profile. Message-level semantic validation, nonce binding per message, and delegation chain records across multiple hops are not v0.1 claims.	A2A profile: peer agent identity, per-message hash, audience restriction, nonce, delegation chain, and protocol version binding.
ASI08: Cascading Failures	`tool_transcript`, `transparency`, `subject`, `appraisal`, `policy`	`tool_transcript.hash` and `tool_transcript.call_count` provide a bounded activity record for post-incident traceability; `subject` identifies the workload at each hop where a Trust Record was issued; `appraisal.status` and `appraisal.verifier` record the verification outcome; `policy.bundle_hash` shows what controls were bound; `transparency` provides an auditable anchor for reconstructing the sequence of records.	Does not itself detect cascade conditions, stop propagation, or reconstruct full multi-agent lineage unless every participating agent issued Trust Records and those records are correlated at analysis time. Lineage reconstruction requires out-of-band tooling.	Workflow provenance profile: parent/child record references, propagation lineage identifiers, blast-radius counters, and circuit-breaker event records.
ASI09: Human-Agent Trust Exploitation	`tool_transcript`, `policy`, `data_class`, `transparency`, `appraisal`	`tool_transcript.hash` records the set of actions the agent took at protocol boundaries; `policy.bundle_hash` and `policy.enforcement_mode` show what rules were in force; `data_class` records the sensitivity class of data involved; `appraisal.status` provides the verifier's assessment; `transparency` anchors the record for independent audit. Collectively these support a post-incident review of what the agent was authorized to do and what it did.	Does not prove what a human was shown in the UI, whether the explanation was accurate, or whether informed consent was obtained. Does not detect social engineering at the human layer.	HITL (human-in-the-loop) evidence profile: approval record, reviewer identity, hash of displayed risk summary, and human decision evidence bound into the Trust Record.
ASI10: Rogue Agents	`subject`, `runtime`, `cnf`, `signature`, `policy`, `tool_transcript`, `appraisal`, `transparency`	The full v0.1 claim set combines hardware-bound identity (`subject`, `cnf`), measured execution environment (`runtime`), policy binding (`policy.bundle_hash`, `policy.enforcement_mode`), tool activity (`tool_transcript`), verifier judgment (`appraisal.status`), and transparency anchoring (`transparency`) into one signed, independently verifiable record. A verifier can confirm that the agent presenting this record ran as a measured, policy-bound workload with a TEE-rooted identity at the stated time.	Does not determine behavioral intent. Does not detect rogue behavior that remained within the boundary of recorded claims. A workload can issue a valid Trust Record and still behave in ways the record does not constrain at inference time.	Behavioral integrity profile: declared capability manifest hash, expected tool set, execution baseline hash, and anomaly or quarantine event records.

What TRACE does not do¶

The table above surfaces per-risk scope boundaries. These apply across all ten risks:

TRACE does not prevent OWASP Agentic AI risks. TRACE is an evidence and attestation layer. It records what happened -- what executed, under what policy, in what environment, on what data class, invoking what tools -- and binds that record cryptographically to a hardware root of trust. Controls that act before, during, or around execution (firewalls, guardrails, RBAC, prompt filters, sandboxing) are out of scope.

TRACE does not adjudicate model behavior. As stated in §2.4 of the spec, prompt injection, jailbreaks, hallucination, and alignment drift are permanent scope boundaries. TRACE proves what model artifact executed and what countermeasures were bound; it does not evaluate whether the model's output was correct or intended.

TRACE v0.1 does not cover intra-binary execution. tool_transcript captures invocations crossing an instrumented protocol boundary (MCP, A2A, or equivalent). Code paths executed inside the deployed binary -- including embedded tool dispatch, in-process memory reads, or direct API calls -- are bound only by build_provenance and model, not at invocation granularity.

TRACE does not replace IAM, RBAC, or network controls. subject and cnf establish and bind workload identity, but the permissions granted to that identity by external systems are outside the Trust Record. A correctly identified workload can still hold excessive privileges.

TRACE does not capture UI or human-layer interactions. What a human sees, approves, or consents to is not a v0.1 claim. This affects ASI09 directly and is a gap for any risk that involves human oversight.

TRACE does not reconstruct multi-agent lineage automatically. Records from multiple agents can be correlated by a verifier using subject, transparency, and tool_transcript identifiers, but the v0.1 schema has no normative parent/child record pointer. Lineage reconstruction requires out-of-band analysis tooling and is addressed in the v0.2 workflow provenance profile item.

Relationship to the TRACE roadmap¶

This cross-walk is a v0.2 roadmap item listed in ROADMAP.md. The "Future TRACE profile need" column in the table above describes the additional claim structures that would close each gap; those profiles are separate roadmap items and will be specified in later releases.

The MITRE ATLAS cross-walk (a separate v0.2 item) will address tactics and techniques at a finer granularity than the OWASP risk categories here.

References¶

OWASP Agentic AI Top 10: https://genai.owasp.org/
TRACE v0.1 specification: spec/trace-v0.1.md
TRACE roadmap: ROADMAP.md
MITRE ATLAS: https://atlas.mitre.org/