# UnifAI Policies

<figure><img src="/files/BnBlavpGGBGqlmCabkbl" alt=""><figcaption></figcaption></figure>

UnifAI policies are your built-in guardrails for AI security and compliance. Instead of manually tracking complex regulations, UnifAI automates policy enforcement across your AI ecosystem, so you can innovate without risk.

Every policy is mapped to global standards like OWASP and EU AI Act, ensuring your AI systems stay compliant and resilient.

## Lineaje Policies

Lineaje provides out-of-the-box policies across four categories:

* AI Threats and Exploits
* Data Security and Privacy
* Identity and Access Control
* Vulnerability

In UnifAI, enter the prompt `View all policies` to see your AI Assets.

### AI Threats and Exploits&#x20;

Blocks prompt injection, adversarial inputs, and unsafe model behavior before they reach your AI apps.

<details>

<summary>Do not allow malicious content via hidden prompts</summary>

AI\_APP\_SEC\_001&#x20;

**Violation Summary**

Hidden or non-visible prompts detected in the system introduce risks of prompt injection, bypass of safety controls, and untraceable model behavior.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**

Hidden prompts introduce several risks including:&#x20;

* Undetectable prompt injection&#x20;
* Unpredictable, unsafe or incorrect output&#x20;
* Bypass safety and governance controls&#x20;
* Unsafe or inconsistent agent behavior&#x20;
* Regulatory and Ethical exposure&#x20;

**Attack Vector**: Prompt&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentially Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **Nov 18, 2024 - OWASP-LLM**: LLM01, LLM02, LLM04, LLM08&#x20;
* **March 2025 - OWASP-ASI**: ASI-01, ASI-04, ASI-07, ASI-09&#x20;
* **Aug 1, 2024 - EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;

**References**

<https://genai.owasp.org/llm-top-10/> &#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/> &#x20;

<https://artificialintelligenceact.eu/ai-act-explorer/>  &#x20;

</details>

<details>

<summary>Do not allow malicious content via encoded prompts</summary>

AI\_APP\_SEC\_002

**Violation Summary**

Encoded prompts are instructions hidden inside obfuscated text, Base64, hex, zero-width characters, steganographic patterns, metadata, or structured payloads. They allow attackers or internal actors to bypass oversight, evade filters, or manipulate an AI system without detection.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**&#x20;

Hidden prompts introduce several risks including:&#x20;

* Invisible prompt injection leading to unauthorized system behavior&#x20;
* Safety bypass (toxicity, policy evasion, jailbreaks)&#x20;
* Leakage of sensitive data or internal system instructions&#x20;
* Corruption of downstream workflows due to manipulated outputs&#x20;
* Violations of transparency, record-keeping, and explainability requirements&#x20;

**Attack Vector**: Prompt&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentially Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **Nov 18, 2024 - OWASP-LLM**: LLM01, LLM04, LLM05, LLM08&#x20;
* **March 2025 - OWASP-ASI**: ASI-01, ASI-04, ASI-07, ASI-09&#x20;
* **Aug 1, 2024 - EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/> &#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/> &#x20;

<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Use only LLMs from the organization’s approved list</summary>

AI\_APP\_SEC\_006

**Violation Summary** \
Using an LLM that is not on the organization’s approved list introduces uncontrolled security, privacy, compliance, and operational risks. Unapproved LLMs may have unknown data handling practices, insufficient security controls, unclear training or retention policies, weak contractual protections, or unvetted model behavior. This bypasses governance, procurement, and risk management processes, exposing the organization to data leakage, regulatory violations, vendor lock-in, and unpredictable AI behavior across agentic and automated workflows.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

High/Critical

**Technical Details**

Using an unapproved LLM introduces several risks including:&#x20;

* Uncontrolled processing, retention, or reuse of sensitive data and prompts&#x20;
* Unknown security posture, access controls, and logging practices&#x20;
* Potential training on proprietary or regulated data without consent&#x20;
* Incompatibility with organizational guardrails, monitoring, or audit tooling&#x20;
* Increased exposure to prompt injection, data leakage, or unsafe outputs&#x20;
* Breach of contractual, legal, or regulatory obligations&#x20;
* Loss of centralized governance, visibility, and incident response capability&#x20;

**Attack Vector**: LLM selection / API usage outside approved platforms&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: Critical&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Medium (instability or service changes)&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM03, LLM05, LLM08 &#x20;
* **March 2025 – OWASP-ASI**: ASI-04, ASI-05, ASI-09, ASI-12&#x20;
* **Aug 1, 2024 – EU AI Act**: Articles 11, 12, 13, 50, obligations for GPAI risk management and provider accountability&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>MCP server must validate and sanitize all input</summary>

AI\_APP\_SEC\_014

**Violation Summary**&#x20;

If an MCP server accepts input from clients, agents, or LLMs without validation or sanitization, it becomes vulnerable to malformed payloads, injection attacks, unauthorized tool invocation, unsafe command execution, and data corruption. Because MCP servers often expose high-privilege operations (file access, API calls, system actions), unvalidated input can be weaponized to manipulate workflows, escalate privileges, or deliver malicious instructions that compromise both the server environment and downstream systems.&#x20;

**Affected Assets**&#x20;

MCP Server&#x20;

**Severity**

Critical

**Input Validation**&#x20;

Validation ensures that input conforms to expected structure, type, length, format, and policy constraints before it is processed by the LLM. It answers the question if the input is allowed to be processed.&#x20;

**Examples of Validation**&#x20;

* Rejecting prompts longer than a defined maximum length&#x20;
* Enforcing schema compliance (e.g., JSON with specific fields only)&#x20;
* Blocking inputs containing disallowed patterns (e.g., ignore previous instructions, system override)&#x20;
* Restricting input sources to authenticated or trusted origins&#x20;
* Ensuring prompts match an approved task or intent category&#x20;

**Input Sanitization**&#x20;

Sanitization transforms input to remove, neutralize, or normalize unsafe elements while preserving legitimate intent. Sanitization ensures that the input is made safe before processing. &#x20;

**Examples of Sanitization**&#x20;

* Normalizing Unicode to remove obfuscation (e.g., leetspeak, homoglyphs)&#x20;
* Stripping zero-width or invisible characters&#x20;
* Decoding and inspecting encoded content (Base64, hex) before use&#x20;
* Escaping or isolating untrusted text so it cannot be interpreted as instructions&#x20;
* Removing or redacting sensitive data (PII, secrets)&#x20;

**Technical Details**

Not validating or sanitizing MCP server input introduces several risks including:&#x20;

* Injection of malicious commands, payloads, or structured data into tools or system functions&#x20;
* Execution of unsafe or hallucinated instructions originating from LLM output&#x20;
* Unauthorized access or misuse of server-side capabilities and sensitive APIs&#x20;
* Corruption of data, resources, or operational workflows through malformed input&#x20;
* Increased attack surface for prompt-to-system escalation attacks&#x20;
* Loss of governance, auditability, and explainability of server-driven actions&#x20;
* Violations of integrity, safety, and regulatory obligations for high-risk functions&#x20;

**Attack Vector**: MCP client → server input channel&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: Medium to High (depending on server capabilities)&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM01 (Prompt/Instruction Injection), LLM04 (Behavior Manipulation), LLM05 (Sensitive Information Disclosure), LLM06 (Hallucination Risks), LLM08 (Transparency & Audit Failures)&#x20;
* **March 2025 – OWASP-ASI**: ASI-01 (Input/Output Integrity), ASI-05 (Safe Handling), ASI-07 (Reliability), ASI-09 (Traceability), ASI-12 (Operational Monitoring)&#x20;
* **Aug 1, 2024 – EU AI Act**: Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus Annex III robustness & safety requirements&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>MCP clients must log all interactions with the MCP server</summary>

AI\_APP\_SEC\_022

**Violation Summary**\
A missing or incomplete logging mechanism between the MCP client and MCP server creates a critical visibility and governance gap. MCP interactions often trigger high-privilege actions (tool execution, data access, workflow modification). Without proper logs, misuse, anomalies, attacks, or unauthorized system changes cannot be detected, investigated, or attributed. This results in opaque AI behavior, broken audit trails, and non-compliance with required traceability and transparency obligations.&#x20;

**Affected Assets**

* MCP Client
* MCP Server

**Severity**

Critical

**Technical Details**

Failure to log MCP interactions introduces several risks including:&#x20;

* Undetectable misuse or abuse of MCP server tools&#x20;
* Inability to perform forensic investigation during an incident&#x20;
* Loss of accountability for AI-driven actions and decisions&#x20;
* Exposure to covert prompt injection or unauthorized system manipulation&#x20;
* Violations of traceability, transparency, and record-keeping requirements&#x20;
* Difficulty detecting anomalous behavior or lateral movement&#x20;
* Corruption of downstream workflows due to hidden actions&#x20;

**Attack Vector:** MCP tool invocation / API interaction

**Attack Complexity:** Low

**Privileges Required:** None, when exploited via LLM-driven tool calls.

**User Interaction:** None

**Confidentiality Impact:** High

**Integrity Impact:** High

**Availability Impact:** Low&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM:** LLM01, LLM04, LLM05, LLM08
* **March 2025 – OWASP-ASI:** ASI-01, ASI-04, ASI-07, ASI-09
* **Aug 1, 2024 – EU AI Act:** 1.11, 2.12, 3.13, 4.50&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Client must validate and sanitize any output from a MCP server</summary>

AI\_APP\_SEC\_023

**Violation Summary**

When an MCP client consumes output from an MCP server without validation or sanitization, it exposes the AI system and downstream components to malformed data, malicious payloads, injection attacks, hallucinated instructions, and unsafe tool execution. MCP server output may include structured data, commands, untrusted text, or model-generated content. Without safeguards, unvalidated output can drive unsafe automated actions, corrupt workflows, or leak sensitive information.&#x20;

**Affected Assets**

* AI Agent
* MCP Client

**Severity**

Critical

**Input Validation**&#x20;

Validation ensures that input conforms to expected structure, type, length, format, and policy constraints before it is processed by the LLM. It answers the question if the input is allowed to be processed.&#x20;

**Examples of validation**&#x20;

* Rejecting prompts longer than a defined maximum length&#x20;
* Enforcing schema compliance (e.g., JSON with specific fields only)&#x20;
* Blocking inputs containing disallowed patterns (e.g., ignore previous instructions, system override)&#x20;
* Restricting input sources to authenticated or trusted origins&#x20;
* Ensuring prompts match an approved task or intent category&#x20;

**Input Sanitization**&#x20;

Sanitization transforms input to remove, neutralize, or normalize unsafe elements while preserving legitimate intent. Sanitization ensures that the input is made safe before processing. &#x20;

**Examples of sanitization**

* Normalizing Unicode to remove obfuscation (e.g., leetspeak, homoglyphs)&#x20;
* Stripping zero-width or invisible characters&#x20;
* Decoding and inspecting encoded content (Base64, hex) before use&#x20;
* Escaping or isolating untrusted text so it cannot be interpreted as instructions&#x20;
* Removing or redacting sensitive data (PII, secrets)&#x20;

**Affected Assets**&#x20;

* AI Agent&#x20;
* MCP Client&#x20;

**Technical Details**&#x20;

Lack of output validation introduces several risks including:&#x20;

* Execution of harmful or unintended actions triggered by malformed MCP output&#x20;
* Injection of unsafe code, commands, or control sequences into downstream systems&#x20;
* Propagation of hallucinated, incorrect, or manipulated data&#x20;
* Leakage of sensitive information through unfiltered server responses&#x20;
* Corruption of business workflows or agent decision chains&#x20;
* Evasion of safety controls due to unmonitored tool responses&#x20;
* Violations of auditability, reliability, and compliance requirements&#x20;

**Attack Vector**: MCP response / server-generated output&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Medium (via cascading workflow corruption)&#x20;

**Framework**

* **Nov 18, 2024** – OWASP-LLM: LLM01, LLM03, LLM04, LLM06, LLM08&#x20;
* **March 2025** – OWASP-ASI: ASI-01, ASI-05, ASI-07, ASI-09&#x20;
* **Aug 1, 2024** – EU AI Act: 1.11, 2.12, 3.13, 4.50&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Do not use LLMs from the organization's disallowed list</summary>

AI\_APP\_SEC\_028

**Violation Summary** \
Using an LLM that is explicitly on the organization’s block list represents a deliberate bypass of governance, security, and risk controls. Block-listed LLMs are typically prohibited due to known deficiencies such as unsafe data handling, unacceptable training or retention practices, lack of contractual protections, regulatory exposure, weak security posture, or demonstrated unsafe behavior. Their use introduces severe security, privacy, compliance, and reputational risks and undermines centralized AI governance.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**

Using a block-listed LLM introduces several risks including:&#x20;

* Known or previously identified data leakage, retention, or misuse risks&#x20;
* Exposure of sensitive, proprietary, or regulated data to untrusted providers&#x20;
* Circumvention of organizational security, legal, and compliance controls&#x20;
* Lack of auditability, logging, or incident response visibility&#x20;
* Increased likelihood of unsafe, biased, or non-compliant model behavior&#x20;
* Breach of regulatory, contractual, or internal policy obligations&#x20;
* Loss of trust in AI governance and enforcement mechanisms&#x20;

**Attack Vector**: Unauthorized LLM selection / direct API or UI usage&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: Critical&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Medium&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM03, LLM05, LLM08&#x20;
* **March 2025 – OWASP-ASI**: ASI-04, ASI-05, ASI-09, ASI-12&#x20;
* **Aug 1, 2024 – EU AI Act**: Articles 11, 12, 13, 50, GPAI risk-management and provider accountability requirements&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Agent must validate, sanitize LLM output including for presence of eval or any dynamic code execution primitive in LLM output</summary>

AI\_APP\_SEC\_029

**Violation Summary**

When LLM outputs are consumed without validation or sanitization, the system becomes vulnerable to unsafe instructions, hallucinated commands, malicious payloads, and untrusted code. This risk becomes critical when the LLM output may contain eval, shell commands, SQL statements, or other dynamic execution primitives. If such outputs pass directly into an interpreter, agent tool, or workflow engine, they can lead to arbitrary code execution, data exfiltration, workflow corruption, or full system compromise.&#x20;

**Affected Assets**

AI Agent

**Severity**

Critical

**Input Validation**&#x20;

Validation ensures that input conforms to expected structure, type, length, format, and policy constraints before it is processed by the LLM. It answers the question if the input is allowed to be processed.&#x20;

**Examples of Validation**&#x20;

* Rejecting prompts longer than a defined maximum length&#x20;
* Enforcing schema compliance (e.g., JSON with specific fields only)&#x20;
* Blocking inputs containing disallowed patterns (e.g., ignore previous instructions, system override)&#x20;
* Restricting input sources to authenticated or trusted origins&#x20;
* Ensuring prompts match an approved task or intent category&#x20;

**Input Sanitization**&#x20;

Sanitization transforms input to remove, neutralize, or normalize unsafe elements while preserving legitimate intent. Sanitization ensures that the input is made safe before processing. &#x20;

**Examples of Sanitization**&#x20;

* Normalizing Unicode to remove obfuscation (e.g., leetspeak, homoglyphs)&#x20;
* Stripping zero-width or invisible characters&#x20;
* Decoding and inspecting encoded content (Base64, hex) before use&#x20;
* Escaping or isolating untrusted text so it cannot be interpreted as instructions&#x20;
* Removing or redacting sensitive data (PII, secrets) &#x20;

**Technical Details**&#x20;

Failure to validate LLM output introduces several risks including:&#x20;

* Accidental or malicious execution of model-generated code (e.g., eval, exec, Function, subprocess calls)&#x20;
* Injection of harmful commands or payloads into tools, agents, or downstream applications&#x20;
* Execution of hallucinated instructions that modify resources, corrupt data, or trigger destructive operations&#x20;
* Leakage of internal or sensitive information through improperly filtered responses&#x20;
* Exploitation of agents that automatically convert LLM output into actions (“AI code injection”)&#x20;
* Loss of safety, explainability, reliability, and auditability in automated pipelines&#x20;
* Violations of governance, logging, and traceability requirements&#x20;

**Attack Vector**: LLM output → downstream interpreter / agent tool&#x20;

**Attack Complexity**: Low (LLM can be tricked into generating dangerous primitives)&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None (fully autonomous execution paths are most at risk)&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: Medium to High&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM01, LLM03, LLM04, LLM06, LLM08&#x20;
* **March 2025 – OWASP-ASI**: ASI-01, ASI-05, ASI-07, ASI-09, ASI-12&#x20;
* **Aug 1, 2024 – EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Do not allow malicious content via hidden prompts written in leetspeak</summary>

AI\_APP\_SEC\_032

**Violation Summary** \
Allowing prompts written in leetspeak (e.g., h4x0r, 3v4l, 1nj3ct, byp4ss) or similar obfuscated language enables attackers to evade input validation, safety filters, and policy enforcement mechanisms. Leetspeak transforms malicious intent into visually altered but semantically equivalent text, allowing prompt injection, jailbreak attempts, encoded instructions, and policy bypasses to slip past keyword-based detection and moderation layers. This weakens the integrity and reliability of LLM-driven systems, especially in agentic and autonomous workflows.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**&#x20;

Allowing leetspeak introduces several risks including:&#x20;

* Bypass of keyword-based safety, moderation, and policy filters&#x20;
* Injection of malicious or unsafe instructions disguised as benign input&#x20;
* Increased success of encoded or obfuscated prompt attacks&#x20;
* Manipulation of agent reasoning or tool invocation logic&#x20;
* Reduced auditability and explainability due to obfuscated intent&#x20;
* Amplification of downstream risks when leetspeak-generated outputs drive actions&#x20;

**Attack Vector**: LLM input prompt (user, agent, or external system)&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None (especially in automated or agent-driven scenarios)&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High to Critical&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM01, LLM04, LLM06, LLM08 &#x20;
* **March 2025 – OWASP-ASI**: ASI-01, ASI-04, ASI-07, ASI-09), ASI-12 &#x20;
* **Aug 1, 2024 – EU AI Act**: Articles 11, 12, 13, 50, plus Annex III robustness and risk-management requirements&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>MCP server must not interact directly with an LLM</summary>

AI\_APP\_SEC\_033

**Violation Summary**\
When an MCP server directly interacts with an LLM—rather than operating only through an authenticated, validated, policy-enforcing MCP client—it collapses the trust boundary between system capabilities and untrusted model output. This allows an LLM to influence, manipulate, or trigger server-side actions without authorization or validation. Direct LLM-to-server interaction bypasses safety controls, authentication layers, input validation, output filtering, logging standards, and audit requirements. This exposes the environment to unbounded prompt injection, unsafe tool execution, data leakage, and full-system compromise.&#x20;

**Affected Assets**

* LLM
* MCP Server

**Severity**

Critical

**Technical Details**\
Allowing an MCP server to directly interact with an LLM introduces several risks including:&#x20;

* Execution of unsafe, hallucinated, or malicious LLM-generated instructions on high-privilege server tools&#x20;
* Prompt injection attacks gaining direct access to server capabilities&#x20;
* Bypassing client-side authentication, authorization, validation, and logging layers&#x20;
* Data leakage through uncontrolled LLM requests or responses&#x20;
* Inability to enforce least-privilege and zero-trust boundaries between model and system operations&#x20;
* Loss of auditability because actions occur without the MCP client as an intermediary&#x20;
* Violations of governance, safety, and regulatory obligations for high-risk AI systems&#x20;

**Attack Vector:** LLM output → server action path

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** High

**Integrity Impact:** Critical

**Availability Impact:** Medium to High&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM:** LLM01 (Prompt/Instruction Injection), LLM04 (Behavior Manipulation), LLM05 (Data Disclosure), LLM06 (Hallucination Risks), LLM08 (Transparency & Audit Failures)
* **Mar 2025 – OWASP-ASI:** ASI-01 (Input/Output Integrity), ASI-04 (Governance), ASI-05 (Safe Handling), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus Annex III robustness & safety controls for high-risk systems&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Clear exit or termination criteria must exist for the agent to consider its task complete and stop executing</summary>

AI\_APP\_SEC\_034

**Violation Summary**\
When an AI agent is not given explicit, enforceable exit or termination criteria, it may continue executing indefinitely, escalate actions beyond intended scope, repeatedly invoke tools, consume excessive compute, or enter unsafe operational loops. Lack of defined stopping conditions increases the risk of runaway behavior, unintended system modifications, resource exhaustion, privacy violations, and unbounded interaction with external systems or MCP tools. Agents without termination logic become unpredictable, ungovernable, and potentially harmful. &#x20;

**Affected Assets**

AI Agent

**Severity**

High/Critical

**Technical Details**\
Missing termination criteria introduces several risks including:&#x20;

* Infinite or runaway task execution that triggers unnecessary or harmful actions&#x20;
* Repeated tool invocation (MCP or external APIs), leading to data exposure or workflow corruption&#x20;
* Accidental escalation of privileges as the agent searches endlessly for ways to complete the task
* Hallucination-driven decisions due to self-reinforcing reasoning loops&#x20;
* Excessive resource consumption or uncontrolled cost&#x20;
* Increased attack surface for prompt injection that pushes the agent into unsafe recursive behavior&#x20;
* Violations of safety, oversight, and accountability requirements&#x20;

**Attack Vector:** Agent reasoning cycle / task execution loop

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** Medium to High (depending on tool access)

**Integrity Impact:** High

**Availability Impact:** Medium to High&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM01 (Injection), LLM04 (Behavior Manipulation), LLM06 (Hallucination Risks), LLM08 (Transparency & Audit Failures)
* **Mar 2025 – OWASP-ASI**: ASI-01 (Integrity), ASI-04 (Governance), ASI-07 (Reliability), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act**: Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), Annex III requirements for safe, predictable operation of high-risk systems&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Agents must log all interactions with an LLM</summary>

AI\_APP\_SEC\_035

**Violation Summary**\
If agents do not log their interactions with an LLM, including prompts, responses, tool requests, and reasoning triggers, organizations lose visibility into how decisions were made, what data was exchanged, and whether harmful or unauthorized actions occurred. Missing LLM interaction logs break auditability, hinder incident response, obscure the source of incorrect or unsafe outputs, and prevent compliance verification. Lack of logging also enables attackers to exploit the agent–LLM channel without detection. &#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**\
Not logging agent ↔LLM interactions introduces several risks including:&#x20;

* Inability to reconstruct how the agent reached a decision or triggered an action&#x20;
* Loss of forensic evidence needed for incident response or regulatory review&#x20;
* Undetected prompt injection, harmful outputs, or unsafe tool invocations&#x20;
* Unmonitored leakage of sensitive data or PII through prompts or responses&#x20;
* Difficulty identifying hallucination-driven failures or behavioral drift&#x20;
* Loss of traceability required for governance, transparency, and safety assurance&#x20;
* Violations of logging, documentation, and accountability requirements&#x20;

**Attack Vector:** Agent ↔ LLM communication channel

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** High

**Integrity Impact:** High

**Availability Impact:** Low&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM01 (Injection), LLM03 (Data Leakage), LLM04 (Behavior Manipulation), LLM08 (Transparency & Audit Failures)
* **Mar 2025 – OWASP-ASI**: ASI-01 (Integrity), ASI-05 (Data Handling), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act**: Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), with Annex III traceability requirements for high-risk systems&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/>  &#x20;

</details>

<details>

<summary>The AI Model must validate and sanitize any input before processing</summary>

AI\_APP\_SEC\_038

**Violation Summary**&#x20;

When AI Model accepts input without proper validation and sanitization, it becomes highly susceptible to prompt injection, encoded or hidden instructions, malicious payloads, and adversarial manipulation. Unsanitized inputs—originating from users, agents, tools, MCP servers, or external systems—can override system instructions, bypass guardrails, contaminate reasoning, and trigger unsafe downstream actions. This risk is amplified in agentic and tool-enabled environments where AI Model output directly influences real systems.&#x20;

**Affected Assets**

LLM

**Severity**

Critical

**Input Validation**&#x20;

Validation ensures that input conforms to expected structure, type, length, format, and policy constraints before it is processed by the AI Model. It answers the question if the input is allowed to be processed.&#x20;

**Examples of Validation**

* Rejecting prompts longer than a defined maximum length&#x20;
* Enforcing schema compliance (e.g., JSON with specific fields only)&#x20;
* Blocking inputs containing disallowed patterns (e.g., ignore previous instructions, system override)&#x20;
* Restricting input sources to authenticated or trusted origins&#x20;
* Ensuring prompts match an approved task or intent category&#x20;

**Input Sanitization**&#x20;

Sanitization transforms input to remove, neutralize, or normalize unsafe elements while preserving legitimate intent. Sanitization ensures that the input is made safe before processing. &#x20;

**Examples of Sanitization**

* Normalizing Unicode to remove obfuscation (e.g., leetspeak, homoglyphs)&#x20;
* Stripping zero-width or invisible characters&#x20;
* Decoding and inspecting encoded content (Base64, hex) before use&#x20;
* Escaping or isolating untrusted text so it cannot be interpreted as instructions&#x20;
* Removing or redacting sensitive data (PII, secrets)&#x20;

**Technical Details**&#x20;

Failure to validate and sanitize AI Model input introduces several risks including:&#x20;

* Prompt injection that overrides system and developer intent&#x20;
* Encoded, obfuscated, or hidden instructions bypassing safety controls&#x20;
* Injection of malicious content that manipulates tool usage or agent behavior&#x20;
* Leakage of sensitive data caused by adversarial prompt construction&#x20;
* Hallucination amplification driven by malformed or hostile inputs&#x20;
* Propagation of unsafe or untrusted instructions to downstream systems&#x20;
* Loss of transparency, auditability, and policy enforcement across AI workflows&#x20;

**Attack Vector**: AI Model input channel (user, agent, tool, MCP, external system)&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None (especially in autonomous or agent-driven flows)&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: Medium&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM**: LLM01 (Prompt Injection), LLM04 (Model Behavior Manipulation), LLM05 (Sensitive Information Disclosure), LLM08 (Insufficient Transparency & Auditability)&#x20;
* **March 2025 – OWASP-ASI**: ASI-01 (Input Integrity), ASI-04 (Governance), ASI-07 (Reliability), ASI-09 (Traceability), ASI-12 (Operational Monitoring)&#x20;
* **Aug 1, 2024 – EU AI Act**: Articles 11 (Technical Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus Annex III robustness, safety, and risk-management requirements&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Sanitize and validate all input to the AI Model</summary>

AI\_APP\_SEC\_039

**Violation Summary**\
If input sent to an LLM is not validated and sanitized, the system becomes vulnerable to prompt injection, obfuscated or encoded instructions, malformed payloads, and adversarial manipulation. Unchecked inputs originating from users, agents, tools, uploaded files, MCP services, or external systems can override system intent, bypass safety controls, contaminate reasoning, and trigger unsafe downstream actions. This risk is amplified in agentic, tool-enabled, and autonomous workflows where LLM output directly influences real systems.

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**

Failure to validate and sanitize input before sending it to an LLM introduces several risks including:

* Prompt injection that overrides system and developer intent
* Encoded, hidden, or obfuscated instructions (Base64, leetspeak, zero-width characters) bypassing guardrails
* Injection of malicious content that manipulates tool usage or agent behavior
* Leakage of sensitive data caused by adversarial prompt construction
* Hallucination amplification driven by malformed or hostile inputs
* Propagation of unsafe or untrusted instructions to downstream systems
* Loss of transparency, auditability, and policy enforcement across AI workflows

**Attack Vector**: Input channel → LLM (user, agent, tool, file ingestion, MCP, external system)

**Attack Complexity**: Low

**Privileges Required**: None

**User Interaction**: None (especially in autonomous or agent-driven flows)

**Confidentiality Impact**: High

**Integrity Impact**: Critical

**Availability Impact**: Medium

**Framework**

* **OWASP-LLM**: LLM01, LLM04, LLM05, LLM08
* **OWASP-ASI**: ASI-01 (Input Integrity), ASI-04, ASI-07, ASI-09, ASI-12
* **EU AI Act**: Articles 11, 12, 13, 50, plus Annex III robustness, safety, and risk-management requirements

**References**

[https://genai.owasp.org/llm-top-10/ ](https://genai.owasp.org/llm-top-10/)\
<https://genai.owasp.org/initiatives/agentic-security-initiative/>\
<https://artificialintelligenceact.eu/ai-act-explorer/>

</details>

<details>

<summary>Do not allow malicious content via prompts included in uploaded files</summary>

AI\_APP\_SEC\_040

**Violation Summary**\
If uploaded files (documents, PDFs, spreadsheets, images with OCR, code files, logs) are ingested by an LLM or agent without inspection for malicious prompts, attackers can embed hidden, encoded, or context-manipulating instructions that influence model behavior. These “prompt-in-files” attacks allow adversaries to bypass input controls, poison agent reasoning, extract sensitive data, or trigger unauthorized tool actions—often without any visible user prompt.

**Affected Assets**

AI Agent

**Severity**

Critical

**Technical Details**

Failing to scan uploaded files for malicious prompts introduces several risks including:

* Hidden prompt injection embedded in document text, comments, metadata, or OCR layers
* Encoded or obfuscated instructions (Base64, leetspeak, zero-width characters) evading detection  \
  Cross-context contamination where file content overrides system or developer instructions
* Unauthorized tool invocation or workflow manipulation driven by file-based prompts
* Leakage of sensitive data due to adversarial instructions embedded in files
* Loss of explainability when behavior is influenced by unseen file content
* Violations of transparency, auditability, and policy enforcement requirements

**Attack Vector**: File upload > document ingestion / OCR / parsing pipeline

**Attack Complexity**: Low

**Privileges Required**: None

**User Interaction**: None, especially in automated ingestion or agent workflows.

**Confidentiality Impact**: High

**Integrity Impact**: Critical

**Availability Impact**: Medium

**Framework**

* **OWASP-LLM**: LLM01, LLM04, LLM05, LLM08
* **OWASP-ASI**: ASI-01, ASI-04, ASI-07, ASI-09, ASI-12
* **EU AI Act**: Articles 11, 12, 13, 50, plus Annex III robustness, safety, and risk-management requirements

**References**

<https://genai.owasp.org/llm-top-10/>\
<https://genai.owasp.org/initiatives/agentic-security-initiative/>\
<https://artificialintelligenceact.eu/ai-act-explorer/>

</details>

<details>

<summary>Do not allow prompts that can execute malicious commands at runtime</summary>

AI\_APP\_SEC\_059

**Violation Summary**\
If prompts are not checked for content that may trigger runtime command execution, attackers can inject malicious instructions that cause the system to execute shell commands, file operations, or other high-privilege actions. In agentic or tool-enabled environments, LLM outputs can directly influence interpreters, orchestration engines, or automation pipelines. Without execution safeguards, prompt injection can escalate from text manipulation to full system compromise.&#x20;

**Affected Assets**&#x20;

AI Agent&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Failing to enforce checks on prompts that may execute commands introduces several risks including:&#x20;

* Command injection through prompts that trigger shell operations&#x20;
* Execution of malicious primitives such as eval, exec, subprocess calls, or dynamic interpreters&#x20;
* Corruption of files, configurations, or operational state&#x20;
* Loss of integrity and trust in AI-driven automation&#x20;
* Violations of secure coding, governance, and runtime safety controls&#x20;

**Attack Vector**: Prompt → runtime interpreter &#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None (especially in autonomous or agent-driven workflows)&#x20;

**Confidentiality Impact**: Critical&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: High&#x20;

**Framework**&#x20;

* **OWASP LLM**: LLM01, LLM04, LLM05, LLM08&#x20;
* **OWASP ASI**: ASI-01, ASI-04, ASI-05&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)&#x20;

</details>

<details>

<summary>Enforce synthetic content provenance, labeling, and watermarking for AI-generated outputs</summary>

AI\_APP\_SEC\_064

**Violation Summary**

AI systems that generate text, images, audio, video, or code without provenance metadata, content labels, or embedded watermarks enable disinformation, impersonation, and undetectable forgery. AI risk management transparency and accountability expectations and content authenticity guidance require disclosed origin, machine-readable provenance, and tamper-evident markers on all synthetic outputs.&#x20;

**Affected Assets**&#x20;

* AI Model&#x20;
* AI Agent&#x20;
* LLM &#x20;

**Severity**

Critical

**Technical Details**&#x20;

Missing or weak synthetic content provenance, labeling, and watermarking for ai-generated outputs controls introduces risks including:&#x20;

* Returning AI-generated content without machine-readable provenance metadata, such as model identifiers, timestamps, or content origin tags.&#x20;
* Serving AI-generated content without a clear label indicating its synthetic origin.&#x20;
* Producing AI-generated images, audio, or video without embedding a steganographic or cryptographic watermark.&#x20;
* Utilizing unsigned provenance metadata, which allows the origin information to be silently removed or tampered with.&#x20;
* Accepting externally supplied content claimed as human-authored without verifying the absence of AI provenance markers upon ingestion.&#x20;
* Failing open when labeling or watermarking errors occur, resulting in the delivery of unlabeled synthetic content instead of safely blocking the request.&#x20;

**Attack Vector**: Network / Application &#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: Low&#x20;

**User Interaction**: None &#x20;

**Confidentiality Impact**: None&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: None&#x20;

**Framework**

* **Jan 26, 2023 — NIST AI RMF**: GOVERN 4.1 (Accountability), MEASURE 2.11 (Transparency & Traceability) and NIST AI 100-4 (Digital Content Transparency)&#x20;
* **Aug 1, 2024 — EU AI Act**: Obligations for transparency and content authenticity (Art. 52), and requirements for high-risk AI systems regarding technical security, auditability, and human oversight (Art. 15, 14, 9)&#x20;
* **Nov 18, 2024 — OWASP-LLM**: LLM09 (Misinformation), LLM02 (Insecure Output Handling), LLM05 (Supply Chain Vulnerabilities) &#x20;

**References**

<https://genai.owasp.org/llm-top-10/>

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

<https://artificialintelligenceact.eu/>&#x20;

</details>

<details>

<summary>Do not allow prompts with high-risk system commands</summary>

AI\_APP\_SEC\_066

**Violation Summary**\
If prompts are not inspected for high-risk system or shell commands, attackers can embed instructions that lead to execution of destructive or unauthorized operations. In agentic and tool-enabled environments, LLM outputs may be interpreted as executable instructions, allowing prompt content to directly trigger command execution. Without detection and filtering of high-risk commands, prompt injection can escalate into command execution, system compromise, data destruction, or unauthorized access.&#x20;

**Affected Assets**&#x20;

AI Agent&#x20;

**Severity**

Critical

**Technical Details**&#x20;

* Failing to check prompts for high-risk system and shell commands introduces several risks including:&#x20;
* Execution of dangerous commands such as file deletion, privilege escalation, or system modification&#x20;
* Command injection through prompts interpreted by shell, scripting engines, or automation tools&#x20;
* Unauthorized access to system resources, environment variables, or sensitive files&#x20;
* Data exfiltration via command-line utilities or scripted API calls&#x20;
* Lateral movement across systems using injected commands&#x20;
* Corruption or deletion of critical data and configurations&#x20;
* Abuse of automation pipelines and agent toolchains&#x20;
* Loss of integrity, reliability, and control over AI-driven operations&#x20;

**Attack Vector**: Prompt → shell / system interpreter / agent tool execution&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None (especially in automated or agent-driven workflows)&#x20;

**Confidentiality Impact**: Critical&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: High&#x20;

**Framework**&#x20;

* **OWASP LLM**: LLM01, LLM04, LLM05, LLM08&#x20;
* **OWASP ASI**: ASI-01, ASI-04, ASI-05, ASI-09, ASI-12 &#x20;
* **EU AI Act**: Article 11, 12, 13, 50&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Detect direct string interpolation of untrusted input into LLM prompts</summary>

AI\_APP\_SEC\_067

**Violation Summary**

Direct embedding of unvalidated, unsensitized, or unescaped user-controlled data into LLM prompt strings via f-strings, str.format(), string concatenation, or template substitution allows adversaries to inject instructions that override system context, hijack agent behavior, or exfiltrate sensitive information. Code must enforce structural separation between prompt templates and untrusted data so that user input is never interpreted as instructions by the model.&#x20;

**Affected Assets**

* AI Agent&#x20;
* MCP Server&#x20;
* AI Model&#x20;

**Severity**

Critical

**Technical Details**\
Direct string interpolation introduces several risks including:&#x20;

* Undetectable prompt injection, unpredictable, unsafe or incorrect output&#x20;
* Bypass safety and governance controls&#x20;
* Unsafe or inconsistent agent behavior, data exfiltration and, system context overrides&#x20;

**Attack Vector**: Prompt&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentially Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **Nov 18, 2024 - OWASP-LLM**: LLM01&#x20;
* **Jan 26, 2023 - NIST AI RMF**: MAP 1.5, MEASURE 2.4 Oct 31, 2023
* **MITRE ATLAS**: AML.T0051&#x20;

**References**

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://atlas.mitre.org/techniques/AML.T0051/>&#x20;

</details>

<details>

<summary>Detect LLM output used directly in security-sensitive decisions without Human-in-the-Loop (HITL) validation </summary>

AI\_APP\_SEC\_068

**Violation Summary**&#x20;

Detects and prevents the direct use of unvalidated LLM outputs in security-sensitive decisions without Human-in-the-Loop (HITL) validation. LLM output is probabilistic and may hallucinate or be adversarially manipulated, so it must never be the sole authority for a security decision.&#x20;

**Affected Assets**

* AI Agent &#x20;
* LLM Application &#x20;
* AI Model &#x20;

**Severity**

Critical

**Technical Details** &#x20;

Missing or weak Human-in-the-Loop (HITL) validation or rule-based check controls introduces risks including: &#x20;

* Relying on raw LLM output as the sole condition in an access, authentication, or permission check without a HITL gate or rule-based check.&#x20;
* Feeding LLM output directly into a function enforcing security policy or classification without a HITL gate or rule-based check.&#x20;
* Granting elevated privileges or expanded scope based on LLM output without a HITL gate or rule-based check.&#x20;
* Failing to implement a deterministic validation step (like an allowlist or schema enforcement) between the LLM response and a security-sensitive operation.&#x20;

**Attack Vector:** Network / Application &#x20;

**Attack Complexity:** Low &#x20;

**Privileges Required:** None to Low &#x20;

**User Interaction:** None to Limited &#x20;

**Confidentiality Impact:** High &#x20;

**Integrity Impact:** High &#x20;

**Availability Impact:** High &#x20;

**Framework**

* **Oct 2023 — OWASP-LLM (v1.1)**: LLM09 (Overreliance). &#x20;
* **Jan 26, 2023 — NIST AI RMF 1.0**: MANAGE, GOVERN &#x20;
* **Oct 2020 — MITRE ATLAS** &#x20;
* **Aug 2024 — EU AI Act**&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/> &#x20;

<https://artificialintelligenceact.eu/article/14/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework> &#x20;

</details>

<details>

<summary>AI Agent must implement Human-in-the-Loop (HITL) approval flow for risky operations like delete, purge, destroy</summary>

AI\_APP\_SEC\_069

**Violation Summary**\
If high-risk operations (e.g., delete, purge, destroy, move, export) are executed without Human-in-the-Loop (HITL) oversight, AI agents and automated systems can perform destructive or unauthorized actions without validation. In agentic and tool-enabled environments, LLM outputs can directly trigger these operations, and prompt injection, hallucination, or misinterpretation can lead to irreversible damage. Absence of HITL removes a critical control layer needed to prevent catastrophic system, data, or business impact.&#x20;

**Affected Assets**

AI Agent&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Not implementing HITL for high-risk operations introduces several risks including:&#x20;

* Accidental or malicious deletion, purge, or destruction of critical data or resources&#x20;
* Execution of irreversible actions triggered by prompt injection or adversarial inputs&#x20;
* Hallucination-driven decisions resulting in destructive operations&#x20;
* Unauthorized changes due to compromised agents or misconfigured permissions&#x20;
* Inability to detect or stop harmful actions before execution&#x20;
* Lack of accountability and validation for critical system changes&#x20;
* Increased blast radius in automated workflows without human checkpoint&#x20;
* Violations of governance, risk management, and safety requirements&#x20;

**Attack Vector**: LLM output → agent tool execution / automation pipeline

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None (in absence of HITL controls)&#x20;

**Confidentiality Impact**: Medium&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: Critical&#x20;

**Framework**

* **OWASP LLM**: LLM01, LLM06, LLM08&#x20;
* **OWASP ASI**: ASI-02, ASI-10&#x20;
* **EU AI Act**: Article 13, Article 50, Annex III requirements for human oversight, safety, and risk management in high-risk systems&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Detect and block all forms of prompt injection attacks in user inputs and file contents</summary>

AI\_APP\_SEC\_070

**Violation Summary**

Failure to detect and block prompt injection attacks in user inputs, uploaded files, retrieved content, emails, web pages, or other external context sources can allow attackers to manipulate LLM behavior, override system instructions, bypass safety controls, exfiltrate sensitive data, trigger unauthorized tool execution, and compromise autonomous agent decision-making. Prompt injection is recognized by industry and government guidance as one of the most critical security risks affecting generative AI and agentic systems.&#x20;

**Affected Assets**&#x20;

* AI Model&#x20;
* AI Agent&#x20;

**Severity**

Critical

**Technical Details**

Prompt injection attacks exploit the inability of LLMs to reliably distinguish trusted instructions from untrusted external content. Malicious instructions may be embedded directly in prompts or indirectly within files, documents, web pages, emails, images, repositories, or retrieved content.&#x20;

Failure to detect and block such attacks can result in:&#x20;

* Override or suppression of system instructions&#x20;
* Leakage of sensitive or confidential data&#x20;
* Unauthorized tool invocation or API execution&#x20;
* Execution of destructive or unsafe operations&#x20;
* Manipulation of autonomous agent reasoning&#x20;
* Cross-context or cross-agent contamination&#x20;
* Safety guardrail bypass and policy evasion&#x20;
* Credential theft or token exposure&#x20;
* Privilege escalation through indirect instructions&#x20;
* Compromise of downstream systems and workflows&#x20;
* Corruption of outputs, decisions, or generated code&#x20;
* Persistent propagation across multi-agent environments&#x20;

**Attack Vector:** Prompt / Context Injection&#x20;

**Attack Complexity:** Low to Medium&#x20;

**Privileges Required:** None&#x20;

**User Interaction:** Required&#x20;

**Confidentiality Impact:** High&#x20;

**Integrity Impact:** High&#x20;

**Availability Impact:** Medium&#x20;

**Framework**&#x20;

* **Nov 2024 - OWASP-LLM**: LLM01, LLM06&#x20;
* **Mar 2025 - OWASP-ASI**: ASI-01, ASI-02, ASI-03, ASI-07&#x20;
* **Aug 2024 - EU AI Act**: Article 15&#x20;
* **Jan 2023 - NIST AI RMF**: 1.0 - GV-3, MP-1, MP-3, MG-1, MG-2&#x20;

**References**

<https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com>

<https://artificialintelligenceact.eu/ai-act-explorer/?utm_source=chatgpt.com>

<https://www.nist.gov/itl/ai-risk-management-framework?utm_source=chatgpt.com>

</details>

<details>

<summary>Enforce chemical, biological, radiological, or nuclear (CBRN) threat prevention safeguards in AI-enabled systems</summary>

AI\_APP\_SEC\_071

**Violation Summary**

AI systems that generate, design, or analyze chemical, biological, radiological, or nuclear (CBRN) content without screening and biological safeguards can enable misuse for weapons development, regulated-pathogen work, or other high-consequence harm. &#x20;

**Affected Assets**

* LLM &#x20;
* AI Agent &#x20;
* LLM Application &#x20;

**Severity**

High

**Technical Details** &#x20;

Missing or weak CBRN controls in deployment code introduces risks including: &#x20;

* Unscreened nucleic acid synthesis, lab routing, or bioinformatics uploads to external providers &#x20;
* CBRN-capable model, agent, or scientific-tool paths without classifiers, policy gates, or tool allowlists &#x20;
* Exposure of pathogen, clinical, or dual-use biological data without authentication, least privilege, labels, or encryption &#x20;
* Fail-open behavior when screening fails, times out, is ambiguous, or is bypassed &#x20;
* Soft handling (log or warn only) of high-confidence screening hits while the pipeline continues &#x20;
* Client-controlled flags or parameters that disable hazardous-sequence screening or biosafety filters &#x20;

**Attack Vector:** Network &#x20;

**Attack Complexity:** Medium &#x20;

**Privileges Required:** None &#x20;

**User Interaction:** None &#x20;

**Confidentially Impact:** High &#x20;

**Integrity Impact:** High &#x20;

**Availability Impact:** Low &#x20;

**Framework**

* **Oct 30, 2023** - Executive Order 14110 (Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence) &#x20;
* **Jan 26, 2023** - NIST AI Risk Management Framework (AI RMF 1.0) &#x20;

**References**

<https://www.federalregister.gov/documents/2023/11/01/2023-24283/safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework/ai-rmf-1-0> &#x20;

</details>

### Data Security and Privacy

Protects PII, prevents data leakage, and enforces privacy controls across AI models and agents.

<details>

<summary>Do not store secrets in code</summary>

AI\_DAT\_SEC\_001

**Violation Summary**\
Storing secrets, such as API keys, access tokens, service credentials, MCP tokens, encryption keys, or database passwords, directly in code, configuration files, or agent prompt templates introduces an immediate and critical security vulnerability. Hard-coded secrets are easily exposed through source control, logs, error messages, LLM interactions, and dependency analysis. Once leaked, these credentials can be used to impersonate services, manipulate AI agent behavior, exfiltrate data, or compromise entire environments.

**Affected Assets**

* LLM
* AI Agent
* MCP Server

**Severity**

Critical

**Technical Details**\
Storing secrets in code introduces several risks including:&#x20;

* Unauthorized access to internal systems, APIs, and third-party services&#x20;
* Full environment compromise if privileged keys (e.g., root tokens) are exposed&#x20;
* Impersonation of agents, MCP clients, or downstream services&#x20;
* Lateral movement enabled through leaked credentials&#x20;
* Leakage via LLM outputs, agent error messages, or repository scans&#x20;
* Irreversible compromise of production systems due to difficult secret rotation&#x20;
* Violations of governance, transparency, and credential-handling requirements&#x20;

**Attack Vector:** Source code / repository / agent configuration\
**Attack Complexity:** Low\
**Privileges Required:** None\
**User Interaction:** None\
**Confidentiality Impact:** Critical\
**Integrity Impact:** Critical\
**Availability Impact:** Medium&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM**: LLM01, LLM03, LLM05, LLM08
* **March 2025 – OWASP-ASI**: ASI-01, ASI-04, ASI-09, ASI-12
* **Aug 1, 2024 – EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>If PII data must be shared, it must be encrypted</summary>

AI\_DAT\_SEC\_009

**Violation Summary**\
Transmitting personally identifiable information (PII) without encryption exposes sensitive user data to interception, tampering, unauthorized access, and regulatory non-compliance. Unencrypted PII flowing between AI agents, MCP clients and servers, microservices, or external APIs can be harvested by attackers or internal adversaries through network sniffing, logging systems, or compromised intermediaries. Such exposure creates severe privacy, legal, operational, and reputational risks.&#x20;

**Affected Assets**

* LLM
* AI Agent
* MCP Server

**Severity**

Critical

**Technical Details**\
Transmitting unencrypted PII introduces several risks including:&#x20;

* Exposure of sensitive user information through network interception&#x20;
* Unauthorized access to identity data, enabling fraud or impersonation&#x20;
* Regulatory violations (GDPR, EU AI Act, state privacy laws)&#x20;
* Inability to ensure integrity or authenticity of transmitted data&#x20;
* Leakage through LLM logs, telemetry, or debugging outputs&#x20;
* Lateral movement or privilege escalation through harvested identity data&#x20;
* Failure to meet encryption, security, and risk-management obligations&#x20;

**Attack Vector:** Network transit / API calls / agent communication\
**Attack Complexity:** Low\
**Privileges Required:** None\
**User Interaction:** None\
**Confidentiality Impact:** Critical\
**Integrity Impact:** High\
**Availability Impact:** Low&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM:** LLM03 (Data Leakage), LLM05 (Sensitive Information Disclosure), LLM08 (Insufficient Transparency & Auditability)
* **March 2025 – OWASP-ASI:** ASI-01 (Input/Output Integrity), ASI-05 (Data Security & Handling), ASI-09 (Audit & Traceability), ASI-12 (Operational Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 1.11 (Technical Documentation), 2.12 (Record-Keeping), 3.13 (Transparency), 4.50 (General Transparency Obligations), plus GDPR-aligned data protection expectations reflected across the Act&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Do not log PII</summary>

AI\_DAT\_SEC\_010

**Violation Summary**\
Logging personally identifiable information (PII) exposes sensitive user data to unauthorized access, replication, and long-term retention in unsecured or low-visibility systems. Log files are frequently accessible to broader engineering, operations, analytics, or third-party tools and often persist indefinitely. Once PII enters logs, it becomes extremely difficult to control, delete, audit, or protect—creating severe privacy, compliance, and security risks across all AI and MCP-enabled environments.&#x20;

**Affected Assets**

* LLM
* AI Agent
* MCP Server

**Severity**

Critical

**Technical Details**

Logging PII introduces several risks including:&#x20;

* Unauthorized internal access or external compromise of sensitive information&#x20;
* Accidental disclosure through debugging tools, telemetry pipelines, or log aggregators&#x20;
* Persistent exposure that violates data minimization and retention requirements&#x20;
* Inability to satisfy deletion, correction, or subject rights requests&#x20;
* Propagation of PII through downstream systems (LLM training data, observability tools, backups)&#x20;
* Legal and regulatory violations under GDPR, state privacy laws, and the EU AI Act&#x20;
* Loss of trust and reputational damage due to preventable data leakage&#x20;

**Attack Vector:** Logging systems / telemetry pipelines / observability tooling

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** Critical

**Integrity Impact:** Medium

**Availability Impact:** Low&#x20;

**Framework**&#x20;

* **Nov 18, 2024 - OWASP-LLM:** LLM03 (Data Leakage), LLM05 (Sensitive Information Disclosure), LLM08 (Insufficient Transparency & Auditability)
* **March, 2025 - OWASP-ASI:** ASI-01 (Integrity), ASI-05 (Data Handling), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug, 1, 2024 - EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), GDPR-aligned data-minimization principles&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Do not send PII to AI Models</summary>

AI\_DAT\_SEC\_011

**Violation Summary**\
Sending personally identifiable information (PII) to an AI Model exposes sensitive data to uncontrolled processing, persistence, training retention, unauthorized internal access, and unintended disclosure. AI Models are not guaranteed to handle PII according to data-minimization or privacy-by-design principles, and model outputs may inadvertently reveal, transform, or propagate sensitive information. This creates severe privacy, regulatory, and security risks across all AI-driven workflows.&#x20;

**Affected Assets**

AI Agent

**Severity**

Critical

**Technical Details**\
Sending PII to an AI Model introduces several risks including:

* Leakage of sensitive user information through outputs or indirect inference&#x20;
* Inclusion of PII in model logs, telemetry, or monitoring systems&#x20;
* Potential model retention or memorization of PII, enabling future extraction&#x20;
* Non-compliance with privacy regulations due to uncontrolled third-party processing&#x20;
* Exposure through prompt injection attacks that pull stored or inferred PII&#x20;
* Inability to enforce deletion, consent, or data subject rights&#x20;
* Violations of transparency, purpose limitation, and privacy-by-design obligations&#x20;

**Attack Vector:** AI Model input channel

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** Critical

**Integrity Impact:** Medium

**Availability Impact:** Low&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM:** LLM03 (Data Leakage), LLM05 (Sensitive Information Disclosure), LLM08 (Insufficient Transparency & Auditability)
* **Mar 2025 – OWASP-ASI:** ASI-01 (Integrity), ASI-05 (Data Handling), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus GDPR-aligned data-minimization and purpose-limitation requirements&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Mask PII on user interfaces</summary>

AI\_DAT\_SEC\_012

**Violation Summary**\
Displaying unmasked personally identifiable information (PII) on user interfaces exposes sensitive data to unauthorized viewing, shoulder surfing, screen sharing leaks, and over-privileged internal access. Any UI that renders full PII—names, addresses, SSNs, phone numbers, emails, financial data, or identifiers—creates a high risk of accidental disclosure and non-compliance. Unmasked PII can also be captured in screenshots, monitoring tools, session replay systems, or logs, further amplifying exposure.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**\
Not masking PII on UI introduces several risks including:&#x20;

* Unauthorized access or accidental exposure of sensitive identity information&#x20;
* Violation of least-privilege and data-minimization principles&#x20;
* &#x20;Increased likelihood of data leakage via screenshots, video recordings, demos, or shared sessions&#x20;
* Compromise through malicious insiders or overexposed customer support tools&#x20;
* Replication of PII into frontend logs, browser telemetry, or third-party analytics&#x20;
* Regulatory violations related to privacy, transparency, and secure data handling&#x20;
* Loss of user trust and potential legal liability&#x20;

**Attack Vector:** User interface display layer

**Attack Complexity:** Low

**Privileges Required:** None (visual exposure requires only observation)

**User Interaction:** None

**Confidentiality Impact:** Critical

**Integrity Impact:** Low

**Availability Impact:** Low&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM:** LLM03 (Data Leakage), LLM05 (Sensitive Information Disclosure), LLM08 (Insufficient Transparency & Auditability)
* **March 2025 – OWASP-ASI:** ASI-01 (Integrity), ASI-05 (Data Handling), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), GDPR-aligned principles of data minimization and privacy-by-design&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Redact PII from uploaded files</summary>

AI\_DAT\_SEC\_023

**Violation Summary**\
If uploaded files (documents, PDFs, spreadsheets, images with OCR, logs, archives) are ingested without redacting personally identifiable information (PII), sensitive data can be unintentionally exposed, propagated, or retained across AI systems. Unredacted PII may be processed by LLMs, logged, cached, embedded in prompts, or transmitted to external services—creating severe privacy, regulatory, and security risks that are difficult to detect and remediate after ingestion.

**Affected Assets**

AI Agent

**Severity**

Critical

**Technical Details**

Failing to redact PII from uploaded files introduces several risks including:

* Exposure of sensitive personal data through LLM processing, outputs, or logs
* Propagation of PII into prompts, embeddings, vector stores, and downstream systems
* Accidental disclosure via summaries, citations, or extracted insights
* Inability to honor data minimization, retention limits, or subject rights requests
* Increased blast radius when files are shared across agents or external tools
* Elevated risk of data leakage through prompt injection or model inference
* Violations of privacy-by-design, transparency, and record-keeping requirements

**Attack Vector**: File upload → parsing / OCR / document ingestion pipeline

**Attack Complexity**: Low

**Privileges Required**: None

**User Interaction**: None, especially in automated ingestion or agent workflows.

**Confidentiality Impact**: Critical

**Integrity Impact**: Medium

**Availability Impact**: Low

**Framework**

* **OWASP-LLM**: LLM03, LLM05, LLM08
* **OWASP-ASI**: ASI-01, ASI-05, ASI-09, ASI-12
* **EU AI Act**: Articles 11, 12, 13, plus GDPR-aligned data-minimization and privacy-by-design principles embedded across the Act

**References**

<https://genai.owasp.org/llm-top-10/>\
<https://genai.owasp.org/initiatives/agentic-security-initiative/>\
<https://artificialintelligenceact.eu/ai-act-explorer/>

</details>

<details>

<summary>Uploaded files must not contain PII (Singapore)</summary>

AI\_DAT\_SEC\_024

**Violation Summary**\
This applies to the information that is considered PII in Singapore. If files uploaded to an AI agent are ingested without redacting PII, sensitive data can be unintentionally exposed, propagated, retained, or disclosed through agent reasoning, LLM prompts, logs, embeddings, or downstream tool calls. Because AI agents often summarize, transform, store, and share file contents across systems, unredacted PII significantly amplifies privacy, compliance, and security risks and makes post-incident remediation extremely difficult.&#x20;

**Affected Assets**&#x20;

AI Agent&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Failing to redact PII from files uploaded to AI agents introduces several risks including:&#x20;

* Exposure of sensitive personal data through agent outputs, summaries, or citations&#x20;
* Propagation of PII into prompts, vector databases, caches, logs, and telemetry&#x20;
* Uncontrolled sharing of PII with external LLM providers or third-party tools&#x20;
* Inability to comply with data minimization, retention limits, or deletion requests&#x20;
* Increased risk of data leakage via prompt injection or inference attacks&#x20;
* Broader blast radius when agents reuse or redistribute file content&#x20;
* Violations of privacy-by-design, transparency, and record-keeping requirements&#x20;

**Attack Vector**: File upload → agent ingestion / parsing \
**Attack Complexity**: Low \
**Privileges Required**: None \
**User Interaction**: None (especially in automated or agent-driven workflows) \
**Confidentiality Impact**: Critical \
**Integrity Impact**: Medium \
**Availability Impact**: Low&#x20;

**Framework**&#x20;

* **OWASP LLM**: LLM03, LLM05, LLM08 &#x20;
* **OWASP ASI**: ASI-01, ASI-05, ASI-09, ASI-12 &#x20;
* **Singapore PDPA**: Section 11, 13, 18, 24&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>No file should contain any PII</summary>

AI\_DAT\_SEC\_025

**Violation Summary**\
If files contain unredacted personally identifiable information (PII), there is risk of sensitive data access to unrestricted access, misuse, and downstream propagation. This increases the likelihood of data leakage, unauthorized scraping, AI training contamination, and regulatory violations. Once exposed, PII can be copied, indexed, cached, or redistributed beyond the organization’s control.&#x20;

**Affected Assets**

AI Agent

**Severity**

Critical

**Technical Details**&#x20;

Failing to redact PII from publicly accessible files introduces several risks including:&#x20;

* Unrestricted access to sensitive personal data by internal users, contractors, or the public&#x20;
* Mass data harvesting, scraping, or indexing by automated tools and AI systems&#x20;
* Propagation of PII into LLM prompts, embeddings, search indexes, and external datasets&#x20;
* Inability to enforce consent, purpose limitation, or access controls&#x20;
* Permanent exposure due to caching, backups, screenshots, or mirrors&#x20;
* Increased insider threat and accidental disclosure risk&#x20;
* Severe regulatory, legal, and reputational impact&#x20;

**Attack Vector**: Public file access / shared repositories / collaboration platforms&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: Critical&#x20;

**Integrity Impact**: Medium&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **OWASP LLM**: LLM03, LLM05, LLM08 &#x20;
* **OWASP ASI**: ASI-01, ASI-05, ASI-09, ASI-12 &#x20;
* **EU AI Act**: Article 11, Article 12, Article 13, Article 50, GDPR-aligned data minimization and privacy-by-design principles&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Enforce output data minimization for model, tool, and API responses</summary>

AI\_DAT\_SEC\_027

**Violation Summary**

Adversaries and misconfigured pipelines can cause the system to return excessive internal, sensitive, or user-identifying data in completions, tool results, logs, or client payloads. Without minimization, PII, secrets, full records, and operational metadata leak through normal response paths.&#x20;

**Affected Assets**

* AI Model&#x20;
* AI Agent&#x20;
* LLM&#x20;
* MCP Server&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Missing or weak output data minimization for model, tool, and API responses controls introduces risks including:&#x20;

* Forwarding entire database rows, user profiles, complete document bodies, or unbounded retrieval sets into model prompts or client responses when only a subset is required.&#x20;
* Exposing raw internal metadata, such as primary keys, session identifiers, stack traces, file paths, or hostnames, to end users without redaction.&#x20;
* Placing secrets, API keys, tokens, passwords, private keys, or connection strings into completion text, tool results, error messages, or logs exposed to clients.&#x20;
* Returning unredacted tool, MCP, or plugin results to the user or upstream LLM without field allowlists, leading to arbitrary keys appearing in final outputs.&#x20;
* Appending full conversation histories, memory dumps, or prior-session transcripts to new responses instead of using a minimal summary or targeted excerpt.&#x20;
* Failing open on minimization or redaction errors, resulting in the return of original unfiltered payloads to the client rather than blocking or returning a safe error message.&#x20;

**Attack Vector**: Network / Application &#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None to Low&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: None to Low&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **Nov 18, 2024 — OWASP-LLM**: LLM06 (Sensitive Information Disclosure), LLM02 (Insecure Output Handling)&#x20;
* **March 2025 — OWASP-ASI**: ASI-05 (Data Leakage & Privacy Violations)&#x20;
* **Aug 1, 2024 — EU AI Act**: Obligations for high-risk AI systems around data governance, privacy, and security (e.g., Art. 10, Art. 15)&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

</details>

<details>

<summary>Enforce decision logging, audit trail, and forensic readiness for AI-driven actions</summary>

AI\_DAT\_SEC\_029

**Violation Summary**

AI systems that make or influence decisions affecting people, data, or operations without immutable audit trails and forensic-ready logs create accountability gaps and obstruct incident response. AI risk management accountability, transparency, and post-deployment monitoring expectations require traceable, tamper-evident records of every consequential AI-driven action.&#x20;

**Affected Assets**

* AI Model&#x20;
* AI Agent&#x20;
* LLM&#x20;
* MCP Server&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Missing or weak decision logging, audit trail, and forensic readiness for ai-driven actions controls introduces risks including:&#x20;

* Failing to log model identifiers, input hashes, outputs, and acting principals creates significant accountability gaps, making it impossible to trace consequential decisions back to specific origins.&#x20;
* Permitting mutable logs that can be updated, truncated, or deleted compromises the integrity of the audit trail, undermining trust and forensic investigations.&#x20;
* The lack of shared correlation identifiers across multi-step processes prevents administrators from reconstructing the full causal chain of events.&#x20;
* Operating without configured retention or log rotation policies can lead to compliance violations and unmanaged storage.&#x20;
* Catching logging exceptions silently without alerting or failing closed can result in permanent loss of audit data when the logging sink becomes unreachable.&#x20;
* Omitting essential forensic context, such as data lineage and retrieval metadata, makes it impossible to reproduce, explain, or thoroughly investigate AI-driven actions post-deployment.&#x20;

**Attack Vector**: Network / Application &#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None to Low &#x20;

**User Interaction**: None &#x20;

**Confidentiality Impact**: Low&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: None to Low&#x20;

**Framework**

* **Jan 26, 2023 — NIST AI RMF**: GOVERN 5.2 (Data Privacy & Security), MEASURE 3.1 (Privacy-Enhanced Evaluation)&#x20;
* **Nov 18, 2024 — OWASP-LLM**: LLM01 (Prompt Injection), LLM06 (Sensitive Information Disclosure), LLM08 (Excessive Agency)&#x20;
* **March 2025 — OWASP-ASI**: ASI-03 (Identity & Privilege Abuse); related themes ASI-01, ASI-04, ASI-07, ASI-09&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

</details>

<details>

<summary>Enforce minimum six-month log retention for high-risk AI systems</summary>

AI\_DAT\_SEC\_030

**Violation Summary**

Operators of high-risk AI systems must preserve automatically generated logs for at least six months in tamper-evident storage and must not delete or rotate them early. Without that floor, regulators and internal governance lose the evidence needed to reconstruct incidents, prove compliance, and defend decisions made by models and agents over their full evaluation window.&#x20;

**Affected Assets**

* AI Model&#x20;
* AI Agent&#x20;
* LLM Application&#x20;
* MCP Server&#x20;

**Severity**

Critical

**Technical Details**

Missing or weak six-month minimum log retention for high-risk AI systems controls introduces risks including:&#x20;

* Writing or routing AI logs to storage without an explicit retention period or TTL makes it impossible to prove that records will survive the minimum compliance window.&#x20;
* Retention settings, expirations, rotation schedules, or cleanup jobs shorter than 180 days silently discard inference, decision, and error history before the regulatory floor is met.&#x20;
* Unconditional deletion, purge, truncate, or overwrite paths can erase high-risk AI evidence even when records have not aged past six months.&#x20;
* Size- or count-capped rotation and time-based rotation without durable archival can drop rotated segments permanently instead of retaining them for the full 180 days.&#x20;
* High-risk code paths that perform inference, scoring, classification, or decisions without automatic lifetime event logging leave gaps where critical activity was never captured.&#x20;
* Lack of mechanisms for deployers to collect, store, and interpret auto-generated logs blocks independent oversight and slows breach or misuse investigations.&#x20;
* Storing retention-sensitive records in non-tamper-evident systems increases the chance that historical logs are altered or repudiated after the fact.&#x20;
* Drift between environments—where one stack honors 180 days and another does not—produces inconsistent audit posture across the same high-risk AI workload.&#x20;

**Attack Vector:** Network / Application&#x20;

**Attack Complexity:** Low&#x20;

**Privileges Required:** Low&#x20;

**User Interaction:** None&#x20;

**Confidentiality Impact:** Low&#x20;

**Integrity Impact:** High&#x20;

**Availability Impact:** None to Low&#x20;

**Framework**&#x20;

* **July 12, 2024 — EU AI Act**: Regulation (EU) 2024/1689 published in the Official Journal of the European Union&#x20;
* **Jan 26, 2023 — NIST AI RMF**: publication of the Artificial Intelligence Risk Management Framework (AI RMF 1.0)&#x20;

**References**

<https://artificialintelligenceact.eu/article/26/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

</details>

### Identity and Access Control

Enforces authentication and least-privilege controls between AI agents, MCP servers, and endpoints.

<details>

<summary>MCP client must authenticate the MCP server</summary>

AI\_IAC\_002

**Violation Summary**\
If an MCP client does not authenticate the MCP server, it cannot verify the identity, legitimacy, or trustworthiness of the system providing tool responses, commands, or data. This creates an opportunity for attackers to impersonate the MCP server, intercept or modify traffic, inject malicious tool responses, or deliver falsified data that influences downstream agent reasoning. Without authentication, the MCP trust boundary collapses, enabling man-in-the-middle attacks, data manipulation, workflow corruption, and full compromise of AI-driven operations.&#x20;

**Affected Assets**

* MCP Server
* MCP Client
* AI Agent

**Severity**

Critical

**Technical Details**\
Not authenticating the MCP server introduces several risks including:&#x20;

* Server impersonation leading to injection of malicious or misleading responses&#x20;
* Man-in-the-middle interception and modification of MCP traffic&#x20;
* Unauthorized access to sensitive MCP capabilities, tools, and agent operations&#x20;
* Corruption of workflows through falsified output or manipulated data&#x20;
* Leakage of PII or sensitive context exchanged with the illegitimate server&#x20;
* Loss of integrity, trust, and accountability in MCP-driven decisions&#x20;
* Violations of transparency, security, and traceability requirements&#x20;

**Attack Vector:** Network/API communication between MCP client and server

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** High

**Integrity Impact:** Critical

**Availability Impact:** Medium&#x20;

**Framework**

* **Nov 18, 2024 – OWASP-LLM:** LLM01 (Prompt/Instruction Injection), LLM04 (Model Behavior Manipulation), LLM05 (Sensitive Information Disclosure), LLM08 (Insufficient Transparency & Auditability)
* **March 2025 – OWASP-ASI:** ASI-01 (Integrity), ASI-04 (Governance), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus robustness and security obligations for high-risk systems&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>MCP server must authenticate all clients</summary>

AI\_IAC\_006

**Violation Summary**\
If an MCP server does not authenticate the client making requests, any unauthorized entity—including compromised agents, external attackers, or untrusted processes—can impersonate a legitimate client. This allows attackers to invoke privileged tools, access sensitive data, manipulate workflows, or trigger system actions without detection. Lack of client authentication effectively removes all trust boundaries, enabling full compromise of server-side capabilities and AI-driven operations.&#x20;

**Affected Assets**

* LLM
* AI Agent

**Severity**

Critical

**Technical Details**\
Not authenticating the MCP client introduces several risks including:&#x20;

* Unauthorized invocation of high-privilege tools or system actions&#x20;
* Full impersonation of trusted agents, enabling malicious or deceptive requests&#x20;
* Data exposure through unrestricted access to server responses, APIs, or internal systems&#x20;
* Manipulation of downstream workflows via falsified or maliciously crafted requests&#x20;
* Escalation of privilege or lateral movement across connected systems&#x20;
* Loss of accountability, traceability, and auditability for all client-driven actions&#x20;
* Violations of integrity, transparency, and security obligations for regulated AI systems&#x20;

**Attack Vector:** Client → MCP server request path

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** High

**Integrity Impact:** Critical

**Availability Impact:** Medium&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM:** LLM01 (Instruction Injection), LLM04 (Behavior Manipulation), LLM05 (Sensitive Data Exposure), LLM08 (Transparency & Auditability Failures)
* **Mar 2025 – OWASP-ASI:** ASI-01 (Integrity), ASI-04 (Governance), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus Annex III robustness & security requirements&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Inter-agent communication must be authenticated</summary>

AI\_IAC\_007

**Violation Summary**\
If agents communicate without authentication, any unauthorized party—including rogue agents, compromised services, or external attackers—can impersonate a legitimate agent and issue commands, request data, or alter system behavior. Non-authenticated inter-agent communication destroys trust boundaries between autonomous components and enables impersonation, privilege escalation, data leakage, workflow manipulation, and full compromise of multi-agent systems. Without identity guarantees, agent-to-agent messaging becomes a high-risk attack surface.

**Affected Assets**

AI Agent

**Severity**

Critical

**Technical Details**\
Missing authentication between agents introduces several risks including:&#x20;

* Unauthorized agents impersonating trusted components to issue commands&#x20;
* Manipulation of agent workflows or decision chains through falsified messages&#x20;
* Leakage of sensitive data exchanged during inter-agent coordination&#x20;
* Injection of malicious instructions into distributed reasoning processes&#x20;
* Loss of accountability and inability to attribute harmful actions&#x20;
* Increased lateral movement risk across agent networks&#x20;
* Violations of integrity, trust, and regulatory controls for autonomous systems&#x20;

**Attack Vector:** Inter-agent message channel / network communication

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** High

**Integrity Impact:** Critical

**Availability Impact:** Medium&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM:** LLM01 (Instruction Injection), LLM04 (Behavior Manipulation), LLM05 (Sensitive Data Disclosure), LLM08 (Transparency & Audit Failures)
* **March 2025 – OWASP-ASI:** ASI-01 (Integrity), ASI-04 (Governance), ASI-07 (Reliability), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), plus Annex III security & robustness expectations for high-risk systems&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>Agents must not hold excessive external system credentials</summary>

AI\_IAC\_008

**Violation Summary**\
When agents are configured with credentials to access more than three external systems, the blast radius of a compromise dramatically increases. Each additional credential expands the agent’s privilege footprint and creates new pathways for lateral movement, data exfiltration, unauthorized actions, and multi-system compromise. Over-privileged agents become single points of systemic failure if the agent is hijacked, attacked, misconfigured, or manipulated by an LLM prompt, all connected external systems are at risk simultaneously.&#x20;

**Affected Assets**

AI Agent

**Severity**

High/Critical

**Technical Details**\
Allowing an agent to hold multiple (3+) external system credentials introduces several risks including:&#x20;

* Large blast radius: compromising the agent compromises all connected systems
* Increased likelihood of credential leakage through logs, LLM outputs, prompts, or tool interactions&#x20;
* Prompt injection enabling unauthorized use of high-privilege multi-system access&#x20;
* Lateral movement across different platforms (e.g., Jira → GitHub → AWS → Snowflake)&#x20;
* Violation of least-privilege and separation-of-duties principles&#x20;
* Difficulty revoking or rotating credentials in incident response&#x20;
* Loss of governance and traceability when many systems are accessed through one agent identity&#x20;

**Attack Vector:** Agent credential store / agent-initiated external API calls

**Attack Complexity:** Low

**Privileges Required:** None

**User Interaction:** None

**Confidentiality Impact:** Critical

**Integrity Impact:** Critical

**Availability Impact:** Medium to High&#x20;

**Framework**&#x20;

* **Nov 18, 2024 – OWASP-LLM:** LLM01 (Injection), LLM04 (Behavior Manipulation), LLM05 (Sensitive Data Disclosure), LLM08 (Transparency & Audit Failures)
* **Mar 2025 – OWASP-ASI:** ASI-01 (Integrity), ASI-04 (Governance), ASI-05 (Safe Handling), ASI-09 (Traceability), ASI-12 (Monitoring)
* **Aug 1, 2024 – EU AI Act:** Articles 11 (Documentation), 12 (Record-Keeping), 13 (Transparency), 50 (Transparency Obligations), Annex III controls emphasizing least privilege, system robustness, and secure integration&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com)\
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com)\
<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

</details>

<details>

<summary>A user must authenticate before accessing the AI Agent</summary>

AI\_IAC\_014

**Violation Summary**

Allowing access to an AI Agent without user authentication enables unauthorized actors to invoke agent capabilities, consume resources, and perform malicious actions under anonymous or unverified identities. This weakens accountability, bypasses access governance, and increases the risk of abuse in production and enterprise environments. &#x20;

**Affected Assets** &#x20;

AI Agent &#x20;

**Severity**

High

**Technical Details**

Missing authentication before AI Agent access introduces several risks including: &#x20;

* Unauthorized use of AI Agent functionality by external or internal actors &#x20;
* Bypass of identity-based controls and audit accountability &#x20;
* Potential misuse for malicious prompting, automation abuse, or harmful output generation&#x20;
* Increased exposure of sensitive responses or restricted agent behaviors to unverified users compliance and governance violations related to access control requirements &#x20;

**Attack Vector**: Network/Application Access &#x20;

**Attack Complexity**: Low &#x20;

**Privileges Required**: None &#x20;

**User Interaction**: None &#x20;

**Confidentially Impact**: High &#x20;

**Integrity Impact**: High &#x20;

**Availability Impact**: Medium &#x20;

**Framework**&#x20;

* **Dec 2025 – OWASP-ASI**: ASI-03&#x20;
* **Mar 2025 – OWASP-LLM**: LLM-07&#x20;
* **Jul 2024 – EU AI Act**: Article 15 and Recital 66 (high-risk AI systems: accuracy, robustness, cybersecurity)&#x20;

**References** \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
<https://artificialintelligenceact.eu/ai-act-explorer/>&#x20;

</details>

<details>

<summary>Enforce URL allowlists for agent fetches, tools, and outbound HTTP</summary>

AI\_IAC\_015

**Violation Summary**

Unrestricted URLs let prompt injection or tool abuse drive server-side request forgery, data exfiltration, and lateral movement. LLM and agentic security guidance and operational controls for AI systems expect explicit allowlisting and scheme restrictions for any automated retrieval or callback.&#x20;

**Affected Assets**&#x20;

* AI Model&#x20;
* MCP Server
* LLM &#x20;

**Severity**

Critical

**Technical Details**

Missing or weak URL allowlists for agent fetches, tools, and outbound http controls introduces risks including:&#x20;

* Allowing arbitrary URL fetches enables attackers to use the AI agent to execute server-side request forgery (SSRF), accessing sensitive internal networks, link-local addresses, and cloud metadata endpoints.&#x20;
* Permitting unvalidated model-chosen or user-supplied URLs without strict host and path rules allows malicious actors to covertly exfiltrate data to external attacker-controlled servers.&#x20;
* Permitting dangerous or custom schemes (such as [file://](file:///), gopher://, dict://, or ftp\://) instead of enforcing a default deny-list allows attackers to pivot and move laterally within the internal network.&#x20;
* Failing to re-validate each HTTP redirect hop against the allowlist allows attackers to bypass initial URL checks and redirect the agent to restricted targets.&#x20;

**Attack Vector**: Network &#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None to Low &#x20;

**User Interaction**: None &#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low to Medium&#x20;

**Framework**

* **Nov 18, 2024 — OWASP-LLM**: LLM06 (Excessive Agency), LLM07 (Insecure Plugin Design / System Interaction)&#x20;
* **March 2025 — OWASP-ASI**: ASI-02 (Tool Misuse & Exploitation); ASI-01, ASI-03, ASI-04, ASI-07, ASI-09&#x20;
* **Aug 1, 2024 — EU AI Act**: Obligations for high-risk AI systems around security, accuracy, robustness, and human oversight (e.g., Art. 15 Cybersecurity & Robustness, Art. 14 Human Oversight, Art. 9 Risk Management) &#x20;

**References**

[https://genai.owasp.org/llm-top-10/ ](<https://genai.owasp.org/llm-top-10/ >)

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

</details>

<details>

<summary>Detect and block agent privilege escalation attempts</summary>

AI\_IAC\_016

**Violation Summary**&#x20;

When agent orchestration does not detect and block privilege escalation, attackers or flawed automation can expand OAuth scopes, administrative APIs, roles, or dangerous tools beyond what the user or policy approved. That breaks least-privilege expectations for agentic systems and enables unauthorized or destructive actions with the appearance of legitimate agent behavior.&#x20;

**Affected Assets**

* AI Agent&#x20;
* LLM &#x20;
* MCP Server&#x20;

**Severity**

Critical

**Technical Details**

Missing or weak escalation controls introduces risks including:&#x20;

* Unbounded scope and capability growth driven by model output, chat, or tool results without human-in-the-loop approval or static policy comparison&#x20;
* Dynamic registration or use of admin-only, destructive, or credential-bearing tools without verifying the bound identity’s role&#x20;
* Role or permission changes triggered by model-generated content instead of governed identity workflows&#x20;
* Execution of tool plans without comparing requested tools, HTTP methods, or resource paths to a maximum-privilege profile captured at bind time&#x20;
* Fail-open behavior that logs suspected escalation but still executes the action&#x20;

**Attack Vector**: Network / Application (abuse of agent tooling and authorization flows)&#x20;

**Attack Complexity**: Low to Medium&#x20;

**Privileges Required**: Low (often none beyond normal agent or user session use)&#x20;

**User Interaction**: None to Limited&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low to Medium&#x20;

**Framework**

* **Nov 18, 2024 — OWASP-LLM**: LLM01 (Prompt Injection), LLM06 (Sensitive Information Disclosure), LLM08 (Excessive Agency)&#x20;
* **March 2025 — OWASP-ASI**: ASI-03 (Identity & Privilege Abuse); related themes ASI-01, ASI-04, ASI-07, ASI-09&#x20;
* **Aug 1, 2024 — EU AI Act**: obligations for high-risk AI systems around security, accuracy, robustness, and human oversight (e.g. Art. 15, 14, 9 context-dependent)&#x20;

**References**

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

</details>

<details>

<summary>Maintain session token integrity with signing, verification, expiry, and binding</summary>

AI\_IAC\_017

**Violation Summary**

Weak or improperly handled session tokens for agents, APIs, or MCP clients enable forgery, session fixation, confused-deputy behavior, and long-lived unauthorized access. Without integrity, authenticity, expiry, binding, and consistent validation, session identifiers become a high-value target and a weak control for delegated agent actions.&#x20;

**Affected Assets**

* AI Agent&#x20;
* LLM
* MCP Server&#x20;
* MCP Client&#x20;

**Severity**

Critical

**Technical Details**

Inadequate session token design and validation introduce risks including:&#x20;

* Accepting unsigned or unauthenticated tokens (e.g. trusting opaque random IDs with no MAC or signature verification)&#x20;
* Guessable or sequential identifiers instead of cryptographically secure randomness&#x20;
* Missing or unenforced expiry and lack of server-side invalidation&#x20;
* Timing-sensitive comparison of secrets (e.g. plain == on HMACs or bearer tokens) exposing tokens to guessing or side channels&#x20;
* Session tokens in URLs, logs, or client-visible errors (leakage and replay)&#x20;
* Continuing to honor old session tokens after role or scope elevation without re-issuing bound tokens&#x20;

**Attack Vector**: Network&#x20;

**Attack Complexity**: Low to Medium&#x20;

**Privileges Required**: None to Low&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low&#x20;

**Framework**

* **Nov 18, 2024 — OWASP-LLM**: LLM07 (Insecure Plugin/Tool Design) where sessions gate tool use; LLM06 (Sensitive Information Disclosure) for token exposure&#x20;
* **March 2025 — OWASP-ASI**: ASI-03 (Identity & Privilege Abuse)&#x20;

**References**

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

</details>

<details>

<summary>Enforce cryptographically verified user-to-agent binding for every request</summary>

AI\_IAC\_018

**Violation Summary**

Without binding, attackers or confused deputies can steer an agent with another user's identity, reuse pooled agents across principals, or override the authenticated subject using client-supplied fields or model text. AI governance expectations and agentic identity guidance require stable, verified binding between human or service identity and agent execution context.&#x20;

**Affected Assets**

* AI Agent&#x20;
* LLM
* MCP Server&#x20;

**Severity**

Critical

**Technical Details**

Missing or weak cryptographically verified user-to-agent binding for every request control introduces risks including:&#x20;

* Threat actors overriding authenticated subjects by supplying unverified JSON fields, query parameters, or unsigned headers, allowing them to impersonate other users for tool calls, MCP invocations, and downstream API requests.&#x20;
* Parsing LLM or tool output to change the bound user, role, tenant, or OAuth subject allows the model or attacker to manipulate the execution identity before privileged actions.&#x20;
* Reusing a single long-lived agent instance, connection, or MCP session across different HTTP requests or WebSocket sessions without explicitly re-initializing and re-verifying identity allows attackers to hijack pooled agents and access data across principals.&#x20;
* Invoking high-risk tools (e.g., payments, admin APIs, PII export) without explicitly verifying that the currently bound subject matches the resource owner or an authorized delegate enables unauthorized actions.&#x20;
* Continuing agent execution with a default or anonymous user when authentication or binding validation fails provides a fallback mechanism that attackers can exploit to bypass access controls entirely.&#x20;

**Attack Vector**: Network &#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None to Low&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: Low to Medium&#x20;

**Timeline**

* **Jan 26, 2023 — NIST AI RMF 1.0**: GOVERN 4.1, MAP 4.1.&#x20;
* **Nov 18, 2024 — OWASP Top 10 for LLM Applications**: LLM08 (Excessive Agency) and LLM06 (Sensitive Information Disclosure).&#x20;
* **March 2025 — OWASP Agentic Security Initiative (ASI)**: ASI-01 (Agent Identity) , ASI-03 (Identity & Privilege Abuse).&#x20;

**References**

[https://genai.owasp.org/llm-top-10/ ](<https://genai.owasp.org/llm-top-10/ >)

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

</details>

<details>

<summary>Restrict AI agents to an explicit tool allowlist</summary>

AI\_IAC\_SEC\_020

**Violation Summary**

AI agents that can invoke arbitrary local or remote tools are highly susceptible to prompt-injection-driven abuse, privilege creep, and unintended data exfiltration. Enforcing a centrally managed allow list ensures only approved capabilities are reachable and prevents fallback to broad or dynamic tool access.&#x20;

**Affected Assets**

* AI Agent&#x20;
* LLM
* MCP Server&#x20;
* MCP Client&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Missing or weak restrict ai agents to an explicit tool allow list controls introduces risks including:&#x20;

* Executing tools from a registry, plugin catalog, MCP server list, or runtime-discovered manifest without verifying membership in an approved allow list.&#x20;
* Permitting configurations that use '\*' tool names, default-all behavior, or interpret empty allow lists as granting full access.&#x20;
* Enabling or adding tools directly from LLM outputs, prompt text, or untrusted metadata without server-side policy validation.&#x20;
* Reusing a single global allow list across distinct user roles or task classes instead of applying narrower, scoped lists.&#x20;
* Proceeding with tool execution when an allow list fetch, parse, signature check, or policy service lookup fails, rather than failing closed.&#x20;
* Failing to log attempted tool IDs, actors, policy versions, and denial reasons to a protected audit sink when unauthorized tools are blocked.&#x20;

**Attack Vector**: Network / Application&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: None to Low &#x20;

**User Interaction**: None to Limited&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: High&#x20;

**Framework**

* **Oct 2023 — OWASP-LLM (v1.1)**: LLM01 (Prompt Injection), LLM06 (Sensitive Information Disclosure), LLM08 (Excessive Agency).&#x20;
* **March 2025 — OWASP-ASI**: ASI-05; ASI-01, ASI-03, ASI-04, ASI-07, ASI-09.&#x20;
* **Jan 26, 2023 — NIST AI RMF 1.0**: MANAGE, GOVERN&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework> &#x20;

</details>

<details>

<summary>Enforce resource bounds, termination limits, and traceability for subagent spawning</summary>

AI\_IAC\_022

**Violation Summary**&#x20;

Unbounded spawning of AI subagents without strict lifecycle constraints can lead to runaway autonomous processes, infinite loops, and 'agentic fork-bombs' that exhaust compute, memory, or API budgets. Agent orchestration code must enforce hard timeouts, maximum execution step counts, concurrency limits, and structured parent-child traceability at every point a subagent is created. &#x20;

**Affected Assets**&#x20;

* AI Agent&#x20;
* LLM

**Severity**

High

**Technical Details**

Missing or weak autonomous subagent spawning and execution controls introduces risks including:&#x20;

* Unsafe or inconsistent agent behavior from missing execution bounds (no timeouts or step counts), allowing subagents to run indefinitely and exhaust compute or API quotas.&#x20;
* Agentic fork-bomb conditions arising from loop-based or recursive spawning with no counter guard, causing uncontrolled replication of child processes.&#x20;
* Prompt/task injection via unvalidated instruction passthrough, raw LLM or upstream agent outputs passed directly to subagents without format verification or assertion checks.&#x20;
* Loss of forensic traceability when spawn events lack correlation or parent-agent identifiers, making incident reconstruction and chain-of-custody auditing impossible.&#x20;
* Resource exhaustion (memory, API budget, concurrency slots) due to absent concurrency limits and unbounded spawning across parallel workflows.&#x20;
* Compliance and governance failures stemming from absent structured audit logs at spawn sites, preventing detection of policy violations and accountability attributes.&#x20;

**Attack Vector**: Network&#x20;

**Attack Complexity**: Low&#x20;

**Privileges Required**: Low&#x20;

**User Interaction**: None&#x20;

**Confidentially Impact**: Low&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: High&#x20;

**Framework**

**Jan 26, 2023 - NIST AI RMF**: MANAGE 3.1&#x20;

**Jan 26, 2023 - NIST AI RMF**: MEASURE 2.4&#x20;

**Nov 18, 2024 - OWASP LLM Top 10 (v1.1)**: LLM04&#x20;

**Nov 18, 2024 - OWASP LLM Top 10 (v1.1)**: LLM01&#x20;

**Dec 2025 - OWASP Agentic Security Initiative Top 10 (2026)**: ASI08&#x20;

**References**

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://nvlpubs.nist.gov/nistpubs/ai/NIST.AI.100-1.pdf>&#x20;

</details>

<details>

<summary>Chatbot and AI interfaces must disclose AI identity to the user</summary>

AI\_IAC\_023

**Violation Summary**

AI systems that interact directly with natural persons must inform users they are communicating with an AI. When chatbot and conversational AI interfaces fail to disclose their AI nature, users are deceived into believing they are engaging with a human agent. This undermines informed consent, enables psychological manipulation, and violates transparency obligations under the EU AI Act Article 50. Suppressing or omitting AI identity disclosure exposes organizations to regulatory penalties and erodes user trust. &#x20;

**Affected Assets** &#x20;

* AI Agent &#x20;
* LLM Application &#x20;
* AI Model &#x20;

**Severity**

Critical

**Technical Details**

Failing to disclose AI identity in chatbot and conversational interfaces introduces several risks including: &#x20;

* Users deceived into believing they are communicating with a human, enabling manipulation and false trust &#x20;
* Inability of users to make informed decisions about disclosing private or sensitive details &#x20;
* Regulatory penalties and legal liability arising from non-compliance with mandatory disclosure requirements &#x20;

**Attack Vector:** Network &#x20;

**Attack Complexity:** Low &#x20;

**Privileges Required:** None &#x20;

**User Interaction:** Required &#x20;

**Confidentiality Impact:** Medium &#x20;

**Integrity Impact:** Medium &#x20;

**Availability Impact:** Low &#x20;

**Frameworks** &#x20;

**EU AI Act**:  Article 50 &#x20;

**NIST AI RMF**: GOVERN 1.1, GOVERN 1.2, MEASURE 2.8 &#x20;

**OWASP LLM**: LLM09 &#x20;

**OWASP ASI**: ASI-03, ASI-09 &#x20;

**References**

<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

<https://genai.owasp.org/llm-top-10/> &#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/> &#x20;

</details>

<details>

<summary>General purpose AI model integrations must reference a model card or technical documentation</summary>

AI\_IAC\_024

**Violation Summary**

Code that integrates a general-purpose AI (GPAI) model such as GPT, Claude, Gemini, Mistral, or Llama must reference a model card or technical documentation in the same module. When model card references are absent, developers, auditors, and incident responders cannot verify model provenance, capabilities, known limitations, or bias characteristics. This absence creates compliance gaps under EU AI Act Article 53, undermines AI system auditability, and increases the risk of deploying models with undisclosed failure modes or unsafe behaviors. &#x20;

**Affected Assets**

* AI Agent &#x20;
* LLM Application &#x20;
* AI Model &#x20;

**Severity**

Medium

**Technical Details**

Failing to reference a model card or technical documentation in GPAI integrations introduces several risks including: &#x20;

* Inability to audit AI model provenance, versioning, and known limitations during development and review &#x20;
* Deployment of deprecated, unvalidated, or unsafe model versions without awareness of documented risks &#x20;
* Developers and auditors unable to assess model risk, biases, or behavioral constraints without documentation reference &#x20;
* Lack of traceability for AI-generated outputs during incident response and forensic investigation &#x20;
* Hidden use of models with known biases, harmful capabilities, or usage restrictions not communicated to the team &#x20;

**Attack Vector:** Local &#x20;

**Attack Complexity:** Low &#x20;

**Privileges Required:** Low &#x20;

**User Interaction:** None &#x20;

**Confidentiality Impact:** Low &#x20;

**Integrity Impact:** Medium &#x20;

**Availability Impact:** Low &#x20;

**Framework**

* **EU AI Act**: Article 53 &#x20;
* **NIST AI RMF**: GOVERN 1.1, GOVERN 1.6, MEASURE 2.9 &#x20;
* **OWASP LLM**: LLM03, LLM09 &#x20;
* **OWASP ASI**: ASI-04 &#x20;

**References**

<https://artificialintelligenceact.eu/ai-act-explorer/> &#x20;

<https://genai.owasp.org/llm-top-10/> &#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/> &#x20;

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

</details>

### Vulnerability

Continuously detects and remediates critical software weaknesses inside AI assets.

<details>

<summary>Do not allow dependencies with critical or high severity vulnerabilities</summary>

AI\_VULN\_SEC\_001

**Violation Summary**\
Allowing AI agents to depend on libraries or packages with known critical or high-severity vulnerabilities introduces a direct and exploitable attack surface into the agent runtime. AI agents typically operate with elevated privileges, access sensitive data, invoke tools, and interact with external systems. Vulnerable dependencies can be exploited to execute arbitrary code, escalate privileges, leak secrets, poison agent behavior, or compromise downstream systems—often without direct interaction with the LLM itself.&#x20;

**Affected Assets**&#x20;

* AI Agent&#x20;
* MCP Server&#x20;
* LLM&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Using dependencies with critical or high vulnerabilities introduces several risks including:&#x20;

* Remote code execution or arbitrary command execution within the agent environment&#x20;
* Credential theft, token leakage, or exposure of secrets used by the agent&#x20;
* Supply-chain attacks where malicious code is introduced via compromised packages&#x20;
* Manipulation or poisoning of agent logic, tool invocation, or decision flows&#x20;
* Lateral movement across systems accessed by the agent&#x20;
* Persistence mechanisms established through compromised libraries&#x20;
* Inability to trust agent outputs or actions due to compromised runtime integrity&#x20;
* Violations of secure development, patch management, and risk-management obligations&#x20;

**Attack Vector**: Vulnerable third-party dependency / supply-chain compromise \
**Attack Complexity**: Low \
**Privileges Required**: None (exploits often execute with agent privileges) \
**User Interaction**: None \
**Confidentiality Impact**: Critical \
**Integrity Impact**: Critical \
**Availability Impact**: Medium to High&#x20;

**Framework**

* **OWASP LLM**: LLM05, LLM08&#x20;
* **OWASP ASI**: ASI-05 &#x20;
* **NIST SSDF**: Practice RV.1&#x20;

**References**

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://csrc.nist.gov/pubs/sp/800/218/final> &#x20;

</details>

<details>

<summary>Do not allow critical or high vulnerabilities in the code</summary>

AI\_VULN\_SEC\_002

**Violation Summary**\
The presence of vulnerabilities in application or agent code introduces direct security risks that can be exploited to compromise confidentiality, integrity, and availability. Vulnerable code paths—such as injection flaws, insecure deserialization, broken authentication, improper authorization, or unsafe file handling—can be abused by attackers to execute arbitrary code, access sensitive data, manipulate AI behavior, or disrupt operations. In AI and agent-based systems, these vulnerabilities are especially dangerous because compromised code can influence autonomous decisions and propagate impact across multiple systems.&#x20;

**Affected Assets**&#x20;

* AI Agent&#x20;
* MCP Server&#x20;
* LLM&#x20;

**Severity**

Critical

**Technical Details**&#x20;

Having vulnerabilities in code introduces several risks including:&#x20;

* Remote code execution or command injection through exploitable code paths&#x20;
* Unauthorized access to sensitive data, credentials, or internal APIs&#x20;
* Manipulation of AI agent logic, reasoning flows, or tool invocation&#x20;
* Privilege escalation or bypass of authorization controls&#x20;
* Lateral movement across integrated systems and services&#x20;
* Persistence mechanisms established through exploited vulnerabilities&#x20;
* Loss of trust in application outputs and automated decisions&#x20;
* Violations of secure development lifecycle and regulatory requirements&#x20;

**Attack Vector** : Vulnerable application or agent code&#x20;

**Attack Complexity**: Low to Medium (depending on vulnerability type)

**Privileges Required**: None (for many common vulnerabilities)&#x20;

**User Interaction**: None or minimal&#x20;

**Confidentiality Impact**: Critical&#x20;

**Integrity Impact**: Critical&#x20;

**Availability Impact**: Medium to High&#x20;

**Framework**

* **OWASP LLM**: LLM05, LLM08&#x20;
* **OWASP ASI**: ASI-05 &#x20;
* **NIST SSDF**: Practice RV.2&#x20;

**References**&#x20;

[https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com) \
[https://genai.owasp.org/initiatives/agentic-security-initiative/](https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com) \
<https://csrc.nist.gov/pubs/sp/800/218/final&#x20>;

</details>

<details>

<summary>Enforce foundation model identity, version pinning, and approved model registry for all AI workloads</summary>

AI\_VULN\_SEC\_005&#x20;

**Violation Summary**

Using unverified, unversioned, or unregistered foundation models exposes the system to supply-chain compromise, silent model swaps, and untracked behavioral drift. AI risk management third-party supply chain controls and pre-trained model monitoring expectations require every model loaded at inference or fine-tuning time to be resolved from an approved registry with cryptographic identity verification and pinned versions.&#x20;

**Affected Assets**&#x20;

* AI Model&#x20;
* AI Agent&#x20;
* MCP Server&#x20;
* LLM&#x20;

**Severity**

Critical

**Technical Details**

Missing foundation model identity, version pinning, and approved model registry for all AI workloads controls introduces risks including:&#x20;

* Loading models from arbitrary, unregistered URLs or paths exposes the system to malicious or tampered artifacts and supply-chain compromise.&#x20;
* Allowing user input or prompt text to select active models or failing to verify cryptographic hashes/signatures against a known-good manifest, permits unauthorized execution and silent model swaps.&#x20;
* Using mutable tags (like \`latest\`) instead of immutable identifiers or digest pins causes untracked behavioral drift, making it impossible to guarantee consistent model behavior over time.&#x20;
* Serving predictions without recording the resolved model identifier and version in request metadata prevents effective forensic readiness, monitoring, and accountability. &#x20;
* Failing open and proceeding to load models when registry lookups or hash verifications fail bypasses critical security gates and leads to insecure fallback states.&#x20;

**Attack Vector**: Network &#x20;

**Attack Complexity**: Medium to High&#x20;

**Privileges Required**: High&#x20;

**User Interaction**: None&#x20;

**Confidentiality Impact**: High&#x20;

**Integrity Impact**: High&#x20;

**Availability Impact**: High&#x20;

**Timeline**

* **Jan 26, 2023 — NIST AI RMF 1.0**: GOVERN 4 and MAP 4.2.&#x20;
* **Nov 18, 2024 — OWASP-LLM**: LLM01 (Prompt Injection), LLM06 (Sensitive Information Disclosure), LLM08 (Excessive Agency)&#x20;
* **March 2025 — OWASP-ASI**: ASI-03 (Identity & Privilege Abuse); related themes ASI-01, ASI-04, ASI-07, ASI-09&#x20;

**References**

<https://genai.owasp.org/llm-top-10/>&#x20;

<https://genai.owasp.org/initiatives/agentic-security-initiative/>&#x20;

<https://www.nist.gov/itl/ai-risk-management-framework>&#x20;

</details>

<details>

<summary>Memory safety and buffer overflow prevention in native AI code (C/C++/Rust)</summary>

AI\_VULN\_SEC\_006

**Violation Summary**&#x20;

Native C, C++, and Rust used in AI workloads can suffer memory corruption and undefined behavior when bounds, ownership, and concurrency are not enforced, enabling crashes, data leakage, and remote code execution along the AI inference or tooling path. &#x20;

**Affected Assets**

* LLM &#x20;
* MCP Server &#x20;
* AI Agent &#x20;

**Severity**

Critical

**Technical Details**&#x20;

Unsafe native patterns in AI-related code introduce risks including: &#x20;

* Unbounded or unchecked copies (strcpy, strcat, gets, sprintf, unsafe memcpy/memmove/memset lengths) &#x20;
* Off-by-one and wrapping size arithmetic that exceeds allocated objects &#x20;
* Stack buffers filled from untrusted network, file, IPC, or tensor dimensions without clamping &#x20;
* Double free, use-after-free, invalid free, and ambiguous ownership of raw pointers &#x20;
* Missing paired release on all return paths (including exceptions) without RAII &#x20;
* Unsound Rust unsafe (raw deref from unvalidated input, bad transmute, unchecked indexing) &#x20;
* Data races on shared buffers without locks or atomics &#x20;
* Disabled sanitizers or bounds checks in production builds &#x20;

**Attack Vector:** Network &#x20;

**Attack Complexity:** High &#x20;

**Privileges Required:** None &#x20;

**User Interaction:** None &#x20;

**Confidentially Impact:** High &#x20;

**Integrity Impact:** High &#x20;

**Availability Impact:** High &#x20;

**Framework** &#x20;

2024–2025 - OWASP GenAI LLM Top 10 (native components in AI stacks) &#x20;

**References**

<https://genai.owasp.org/llm-top-10/> &#x20;

<https://cwe.mitre.org/> &#x20;

</details>

### Skills

Automatically discovers, scans, and tracks skills in your inventory.

<details>

<summary>Do not allow use of malicious skills</summary>

AI\_SKILL\_SEC\_001&#x20;

**Violation Summary**&#x20;

Use of malicious or untrusted skills introduces risks of unauthorized actions, data theft, remote command execution, privilege abuse, and compromise of AI agent integrity.&#x20;

**Affected Assets**

* LLM&#x20;
* AI Agent &#x20;
* Skill

**Severity**

Critical

**Technical Details**

Malicious skills introduce several risks including:&#x20;

* Unauthorized execution of privileged actions&#x20;
* Data exfiltration through tool or API access&#x20;
* Remote code execution through unsafe tool invocation&#x20;
* Abuse of implicit trust between agent and skill&#x20;
* Credential theft or token misuse&#x20;
* Manipulation of agent reasoning and decision-making&#x20;
* Persistence through malicious extensions or runtime modifications&#x20;
* Supply chain compromise through third-party skills&#x20;
* Bypass of governance, policy, or approval controls&#x20;
* Unsafe autonomous behavior and unintended tool chaining&#x20;

**Attack Vector:** Skill Extension&#x20;

**Attack Complexity:** Low&#x20;

**Privileges Required:** Low to None&#x20;

**User Interaction:** Required&#x20;

**Confidentiality Impact:** High&#x20;

**Integrity Impact:** High&#x20;

**Availability Impact:** Medium&#x20;

**Framework**

* **Nov 18, 2024** **- OWASP-LLM**: LLM01, LLM02, LLM03&#x20;
* **Mar 2025** **- OWASP-ASI**: ASI-01, ASI-02, ASI-04, ASI-05, ASI-06&#x20;
* **Mar 31, 2026 – OWASP Skills Top 10**: AST01, AST02 &#x20;
* **Aug 1, 2024 - EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;
* **Jan 26, 2023 - NIST AI RMF 1.0**: GOVERN 1.6, GOVERN 3.2, MAP 4.1, MANAGE 2.1, MANAGE 4.1&#x20;

**References**

<https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com>

<https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com>

<https://owasp.org/www-project-agentic-skills-top-10/>

<https://artificialintelligenceact.eu/ai-act-explorer/?utm_source=chatgpt.com>

<https://www.nist.gov/itl/ai-risk-management-framework?utm_source=chatgpt.com>

</details>

<details>

<summary>Do not allow use of suspicious skills</summary>

AI\_SKILL\_SEC\_002

**Violation Summary**

Use of malicious or untrusted skills introduces risks of unauthorized actions, data theft, remote command execution, privilege abuse, and compromise of AI agent integrity.&#x20;

**Affected Assets**

* LLM&#x20;
* AI Agent&#x20;
* Skill

**Severity**

High

**Technical Details**

Malicious skills introduce several risks including:&#x20;

* Unauthorized execution of privileged actions&#x20;
* Data exfiltration through tool or API access&#x20;
* Remote code execution through unsafe tool invocation&#x20;
* Abuse of implicit trust between agent and skill&#x20;
* Credential theft or token misuse&#x20;
* Manipulation of agent reasoning and decision-making&#x20;
* Persistence through malicious extensions or runtime modifications&#x20;
* Supply chain compromise through third-party skills&#x20;
* Bypass of governance, policy, or approval controls&#x20;
* Unsafe autonomous behavior and unintended tool chaining&#x20;

**Attack Vector:** Skill Extension&#x20;

**Attack Complexity:** Low&#x20;

**Privileges Required:** Low to None&#x20;

**User Interaction:** Required&#x20;

**Confidentiality Impact:** High&#x20;

**Integrity Impact:** High&#x20;

**Availability Impact:** Medium&#x20;

**Framework**

* **Nov 18, 2024 - OWASP-LLM**: LLM01, LLM02, LLM03&#x20;
* **Mar 2025 - OWASP-ASI**: ASI-01, ASI-02, ASI-04, ASI-05, ASI-06&#x20;
* **Mar 31, 2026 – OWASP Skills Top 10**: AST01, AST02 &#x20;
* **Aug 1, 2024 - EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;
* **Jan 26, 2023 - NIST AI RMF 1.0**: GOVERN 1.6, GOVERN 3.2, MAP 4.1, MANAGE 2.1, MANAGE 4.1&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com>

<https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com>

<https://owasp.org/www-project-agentic-skills-top-10/>

<https://artificialintelligenceact.eu/ai-act-explorer/?utm_source=chatgpt.com>

<https://www.nist.gov/itl/ai-risk-management-framework?utm_source=chatgpt.com>

</details>

<details>

<summary>Do not allow use of unknown skills</summary>

AI\_SKILL\_SEC\_003

**Violation Summary**

Use of malicious or untrusted skills introduces risks of unauthorized actions, data theft, remote command execution, privilege abuse, and compromise of AI agent integrity.&#x20;

**Affected Assets**&#x20;

* LLM&#x20;
* AI Agent&#x20;
* Skills

**Severity**

High

**Technical Details**

Malicious skills introduce several risks including:&#x20;

* Unauthorized execution of privileged actions&#x20;
* Data exfiltration through tool or API access&#x20;
* Remote code execution through unsafe tool invocation&#x20;
* Abuse of implicit trust between agent and skill&#x20;
* Credential theft or token misuse&#x20;
* Manipulation of agent reasoning and decision-making&#x20;
* Persistence through malicious extensions or runtime modifications&#x20;
* Supply chain compromise through third-party skills&#x20;
* Bypass of governance, policy, or approval controls&#x20;
* Unsafe autonomous behavior and unintended tool chaining&#x20;

**Attack Vector:** Skill Extension&#x20;

**Attack Complexity:** Low&#x20;

**Privileges Required:** Low to None&#x20;

**User Interaction:** Required&#x20;

**Confidentiality Impact:** High&#x20;

**Integrity Impact:** High&#x20;

**Availability Impact:** Medium&#x20;

**Timeline**

**Nov 18, 2024 - OWASP-LLM**: LLM01, LLM02, LLM03&#x20;

**Mar 2025 - OWASP-ASI**: ASI-01, ASI-02, ASI-04, ASI-05, ASI-06&#x20;

**Mar 31, 2026 – OWASP Skills Top 10**: AST01, AST02 &#x20;

**Aug 1, 2024 - EU AI Act**: 1.11, 2.12, 3.13, 4.50&#x20;

**Jan 26, 2023 - NIST AI RMF 1.0**: GOVERN 1.6, GOVERN 3.2, MAP 4.1, MANAGE 2.1, MANAGE 4.1&#x20;

**References**&#x20;

<https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com>

<https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com>

<https://owasp.org/www-project-agentic-skills-top-10/>

<https://artificialintelligenceact.eu/ai-act-explorer/?utm_source=chatgpt.com>

<https://www.nist.gov/itl/ai-risk-management-framework?utm_source=chatgpt.com>

</details>

<details>

<summary>Do not allow skills that exfiltrate data</summary>

AI\_SKILL\_DAT\_SEC\_001

**Violation Summary**

Allowing AI agents to use skills that can exfiltrate data introduces significant confidentiality, integrity, operational, and regulatory risks. Modern agent skills often execute with the same permissions and credentials as the hosting agent, allowing them to access API keys, tokens, SSH credentials, browser data, cloud storage, enterprise documents, memory stores, and internal systems. &#x20;

**Affected Assets**

* LLM&#x20;
* AI Agent&#x20;
* Skills&#x20;

**Technical Details**

Key risks include:&#x20;

* Unauthorized disclosure of sensitive data including PII, credentials, intellectual property, financial records, source code, and regulated data.&#x20;
* Abuse of inherited agent permissions to access systems beyond the user’s intent.&#x20;
* Covert exfiltration through legitimate APIs, retrieval tools, cloud synchronization, or outbound network calls.&#x20;
* Prompt-driven manipulation where instruction files steer agents into leaking secrets.&#x20;
* Regulatory exposure under privacy and AI governance regulations when protected data is leaked.&#x20;
* Loss of trustworthiness and governance due to opaque autonomous behavior and insufficient auditability.&#x20;

**Attack Vector:** Skill Extension&#x20;

**Attack Complexity:** Low&#x20;

**Privileges Required:** Low to None&#x20;

**User Interaction:** Required&#x20;

**Confidentiality Impact:** High&#x20;

**Integrity Impact:** High&#x20;

**Availability Impact:** Medium&#x20;

**Timeline**

* **Nov 18, 2024 - OWASP-LLM**: LLM01, LLM02&#x20;
* **Mar 2025 - OWASP-ASI**: ASI-02, ASI-03, ASI-04&#x20;
* **Mar 31, 2026 – OWASP Skills Top 10**: AST03&#x20;
* **Aug 1, 2024 - EU AI Act**: Articles 9, 10, 15&#x20;

**References**

<https://genai.owasp.org/llm-top-10/?utm_source=chatgpt.com>

<https://genai.owasp.org/initiatives/agentic-security-initiative/?utm_source=chatgpt.com>

<https://owasp.org/www-project-agentic-skills-top-10/>

<https://artificialintelligenceact.eu/ai-act-explorer/?utm_source=chatgpt.com>

</details>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.veedna.com/unifai/policies/unifai-policies.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
