AI Security Threats — Microsoft Defense Map

🪤

Prompt Injection

// Direct & Indirect · Jailbreak · Agent Hijack

External Internal

▾

What It Is

An attacker embeds malicious instructions inside user input or external content (websites, documents, emails) that the AI model processes. The goal is to hijack the model's behavior — bypassing safety rules, leaking data, or triggering unauthorized actions.

Direct injection — User directly crafts malicious prompts.
Indirect injection — Malicious instructions hidden inside content the AI reads (e.g., a webpage summarized by a Copilot agent).

Threat Actor Profile

🌐 ExternalJailbreaking public-facing AI apps, manipulating chatbots, injecting prompts via RAG data sources.

👤 InternalEmployees bypassing enterprise AI guardrails, manipulating AI agents to exfiltrate corporate data.

Microsoft Defense Coverage

Solution	How It Helps	License
Azure AI Content Safety	Prompt Shields detect & block direct and indirect injection in real time before reaching the model	Azure AI Services
Defender for Cloud — AI Threat Protection	Monitors Azure OpenAI & Azure ML for injection patterns; generates security alerts	Defender P2
Microsoft Sentinel	Ingests AI alerts; UEBA correlates anomalous query patterns; custom KQL detection rules	Sentinel
Microsoft Purview DLP	Inspects prompts submitted to Copilot/AI tools for sensitive data being sent outbound	M365 E5 Compliance
Entra ID Conditional Access	Restricts who can access AI-powered apps and from which device/network posture	Entra P1/P2

☣️

Data Poisoning

// Training Data Corruption · Backdoor Injection · RAG Manipulation

External Internal

▾

What It Is

An attacker corrupts the training data (or fine-tuning data) fed into an AI model so it learns incorrect behaviors — creating backdoors, biased outputs, or manipulated classifications. This is a pre-deployment attack with long-lasting production consequences.

Threat Actor Profile

🌐 ExternalPoisoning publicly scraped datasets, supply chain attacks on open-source data, RAG database manipulation.

👤 InternalMalicious data engineers inserting backdoors into labeled datasets or fine-tuning pipelines.

Microsoft Defense Coverage

Solution	How It Helps	License
Purview — Sensitivity Labels	Labels and classifies training datasets; enforces access controls preventing unauthorized modification	M365 E3/E5
Purview IRM	Detects suspicious bulk data modification by insiders in data lake / ML storage environments	M365 E5 Compliance
Defender for Cloud — DevOps Security	Scans ML pipelines (GitHub Actions, ADO) for unauthorized changes; monitors Azure ML data stores	Defender for Cloud
Microsoft Sentinel + UEBA	Detects anomalous write/modify patterns on training datasets in Azure Storage, ADLS, Fabric	Sentinel
Entra ID PIM	Just-in-time access to training data pipelines; limits who can write to ML data stores	Entra P2
Purview Data Map	Tracks data lineage — where training data came from and whether it was altered	Purview

🕵️

Model Theft

// Model Extraction · API Harvesting · IP Theft

External Internal

▾

What It Is

An attacker queries a model thousands/millions of times via its API to reconstruct a functional "shadow model" without authorization. This allows IP theft, internal vulnerability analysis, and further targeted attacks — without ever accessing training data.

Threat Actor Profile

🌐 ExternalCompetitors or nation-state actors systematically harvesting model behavior via public APIs.

👤 InternalEmployees exfiltrating model weights or cloning behavior through authorized API access.

Microsoft Defense Coverage

Solution	How It Helps	License
MCAS / Defender for Cloud Apps	Detects abnormal API call volume/patterns to AI endpoints; triggers anomaly alerts	Defender for Cloud Apps
Azure API Management	Rate limiting, IP throttling, subscription key revocation — chokes extraction at the API layer	Azure APIM
Microsoft Sentinel	Custom detection on Azure OpenAI diagnostic logs for high-volume systematic queries	Sentinel
Entra ID + PIM	Restricts access to model endpoints and Azure ML model registry; JIT for sensitive roles	Entra P2
Purview IRM	Monitors insider behavioral signals for employees making large volumes of model queries	M365 E5 Compliance
Defender for Cloud — AI Security Posture	Identifies publicly exposed AI model endpoints without authentication	Defender for Cloud

🔓

Privacy Leakage

// Memorization · PII Exposure · GDPR/HIPAA Risk

External Internal

▾

What It Is

AI models — especially LLMs — can inadvertently memorize and reproduce fragments of their training data, including PII, credentials, proprietary documents, or medical records. This is a post-deployment risk that can result in GDPR/HIPAA violations and reputational damage.

Threat Actor Profile

🌐 ExternalCrafting adversarial prompts to extract memorized sensitive data from public-facing models.

👤 InternalEmployees using Copilot to inadvertently surface sensitive HR/legal/financial data they shouldn't access.

Microsoft Defense Coverage

Solution	How It Helps	License
Purview — AI Hub	Visibility into what sensitive data users submit to and receive from AI apps; detects privacy leakage in outputs	M365 E5 Compliance
Purview DLP	Detects and blocks responses containing SSNs, credit cards, health info from AI output channels	M365 E5 Compliance
Purview Sensitivity Labels	Labels flow into Copilot — won't surface Highly Confidential data to unauthorized users	M365 E3/E5
Azure AI Content Safety	Output filtering — strips or blocks PII patterns from model responses at the API layer	Azure AI Services
Entra ID — Access Reviews	Ensures only authorized users have access to AI apps that can surface sensitive data	Entra P2
Microsoft Priva	Privacy risk management; identifies over-exposure of personal data in AI training sets or outputs	Microsoft Priva

📉

Model Drift

// Distribution Shift · Performance Degradation · Governance Gap

External Internal

▾

What It Is

Over time, the statistical distribution of real-world data diverges from training data, causing model predictions to degrade or become biased. While not always a direct attack, adversaries can deliberately trigger drift by feeding adversarial inputs continuously — a slow-burn attack. Shadow AI models running without governance are especially vulnerable.

Threat Actor Profile

🌐 ExternalSlowly poisoning production inference data to shift model behavior over time (concept drift injection).

👤 InternalNegligent ML ops ignoring drift signals; shadow AI models running without monitoring or governance.

Microsoft Defense Coverage

Solution	How It Helps	License
Azure Machine Learning — Data Drift Monitor	Tracks statistical drift between baseline and production datasets; triggers alerts when thresholds exceeded	Azure ML
Defender for Cloud — AI Security Posture	Flags unmonitored AI models and pipelines lacking observability controls	Defender for Cloud
Microsoft Sentinel	Ingests Azure ML monitoring logs; custom workbooks to visualize model performance degradation over time	Sentinel
Purview — AI Hub	Governance visibility into all AI models deployed in the tenant, including unmanaged/shadow models	M365 E5 Compliance
Azure Monitor + Log Analytics	Captures inference telemetry; anomaly detection on output distribution changes	Azure

🎭

Adversarial Examples

// Input Perturbation · Evasion · Misclassification

External

▾

What It Is

Specially crafted inputs — slightly perturbed images, audio, or text — cause a model to misclassify with high confidence. A stop sign with stickers fools an autonomous vehicle. A slightly modified PDF evades AI-based malware detection. These attacks exploit the mathematical fragility of neural networks.

Threat Actor Profile

🌐 ExternalAttackers crafting adversarial inputs to evade AI-powered security controls (malware detection, fraud scoring, content moderation).

Microsoft Defense Coverage

Solution	How It Helps	License
Azure AI Content Safety	Input robustness checks; adversarial input filtering before reaching models	Azure AI Services
Defender for Cloud	AI Threat Protection flags unusual inference patterns that may indicate adversarial probing	Defender P2
Azure ML — Responsible AI Dashboard	Adversarial robustness evaluation during model development and testing phases	Azure ML

🔬

Model Inversion

// Training Data Reconstruction · Output Analysis

External

▾

What It Is

By querying the model repeatedly and analyzing confidence scores/probabilities in its outputs, an attacker reconstructs approximate samples of the original training data — effectively reversing the training process to recover sensitive individual records.

Threat Actor Profile

🌐 ExternalResearchers or attackers exploiting model confidence outputs to reconstruct faces, medical records, or proprietary data from API responses.

Microsoft Defense Coverage

Solution	How It Helps	License
Azure APIM	Rate limiting and output sanitization — suppresses raw probability scores from being exposed via API responses	Azure APIM
MCAS / Defender for Cloud Apps	Anomaly detection on API access patterns consistent with inversion attack behavior	Defender for Cloud Apps
Microsoft Priva	Privacy risk management; differential privacy tooling integration in Azure ML training pipelines	Microsoft Priva

🔍

Membership Inference

// GDPR Right-to-Erasure · Training Data Detection

External

▾

What It Is

An attacker queries the model to determine whether a specific individual's data was used in training. This has direct GDPR implications (right to erasure — was my data used?). By comparing model confidence on known vs. unknown records, attackers can confirm membership in the training set.

Threat Actor Profile

🌐 ExternalIndividuals or adversaries confirming whether specific personal data appeared in a model's training set — used for legal pressure, extortion, or targeted privacy attacks.

Microsoft Defense Coverage

Solution	How It Helps	License
Microsoft Priva	Privacy risk management and GDPR compliance tooling; data subject request management	Microsoft Priva
Purview DLP	Blocks model outputs that could confirm membership in sensitive data categories	M365 E5 Compliance
Azure ML — Differential Privacy	Applies statistical noise during training to make membership inference mathematically infeasible	Azure ML

🔗

AI Supply Chain Attack

// Compromised Models · Malicious Libraries · Pipeline Tampering

External Internal

▾

What It Is

Compromising a pre-trained model, open-source ML library (e.g., a malicious Hugging Face model), or fine-tuning dataset before it enters the organization's environment. This is the SolarWinds of AI — trust in third-party components weaponized against the consumer.

Threat Actor Profile

🌐 ExternalNation-state actors or criminals poisoning open-source models, libraries, or datasets on public repositories.

👤 InternalNegligent intake of unverified pre-trained models or datasets without provenance validation.

Microsoft Defense Coverage

Solution	How It Helps	License
Defender for DevOps	Scans ML pipeline code and dependencies for tampering; integrates with GitHub/ADO	Defender for Cloud
Defender for Cloud — Container Scanning	Scans model containers and base images for malicious components before deployment	Defender for Cloud
GitHub Advanced Security	Dependency scanning and secret detection in ML pipeline code repositories	GHAS
Purview Data Map / Lineage	Tracks provenance of models and datasets — where they came from and whether they were altered	Purview

🤖

Insecure Agentic / Plugin Execution

// LLM Agent Hijack · Tool Abuse · Autonomous Action

External Internal

▾

What It Is

AI agents with tool-use capabilities (web browsing, code execution, email sending) can be hijacked via indirect prompt injection to execute malicious actions — sending phishing emails, deleting files, or exfiltrating data — all under the guise of the authorized user. As Copilot agents proliferate, this surface area grows dramatically.

Threat Actor Profile

🌐 ExternalEmbedding injection payloads in websites or emails that AI agents browse, causing autonomous malicious actions.

👤 InternalEmployees crafting prompts to make AI agents perform unauthorized actions on corporate systems.

Microsoft Defense Coverage

Solution	How It Helps	License
Microsoft Copilot for Security	Detection and investigation of agent-based attack chains across Defender XDR signals	Copilot for Security
Purview — AI Hub	Agent activity visibility — tracks what actions AI agents are taking on behalf of users	M365 E5 Compliance
Entra ID — OAuth Scope Restriction	Restricts OAuth permission scopes granted to AI plugins, limiting blast radius of hijacked agents	Entra P1/P2
MCAS — Session Controls	Real-time session monitoring on agent-connected SaaS apps; blocks abnormal automated actions	Defender for Cloud Apps
Azure AI Content Safety	Indirect prompt injection detection specifically for agentic workflows and RAG pipelines	Azure AI Services

👤

Shadow AI / Unauthorized AI Usage

// Unsanctioned Tools · Data Governance Bypass · BYOAI

Internal

▾

What It Is

Employees using unauthorized AI tools (consumer ChatGPT, personal Claude, free Gemini, etc.) and pasting sensitive corporate data into them — bypassing all enterprise data governance, DLP, and compliance controls. Often well-intentioned but highly risky; the data leaves the tenant boundary entirely.

Threat Actor Profile

👤 Internal (Negligent / Insider)Employees, contractors, and partners using personal AI tools with corporate data — often without malicious intent but creating significant data loss and compliance exposure.

Microsoft Defense Coverage

Solution	How It Helps	License
MCAS / Defender for Cloud Apps	Blocks or audits access to unsanctioned AI SaaS apps; cloud app catalog flags AI tools by risk score	Defender for Cloud Apps
Purview — AI Hub	Shows data submitted to external AI services; detects sensitive data flowing outside approved channels	M365 E5 Compliance
Endpoint DLP	Blocks copy-paste of sensitive data to non-approved applications and browsers on managed endpoints	M365 E5 Compliance
Entra ID — Conditional Access	Restricts non-managed/non-compliant devices from accessing corporate data, limiting BYOAI on personal devices	Entra P1/P2
Purview IRM — Adaptive Protection	Elevates DLP restrictions for users flagged as high risk based on AI tool usage patterns	M365 E5 Compliance

AI Attack Vectors
& Microsoft Defenses

Prompt Injection

Data Poisoning

Model Theft

Privacy Leakage

Model Drift

Adversarial Examples

Model Inversion

Membership Inference

AI Supply Chain Attack

Insecure Agentic / Plugin Execution

Shadow AI / Unauthorized AI Usage

Microsoft Solution Coverage Map

Licensing Reality Check