1 Key Risks
The most critical security risks an operator inherits when deploying this agent in its documented default configuration. Dash presents moderate input risk from unscanned connected app content, minimal blast from read-only operation with sandboxed code execution, and vendor-documented defenses without DLP or AI-specific monitoring.
2 AIRQ Scores
The four headline scores quantify how exposed the agent is, how damaging a successful attack would be, and how much the agent’s own controls reduce that risk. The composite score captures how trifecta-complete exposure, constrained blast from read-only defaults, and moderate vendor defenses combine for this enterprise search agent.
Dropbox Dash lands in the Strong Lightweights quadrant near the boundary of moderate attack exposure; expanding into autonomous write actions or unrestricted network access could push the classification into a higher-risk quadrant. Blast stays contained by read-only defaults and vendor-documented defenses.
Each axis measures a distinct dimension of agent risk: attack surface and blast radius out of 10, defense controls out of 15, and the AIRQ composite reflecting all three.
| Metric | Score | Comments |
|---|---|---|
| AIRQ Score | 3.42 | Moderate capability-to-risk ratio reflecting a primarily read-only agent with vendor-documented defenses but trifecta-complete exposure patterns. |
| Blast Radius | 3 / 10 | Low blast driven by sandboxed code execution, read-only connected app access, and no deployment or autonomous action capabilities. |
| Attack Surface | 4.8 / 10 | Trifecta floor applied; external data ingestion from connected apps is the dominant driver with all three trifecta conditions met. |
| Defense Controls | 9 / 15 | Lakera Guard covers input and output filtering with Protect and Control for governance; no DLP or AI-specific anomaly detection documented. |
3 Attack Surface
Attack surfaces are the entry points and interaction patterns through which adversarial input can reach the agent’s reasoning loop and steer its behavior. The dominant exposures are unfiltered document content from connected SaaS applications entering the RAG pipeline, multiple input channels across web, mobile, and MCP endpoints, and a sandboxed code interpreter.
One surface scores above the baseline at external data ingestion; the remaining nine reflect moderate or low exposure from validated input channels and controlled execution.
Each row maps a named attack surface to its scored exposure level and a brief comment citing the architectural evidence that grounds the assessment.
| Surface | Score | Comments |
|---|---|---|
| User Input | 2 / 4 | Web UI, browser extension with all-sites data access, desktop, mobile, and MCP endpoints accept prompts filtered by Lakera Guard with no independent adversarial testing [3]. |
| External Data | 3 / 4 | Over thirty connected SaaS applications ingest external content into the RAG pipeline with permission-aware but content-unvalidated retrieval; class-level RAG poisoning research demonstrates the attack pattern [2][7]. |
| Memory | 1 / 4 | Session-scoped chat context with no cross-session persistent memory; connected app indexes are platform-maintained, not reasoning-loop-managed [8]. |
| Reasoning | 2 / 4 | Multi-step RAG reasoning decomposes queries into retrieval and synthesis steps constrained to the declared task scope with visible chain-of-thought [11]. |
| Planning | 2 / 4 | AI agents compile query plans into a domain-specific language with user-visible results and no autonomous background execution [11]. |
| Tool Execution | 2 / 4 | Minimal Python interpreter with static analysis validation and restricted functionality executes agent-generated code in a sandboxed environment [11]. |
| Orchestration | 2 / 4 | Multi-step task execution within a single user-supervised session without background processes, scheduling, or daemon operation [11]. |
| Inter-Agent | 1 / 4 | Dash MCP server exposes read-only search and document retrieval tools through a vendor-managed protocol with OAuth identity verification [9]. |
| Output Processing | 2 / 4 | Lakera Guard screens outputs for harmful content with citation-linked answers rendered in the vendor-controlled SaaS web interface [10]. |
| Configuration | 2 / 4 | Admin-managed connector configuration through Protect and Control with policy-based governance; the broader Dropbox authentication stack has had SAML library vulnerabilities [1][12]. |
The Lethal Trifecta is triggered when an agent processes untrusted content, accesses private data, and communicates externally in the same session — the three conditions that turn an isolated prompt injection into full-chain exfiltration. Dropbox Dash ingests externally authored documents from connected applications, reads private enterprise data across email and files, and sends queries and content to external API endpoints by default.
Dropbox Dash exhibits all three of these conditions in its documented default configuration:
- Untrusted input — Connected app connectors ingest email, Slack messages, and shared documents authored by external parties into the RAG retrieval pipeline [7].
- Sensitive data — The agent reads private enterprise data including email content, files, customer records, and source code across connected applications [7].
- External egress — User queries and retrieved content are sent to the OpenAI API for LLM processing, and search results are exposed via the MCP server [8].
4 Blast Radius
The blast radius is what an attacker who controls the agent can reach — which systems they touch, which credentials they read, and which actions they take without operator approval. A compromised Dash instance reaches a sandboxed code interpreter, read-only connected app content through OAuth-scoped connectors, and domain-restricted network endpoints with no deployment capabilities.
Higher scores would indicate unrestricted execution, credential exposure, or autonomous action capability; this agent stays low across all factors.
Each row ties a blast factor to the specific capability boundary the agent holds in its default configuration with evidence from vendor documentation.
| Factor | Score | Comments |
|---|---|---|
| Code execution | 1 / 4 | Sandboxed interpreter runs agent-generated Python with restricted standard library access and pre-execution code analysis; no full shell available [11]. |
| File system access | 1 / 4 | Read-only access to connected app content through permission-aware retrieval with no arbitrary file write capability on host systems [7]. |
| Network access | 2 / 4 | Domain-restricted outbound requests to the OpenAI API and connected app endpoints; no unrestricted outbound HTTP capability documented [8]. |
| Credential access | 2 / 4 | Platform-managed OAuth tokens for connected applications with AES-256 encryption at rest; tokens are not directly exposed to user sessions [6]. |
| Autonomous action | 1 / 4 | All interactions require user initiation with no background tasks, scheduled operations, or autonomous workflow triggers [8]. |
| Deployment access | 0 / 4 | No deployment, CI/CD pipeline, or infrastructure modification capabilities documented in the vendor architecture overview [6]. |
5 Defense Controls
Defense controls are what the agent’s own architecture does to detect, contain, and report attacks before they reach the operator’s systems. The vendor publishes Lakera Guard for input and output filtering and Protect and Control for governance, while DLP and AI-specific anomaly detection remain absent from the documented default posture.
Higher scores reflect stronger vendor-implemented safeguards; the inverted color scale highlights the gap between documented and missing controls.
Each component is scored on what the vendor implements by default versus what requires operator-managed configuration or is absent entirely.
| Component | Score | Comments |
|---|---|---|
| Input Guardrails | 2 / 3 | Lakera Guard provides ML-based prompt injection and jailbreak detection across all Dash LLM interactions; not independently red-teamed [4][5][10]. |
| Execution Isolation | 2 / 3 | Cloud-hosted on AWS with AES-256 encryption and the minimal Python interpreter restricting available functionality through static analysis [6][11]. |
| Action Controls | 2 / 3 | Protect and Control provides admin-managed permission governance with policy-based automated remediation and no single-step bypass documented [12]. |
| Output Guardrails | 1 / 3 | Lakera Guard screens outputs for content safety violations at the moderation layer only; no DLP, exfiltration channel blocking, or credential redaction is documented [10]. |
| Monitoring | 2 / 3 | Action history logs permission changes under SOC 2 Type II and ISO 27001 certified organizational controls; these certifications cover process compliance, not AI behavioral monitoring [13]. |
6 Hardening Tips
Concrete actions an operator can take to reduce the risks reported above, grouped by which defense control each tip strengthens. Operators should prioritize breaking the trifecta by adding content-level scanning on the retrieval channel and deploying DLP on the output pipeline.
Input Guardrails
Input guardrails intercept adversarial content before it reaches the reasoning loop.
- Policy Require secondary prompt injection review on all retrieved document content before it enters the RAG synthesis step.
- Configuration Restrict browser extension permissions to a curated domain allowlist rather than the default all-sites access.
- Engineering Deploy a content classifier between connected app connectors and the retrieval index to flag adversarial instruction patterns.
Execution Isolation
Execution isolation contains what a compromised agent can do on the host.
- Policy Require periodic third-party security audits of the minimal Python interpreter boundary and its static analysis validation.
- Configuration Restrict the interpreter's available standard library modules to the minimum required set for documented agent tasks.
- Engineering Instrument the interpreter with execution telemetry logging every system call and resource access attempt.
Action Controls
Action controls govern which tools and actions the agent can invoke autonomously.
- Policy Require explicit admin approval before enabling MCP server access for external AI agent connections.
- Configuration Configure Protect and Control policies to enforce least-privilege connector access with mandatory periodic reviews.
- Engineering Deploy request-rate limiting on the Dash MCP server search endpoint to detect and throttle bulk data extraction attempts.
Output Guardrails
Output guardrails inspect what the agent sends to other systems and users.
- Policy Require deployment of a DLP solution on the output pipeline to detect and block sensitive data in generated answers.
- Configuration Restrict citation URLs in generated answers to a curated allowlist of connected application domains.
- Engineering Deploy output tokenization or redaction rules for PII patterns before answers are rendered or returned via MCP.
Monitoring
Monitoring captures what the agent did and surfaces anomalies for review.
- Policy Forward Dash action history events to the organization SIEM for correlation with broader security monitoring.
- Configuration Enable alerting on anomalous query patterns including high-volume retrieval bursts and repeated sensitive content access.
- Engineering Instrument the MCP server with per-request audit logging capturing external agent identity, query content, and result volume.
7 References
The evidence base behind every score and finding in the profile, grouped by source type so the reader can verify any claim. Numbers in brackets throughout the report (e.g. [7, 13]) refer to entries below, listed in citation order.
Selected Vulnerabilities
- CVE-2026-28809 esaml XXE NVD CVE 9.1 -- XXE in Dropbox-maintained esaml SAML library
- PoisonedRAG knowledge base poisoning USENIX Security 2025 -- RAG poisoning yields 90%+ attack success with 5 docs in 1M corpus
Selected Research
- Dropbox prompt injection via control characters Dropbox security research on control-char injection in OpenAI ChatGPT models
- Dropbox repeated-token divergence attacks Dropbox research on training data extraction from GPT-3.5 and GPT-4 via repeated tokens
- Lakera case study on Dropbox GenAI security Third-party case study on Lakera Guard deployment for Dropbox prompt injection defense
Vendor Documentation
- Dash Security Architecture Overview Vendor whitepaper on data protection and encryption and compliance posture
- Dash connected apps Vendor product page listing 30+ enterprise SaaS connectors for unified search
- Dropbox AI Transparency Center -- Dash Vendor AI transparency entry documenting model data flows and NIST/OWASP alignment
- Dash MCP server documentation Vendor docs on MCP server OAuth setup and available read-only tools
- Lakera Guard integration for Dash LLMs Vendor engineering blog on Lakera Guard Docker deployment for LLM input/output filtering
- Dash RAG and AI agents architecture Vendor engineering blog on RAG pipeline and minimal Python interpreter and agent architecture
Other Sources
- Dash Protect and Control admin Admin documentation covers permission management, policy enforcement, and action history for the Protect and Control suite.
- Dropbox bug bounty on Intigriti Dropbox runs an active bug bounty program on Intigriti covering products including Dash, which informs the external vulnerability reporting posture.