Architecture and Attack Surface
Supply Chain Attacks
Prompt Injection
Malicious Tool Definitions
Data Exfiltration
Known Vulnerabilities
Mitigation Strategies
See Also
References

Security Risks and Dangers of Using OpenClaw

OpenClaw is an open-source AI agent framework that runs locally on user hardware, connecting large language models to messaging apps, local files, shell commands, browsers, and third-party tools for task automation. ¹⁾ While its local-first design and extensibility make it a powerful personal assistant, these same features introduce significant security risks that organizations and individuals must understand before deployment.

Architecture and Attack Surface

OpenClaw grants LLMs system-level access including file I/O, script execution, web automation, and integrations with email, calendars, and smart home devices. ²⁾ This effectively gives AI models eyes, ears, and hands without built-in governance by default, requiring users to manually implement controls like sandboxing. ³⁾

The framework uses a local gateway for control, persistent memory stored in Markdown files, multi-agent routing, a heartbeat scheduler for autonomous operation, and extensibility via community AgentSkills or plugins from repositories like ClawHub. ⁴⁾

Supply Chain Attacks

Users download community-contributed skills (automation scripts) from central repositories like ClawHub, which could be compromised to inject malware, backdoors, or malicious code executed with system privileges. ⁵⁾ As an open-source project with over 200,000 GitHub stars, its dependency on unvetted third-party extensions mirrors broader supply chain vulnerabilities in agentic AI.

Prompt Injection

OpenClaw assembles large prompts from system instructions (AGENTS.md, SOUL.md, TOOLS.md), conversation history, memory, and logs, making it susceptible to injections via messaging channels or external content such as documents, emails, and webpages. ⁶⁾ Malicious inputs can override instructions, tricking the LLM into unauthorized actions like data access or tool misuse, as the framework lacks inherent prompt guards.

Malicious Tool Definitions

Extensible AgentSkills and tool schemas allow over 100 preconfigured functions for shell commands, file management, and browser control. ⁷⁾ Without strict validation, tools could execute harmful scripts such as deleting files or installing payloads, especially in non-sandboxed modes offering full system access.

Data Exfiltration

Direct local access to files, browsers, and integrations enables agents to read sensitive data and send it outbound via API-connected LLMs or chat apps. ⁸⁾ Persistent local storage of memory and preferences in editable Markdown files increases exposure if the gateway is compromised.

Known Vulnerabilities

CrowdStrike has identified OpenClaw's ability to reason over and act on external content as a broad attack surface for security teams. ⁹⁾ The ClawJacked vulnerability demonstrated that malicious websites could hijack locally running OpenClaw instances via WebSocket connections. Additionally, community experience indicates that smaller local models (below 32B parameters) may produce unreliable and potentially unsafe actions. ¹⁰⁾