Claude Skills — Security Guide

CLAUDE.md is loaded on every conversation. If it’s compromised, every interaction is compromised.

Researchers at LayerX Security disclosed a 10/10 CVSS score zero-click RCE in Claude Desktop — exploited through nothing more than a malicious CLAUDE.md file that a user opened in their project. No click required. The file was read, the instructions were executed, and the machine was owned. This is not a hypothetical threat class.

CLAUDE.md sits at the root of every Claude Code project. It instructs the agent what tools to use, what constraints to respect, and how to behave across every session. Compromise it once and you own the agent’s behavior persistently — even after the original attack vector is removed.

Firmis scans CLAUDE.md files, .claude/ settings, and skill definition code across 209 detection rules covering prompt injection, hardcoded credentials, agent memory persistence attacks, and tool poisoning.

What Firmis detects

Threat Category	Rules	Coverage	Example Finding
Tool Poisoning	10	High	Hidden Unicode characters in tool descriptions
Prompt Injection	13	High	`ignore all previous instructions` in CLAUDE.md
Secret Detection	60	High	Hardcoded Anthropic API key in `.claude/settings.json`
Agent Memory Poisoning	7	High	Code writing to `.claude/` config directory
Access Control	3	High	API key passed as URL query parameter
Insecure Config	3	Medium	`DEBUG=true` in Claude skill config
Permission Overgrant	7	Medium	Wildcard tool permissions in MCP config
Supply Chain	8	Medium	Known malicious npm package in skill dependencies
Data Exfiltration	12	Medium	Bulk file read followed by HTTP POST

Files Firmis scans

File Pattern	What It Contains
`CLAUDE.md`	Project-level instructions, tool use rules, persistent agent behavior
`.claude/settings.json`	Claude Code agent settings and feature flags
`.claude/memory/*.md`	Cross-session persistent memory injected into every prompt
`src/*/.ts`, `src/*/.js`	Skill handler code making tool calls and network requests
`package.json`	Dependency declarations, install scripts

Scan Claude Skills

npx firmis scan --platform claude

Common findings and remediation

Prompt injection in CLAUDE.md

CRITICAL  prompt-001  Instruction Override in Tool Description
          CLAUDE.md:23
          Pattern: "ignore all previous instructions"

What it means. An attacker has inserted instruction-override text into your CLAUDE.md. The mechanism is straightforward: Claude reads this file at startup before processing any user message. The injected phrase attempts to displace your legitimate instructions and redirect agent behavior from that point forward — silently, in every subsequent session.

The attack surface is wider than it looks. CLAUDE.md can be poisoned through a malicious npm postinstall script, a compromised project template, a pull request from an external contributor, or fetched content that was merged into the file. You may not notice until the agent starts behaving strangely.

How to fix. Remove the injection pattern from CLAUDE.md immediately. Audit the file’s git history to identify when it was introduced and by which commit. Treat CLAUDE.md as a security boundary equivalent to your application’s authentication configuration: it should contain only your explicit, manually reviewed instructions. Never merge external content into it without reading every line.

Hardcoded API key in skill code

CRITICAL  sd-014  Anthropic API Key
          src/tools/llm-call.ts:8
          Pattern: sk-ant-...

What it means. A real Anthropic API key is committed directly in source code. Every person with repository access — current contributors, future forks, CI runners, and anyone who clones the repo — can extract and use it. If this is a public repository, the key has likely already been scraped by automated credential harvesters that index GitHub within minutes of a push.

How to fix. Remove the key immediately and rotate it in your Anthropic console before doing anything else — rotation is more urgent than cleanup. Load secrets via environment variables (process.env.ANTHROPIC_API_KEY) or a secrets manager. Add a pre-commit hook or CI secret scanning step to prevent recurrence.

Agent memory poisoning via config write

HIGH  mem-003  Agent Config File Modification
      src/tools/setup.ts:44
      Pattern: writeFile(...'.claude/')

What it means. A skill handler is writing to the .claude/ configuration directory at runtime. This is a persistence attack: the write happens once, but its effects carry forward into every subsequent Claude session. A malicious skill can use this vector to inject persistent instructions, silently register rogue MCP servers, or modify tool permissions — and the modifications survive even after the skill itself is uninstalled.

How to fix. Skills must not modify agent platform configuration files under any circumstances. Configuration changes must be explicit, user-initiated actions — not side effects of running a skill. Remove the write operation entirely. If a skill genuinely needs to configure agent behavior, document the required manual steps and let the user perform them.