MCP security guide · reviewed June 20, 2026
Scanning vs sandboxing vs manual review: how to actually vet an MCP server
Short answer: they're not competitors — they catch different things. Scanning tells you what a server does (fast, before install). Sandboxing limits what it can do to you if it misbehaves (but doesn't assess trust). Manual review catches the novel stuff a scanner misses (but is slow). The right move is to layer them — and use the cheap, instant one first to decide where the expensive ones are worth your time.
What each actually catches — and misses
| Approach | Catches | Misses | Cost |
|---|---|---|---|
| Scanning (static + AI) | Shell exec, credential/file reads, network egress, download-and-run, install scripts, prompt injection in tool text — with the exact file & line. | Novel or heavily obfuscated logic; intent. It flags patterns, not motives. | Seconds, free, repeatable. |
| Sandboxing (container, no-net, least-priv) | Limits blast radius — contains a server that does turn out hostile at runtime. | Tells you nothing about whether to trust it; breaks servers that legitimately need the access you removed. | Setup + ongoing ops; can defeat the server's purpose. |
| Manual review | Everything, in principle — including clever, novel attacks a scanner can't pattern-match. | Human time and attention; doesn't scale; easy to skip under deadline. | Slow, expensive, inconsistent. |
How to combine them (in order)
- Scan first. It's instant and free, so there's no reason not to. The grade sets your effort budget for the next two steps.
- Let the grade route your attention. Clean on a simple server is usually enough. Medium/high points you at the exact lines worth a human read — that's where manual review pays off, instead of reading the whole repo.
- Sandbox what you run regularly. For servers you depend on, run them with least privilege and no network-by-default to cap the downside — then re-scan on every version change, because approval is an event, not a permanent state.
Run it from the web, or inside Claude Code so your agent scans a server before it installs it:
claude mcp add mcpvet -- npx -y github:volohq-info/mcpvet
See also: Is that MCP server safe? — the risks + a manual checklist · Browse scanned servers.
FAQ
Is sandboxing enough on its own?
It limits damage but doesn't assess trust, and it breaks servers that need real access. Pair it with a scan.
Can a scanner replace a manual review?
No — it makes the cheap pass instant and tells you where a manual review is worth it. It can miss novel/obfuscated logic.
Fastest safe way to vet a server?
Scan first; let the grade decide how much manual review and sandboxing it's worth.
MCPVet is an automated heuristic aid, not a human security audit or a guarantee. A clean grade doesn't prove a server is safe; always review code and instructions you don't trust. Guide reviewed June 20, 2026.