MCP vs Skills vs Subagents: When Each One Wins

Member-only story

Stop installing MCP servers for everything. After shipping all three in production, here’s the decision framework that saved 60k context tokens.

9 min read

Just now

Press enter or click to view image in full size

Three primitives, three sizes of overhead. Created with Nano Banana Pro.

I had 12 MCP servers installed. My Claude Code session was eating 67k tokens before I typed a single prompt. That’s 33% of my context window gone, just to register tools I touched maybe twice a week.

So I rebuilt my setup from scratch. Most of what I had as MCP servers should have been skills. A few should have been subagents. And only three actually needed to be MCPs. The result: 14% startup overhead, sharper tool selection, and zero “Claude forgot what we were doing” moments mid-task.

This post is the decision framework I wish someone had handed me six months ago.

· Why this matters now
· What does each one actually do?
· Skills are markdown runbooks
· Subagents are isolated Claude sessions
· MCP servers are live sockets to other systems
· How do you decide which to reach for?
· When should you build an MCP server instead?
· How do subagents actually save context?
· What does a production setup look like?
· Reality check: what still breaks
· FAQ
∘ When should I build an MCP server instead of a…

MCP vs Skills vs Subagents: When Each One Wins

Stop installing MCP servers for everything. After shipping all three in production, here’s the decision framework that saved 60k context tokens.

Table of Contents