ethanhouseworth

← back to projects
House Agents Architecture
Four specialized sub-agents quarantine heavy operations from the main context

House Agents

Oct 2025 Infrastructure

Ethan Houseworth + Ultron

𝕏in

95-98% token reduction through specialized sub-agents.

187,000 tokens went in. 5,000 came back. that's not compression, that's quarantine.

when you're working in Claude Code on something complex, your context window fills up fast. search results from 100 files. API docs. npm install logs. build output. all that noise accumulates and makes the AI worse at its actual job. more context doesn't mean better answers, it means diluted attention.

the pattern

house agents are specialized Claude Code sub-agents that run in their own context windows. each one handles a specific category of heavy operations and returns only the condensed result to your main conversation. the heavy lifting happens in isolation. your main context stays clean.

there are four agents. house-research handles file and documentation search, it processed 70,100 tokens and returned 3,246, a 95.4% reduction. house-git does diff and commit analysis, 42,900 tokens in, ~500 back, 98.8% savings. house-bash runs commands and parses output, 20,600 to ~700, 96.6%. house-mcp handles MCP tool configuration, 53,300 to ~540, 99.0%.

why it works

the main conversation uses whatever model you want, Sonnet, Opus, whatever. the sub-agents run on Haiku, which is 67% cheaper and 2x faster. this isn't cutting corners. Haiku performs at 90% of Sonnet's capability for the focused tasks these agents handle: grep, bash parsing, git analysis. you use the expensive model for thinking, the cheap model for looking.

it's the same pattern as hiring specialists. you don't send the CEO to photocopy documents. you send the intern, they bring back a summary, and the CEO makes the decision with clean information.

real numbers

in production, house agents quarantined 187,000 tokens total and added only ~5,000 to the main context. that's not theoretical. that's measured from actual usage. the main conversation stays sharp because it never sees the noise.

▶️ featured in: 800+ hours of Learning Claude Code in 8 minutes by Edmond Yong

then Anthropic shipped the same thing

house agents went live on October 14, 2025. thirteen days later, Anthropic released Claude Code v2.0.28 with sub-agent resumption and dynamic model selection, the exact same pattern baked into the official product. not claiming credit, just noting the timing. the problem was obvious enough that we both arrived at the same solution independently.

what's next

house-vision for image analysis. house-data for database operations. the pattern scales to any domain where the input is noisy and the output should be clean. which is basically everything.

Share LinkedIn

monthly updates on new projects and experiments.